department of mathematics and computer science 1212 1 6BV04 Screening Designs.

Post on 20-Dec-2015

213 views 1 download

Transcript of department of mathematics and computer science 1212 1 6BV04 Screening Designs.

department of mathematics and computer science

1

6BV04

Screening Designs

department of mathematics and computer science

2

Contents• regression analysis and effects• 2p-experiments• blocks• 2p-k-experiments

(fractional factorial experiments)• software• literature

department of mathematics and computer science

3

Three factors: exampleThree factors: example

Response: deviation filling height bottles

Factors: carbon dioxide level (%) Apressure (psi) Bspeed (bottles/min) C

department of mathematics and computer science

4

Effects

How do we determine whether an individual factor is of importance?

Measure the outcome at 2 different settings of that factor.

Scale the settings such that they become the values +1 and -1.

department of mathematics and computer science

5

setting factor A

measurement

-1 +1

department of mathematics and computer science

6

setting factor A

measurement

-1 +1

department of mathematics and computer science

7

setting factor A

measurement

-1 +1

effect

department of mathematics and computer science

8

setting factor A

measurement

-1 +1

effect

N.B. effect = 2 * slope

slope

department of mathematics and computer science

9

setting factor A

measurement

-1 +1

50

35Effect factor A = 50 – 35 = 15

department of mathematics and computer science

10

More factorsWe denote factors with capitals: A, B,…

Each factor only attains two settings: -1 and +1

The joint settings of all factors in one measurement is called a level combination.

department of mathematics and computer science

11

More factors

A B

-1 -1

-1 1

1 -1

1 1

LevelCombination

department of mathematics and computer science

12

NotationA level combination consists of small letters. The small letters denote which factors are set at +1; the letters that do not appear are set at -1.

Example: ac means: A and C at 1, the remaining factors at -1

N.B. (1) means that all factors are set at -1.

department of mathematics and computer science

13

An experiment consists of performing measurements at different level combinations.A run is a measurement at one level combination.

Suppose that there are 2 factors, A and B.

We perform 4 measurements with the following settings:• A -1 and B -1 (short: (1) )• A +1 and B -1 (short: a )

• A -1 and B +1 (short: b )• A +1 and B +1 (short: ab )

department of mathematics and computer science

14

A 22 Experiment with 4 runs

A B yield

(1) -1 -1

b -1 1

a 1 -1

ab 1 1

department of mathematics and computer science

15

Note:Note:

CAPITALS for factors and effects

small letters for level combinations ( = settings of the experiments)

(A, BC, CDEF)

(a, bc, cde, (1))

department of mathematics and computer science

16

Graphical display

A

B

-1 +1

-1

+1

a

ab

(1)

b

department of mathematics and computer science

17

B

A-1 +1

-1

+1

50

60

35

40

department of mathematics and computer science

18

B

A-1 +1

-1

+1

50

60

35

40

2 estimates for effect A:

department of mathematics and computer science

19

B

A-1 +1

-1

+1

50

60

35

40

2 estimates for effect A: 50 - 35 = 15

department of mathematics and computer science

20

B

A-1 +1

-1

+1

50

60

35

40

2 estimates for effect A:

60 - 40 = 20

50 - 35 = 15

department of mathematics and computer science

21

B

A-1 +1

-1

+1

50

60

35

40

2 estimates for effect A:

60 - 40 = 20

50 - 35 = 15

Which estimate is superior?

department of mathematics and computer science

22

B

A-1 +1

-1

+1

50

60

35

40

2 estimates for effect A:

60 - 40 = 20

50 - 35 = 15

Combine both estimates: ½(50-35) + ½(60-40) = 17.5

department of mathematics and computer science

23

B

A-1 +1

-1

+1

50

60

35

40

In the same way we estimate the effect B(note that all 4 measurements are used!):

½(40-35) ½(60-50)+ = 7.5

department of mathematics and computer science

24

B

A-1 +1

-1

+1

50

60

35

40

The interaction effect AB is the difference between the estimates for the effect A:

½(60-40) ½(50-35)- = 2.5

department of mathematics and computer science

25

Interaction effects

Cross terms in linear regression models cause interaction effects:

Y = 3 + 2 xA + 4 xB + 7 xA xB

xA xA +1 YY + 2 + 7 xB,

so increase depends on xB. Likewise for xB xB+1

This explains the notation AB .

department of mathematics and computer science

26

No interaction

Factor A

Outp

ut

low high

B low

B high

20

50

55

25

department of mathematics and computer science

27

Interaction I

Factor A

Outp

ut

low high

B low

B high

20

5055

45

department of mathematics and computer science

28

Interaction II

Factor A

Outp

ut

low high

B low55 50

B high

20

45

department of mathematics and computer science

29

Interaction III

Factor A

Outp

ut

low high

B low

55

20

B high

20

45

department of mathematics and computer science

30

Trick to Compute Effects

A B yield

(1) -1 -1 35

b -1 1 40

a 1 -1 50

ab 1 1 60

(coded)measurement settings

department of mathematics and computer science

31

A B yield

(1) -1 -1 35

b -1 1 40

a 1 -1 50

ab 1 1 60

Effect estimates

Trick to Compute Effects

department of mathematics and computer science

32

A B yield

(1) -1 -1 35

b -1 1 40

a 1 -1 50

ab 1 1 60

Effect estimates

Effect A = ½(-35 - 40 + 50 + 60) = 17.5Effect B = ½(-35 + 40 – 50 + 60) = 7.5

Trick to Compute Effects

department of mathematics and computer science

33

A B AB yield

(1) -1 -1 ? 35

b -1 1 ? 40

a 1 -1 ? 50

ab 1 1 ? 60

Trick to Compute Effects

Effect AB = ½(60-40) - ½(50-35) = 2.5

department of mathematics and computer science

34

A B AB yield

(1) -1 -1 1 35

b -1 1 -1 40

a 1 -1 -1 50

ab 1 1 1 60

Trick to Compute Effects

Effect AB = ½(60-40) - ½(50-35) = 2.5

× =× =× =× =

AB equals the product of the columns A and B

department of mathematics and computer science

35

I A B AB yield

(1) + - - + 35

b + - + - 40

a + + - - 50

ab + + + + 60

Trick to Compute Effects

Computational rules: I×A = A, I×B = B, A×B=AB etc.This holds true in general (i.e., also for more factors).

department of mathematics and computer science

36

I A B AB C AC BC ABC(1)a

++

-+

--

+-

--

+-

++

-+

bab

++

-+

++

-+

--

+-

--

+-

cac

++

-+

--

+-

++

-+

--

+-

bcabc

++

-+

++

-+

++

-+

++

-+

3 Factors: a 23 Design

department of mathematics and computer science

37

3 Factors: a 23 Design

A B C Yield

(1) - - - 5

a + - - 2

b - + - 7

ab + + - 1

c - - + 7

ac + - + 6

bc - + + 9

abc + + + 7

department of mathematics and computer science

38

(1)=5 a=2

ab=1b=7

ac=6

abc=7bc=9

c=7

effect A =

¼(+16-28)=-3

A

B

C

I A B AB C AC BC ABC(1)a

++

-+

--

+-

--

+-

++

-+

bab

++

-+

++

-+

--

+-

--

+-

cac

++

-+

--

+-

++

-+

--

+-

bcabc

++

-+

++

-+

++

-+

++

-+

scheme 23 design

department of mathematics and computer science

39

effect AB =

¼(+20-24)=-1

I A B AB C AC BC ABC(1)a

++

-+

--

+-

--

+-

++

-+

bab

++

-+

++

-+

--

+-

--

+-

cac

++

-+

--

+-

++

-+

--

+-

bcabc

++

-+

++

-+

++

-+

++

-+

scheme 23 design

(1)=5 a=2

ab=1b=7

ac=6

abc=7bc=9

c=7

A

B

C

department of mathematics and computer science

40

Back to 2 factors – Blocking

I A B AB(1) + - - +b + - + -a + + - -

ab + + + +

Suppose that we cannot perform all measurements at the same day. We are not interested in the difference between 2 days, but we must take the effect of this into account. How do we accomplish that?

day 1

day 2

department of mathematics and computer science

41

Back to 2 factors – Blocking

I A B AB day(1) + - - + 1b + - + - 1a + + - - 2

ab + + + + 2

Suppose that we cannot perform all measurements at the same day. We are not interested in the difference between 2 days, but we must take the effect of this into account. How do we accomplish that?

“hidden”blockeffect

department of mathematics and computer science

42

Back to 2 factors – Blocking

I A B AB day(1) + - - + -b + - + - -a + + - - +

ab + + + + +

We note that the columns A and day are the same.

Consequence: the effect of A and the day effect cannot be distinguished. This is called confounding or aliasing).

department of mathematics and computer science

43

Back to 2 factors – Blocking

I A B AB day(1) + - - + ?b + - + - ?a + + - - ?

ab + + + + ?

A general guide-line is to confound the day effect with an interaction of highest possible order. How can we accomplish that here?

department of mathematics and computer science

44

Back to 2 factors – Blocking

Solution:day 1: a, b day 2: (1), abor interchange the days!

I A B AB day(1) + - - + +b + - + - -a + + - - -

ab + + + + +

department of mathematics and computer science

45

Back to 2 factors – Blocking

Solution:day 1: a, b day 2: (1), abor interchange the days!

I A B AB day(1) + - - + +b + - + - -a + + - - -

ab + + + + +

Choose within the days by drawing lots which experiment must be performed first. In general, the order of experiments must be determined by drawing lots. This is called randomisation.

department of mathematics and computer science

46

Here is a scheme for 3 factors. Interactions of order 3 or higher can be neglected in practice. How should we divide the experiments over 2 days?

I A B AB C AC BC ABC

(1)a

++

-+

--

+-

--

+-

++

-+

bab

++

-+

++

-+

--

+-

--

+-

cac

++

-+

--

+-

++

-+

--

+-

bcabc

++

-+

++

-+

++

-+

++

-+

day 1

day 2

department of mathematics and computer science

47

Fractional experiments

Often the number of parameters is too large to allow a complete 2p design (i.e, all 2p possible settings -1 and 1 of the p factors).

By performing only a subset of the 2p

experiments in a smart way, we can arrange that by performing relatively few, it is possible to estimate the main effects and (possibly) 2nd order interactions.

department of mathematics and computer science

48

Fractional experiments

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -

c + - - + + - - +

ac + + - - + + - -

bc + - + - + - + -

abc + + + + + + + +

department of mathematics and computer science

49

Fractional experiments

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -

c + - - + + - - +

ac + + - - + + - -

bc + - + - + - + -

abc + + + + + + + +

department of mathematics and computer science

50

Fractional experiments

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -

department of mathematics and computer science

51

Fractional experiments

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -With this half fraction (only 4 = ½×8 experiments) we see that a number of columns are the same (apart from a minus sign):

I = -C, A = -AC, B = -BC, AB = -ABC

department of mathematics and computer science

52

Fractional experiments

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -We say that these factors are confounded or aliased. In this particular case we have an ill-chosen fraction, because I and C are confounded.

I = -C, A = -AC, B = -BC, AB = -ABC

department of mathematics and computer science

53

Fractional experiments – Better Choice: I = ABC

I A B AB C AC BCABC

(1) + - - + - + + -

a + + - - - - + +

b + - + - - + - +

ab + + + + - - - -

c + - - + + - - +

ac + + - - + + - -

bc + - + - + - + -

abc + + + + + + + +

department of mathematics and computer science

54

Fractional experiments – Better Choice: I = ABC

I A B AB C AC BCABC

a + + - - - - + +

b + - + - - + - +

c + - - + + - - +

abc + + + + + + + +

The other “best choice” would be: I = -ABC

Aliasing structure: I = ABC, A = BC, B = AC, C = AB

department of mathematics and computer science

55

I A B AB C AC BCABC

a + + - - - - + +

b + - + - - + - +

c + - - + + - - +

abc + + + + + + + +

In the case of 3 factors further reducing the number of experiments is not possible in practice, because this leads to undesired confounding, e.g. : I = A = BC = ABC, B = C = AB = AC,

department of mathematics and computer science

56

I A B AB C AC BCABC

a + + - - - - + +

abc + + + + + + + +

Other quarter fractions also have confounded main effects, which is unacceptable.

department of mathematics and computer science

57

Further remarks on fractions

• there exist computational rules for aliases. E.g., it follows from A=C that AB = BC. Note that I = A2 = B2 = C2 etc. always holds (see the next lecture)

•tables and software are available for choosing a suitable fraction . The extent of confounding is indicated by the resolution. Resolution III is a minimal ; designs with a higher resolution are very much preferred.

department of mathematics and computer science

58

Plackett-Burman designs

So far we discussed fractional designs for screening. This is sensible if one cannot exclude the possibility of interactions.

If one knows based on foreknowledge that there are no interactions or if one is for some reason is only interested in main effects, than Plackett-Burman designs are preferred. They are able to detect significant main effects using only very few runs. A disadvantage of these designs is their complicated aliasing structure.

department of mathematics and computer science

59

Number of measurements

For every main or interaction effect that has to estimated separately, at least one measurement is necessary. If there are k blocks, then this requires additional k - 1 measurements. The remaining measurements are used for estimation of the variance. It is important to have sufficient measurements for the variance.

department of mathematics and computer science

60

Choice of design

After a design has been chosen, the factors A, B, … must be assigned to the factors of the experiment. It is recommended to combine any foreknowledge on the factors with the alias structure. The individual measurements must be performed in a random order.

• never confound two effects that might both be significant

• if you know that a certain effect will not be significant,you can confound it with an effect that might be significant.

department of mathematics and computer science

61

Centre points and Replications

If there are not enough measurements to obtain a good estimate of the variance, then one can perform replications. Another possibility is to add centre points .

B

A-1 +1

-1

+1

a

ab

(1)

bAdding centre points serves two purposes:• better variance estimate• allow to test curvature using a lack-of-fit test

Centre point

department of mathematics and computer science

62

Curvature

A design in which each factor is only allowed to attain the levels -1 and 1, is implicitly assuming a linear model. This is because knowing only the functions values at -1 and +1, then 1 and x2 cannot be distinguished. We can distinguish them by adding the level 0. This is the idea behind adding centre points.

department of mathematics and computer science

63

Analysis of a Design

A B C Yield

(1) - - - 5

a + - - 2

b - + - 7

ab + + - 1

c - - + 7

ac + - + 6

bc - + + 9

abc + + + 7

department of mathematics and computer science

64

Analysis of a Design – With 2-way InteractionsAnalysis Summary----------------File name: <Untitled>

Estimated effects for Yield----------------------------------------------------------------------average = 5.5 +/- 0.25A:A = -3.0 +/- 0.5B:B = 1.0 +/- 0.5C:C = 3.5 +/- 0.5AB = -1.0 +/- 0.5AC = 1.5 +/- 0.5BC = 0.5 +/- 0.5----------------------------------------------------------------------Standard errors are based on total error with 1 d.f.

department of mathematics and computer science

65

Analysis of a Design – With 2-way InteractionsAnalysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A 18.0 1 18.0 36.00 0.1051B:B 2.0 1 2.0 4.00 0.2952C:C 24.5 1 24.5 49.00 0.0903AB 2.0 1 2.0 4.00 0.2952AC 4.5 1 4.5 9.00 0.2048BC 0.5 1 0.5 1.00 0.5000Total error 0.5 1 0.5--------------------------------------------------------------------------------Total (corr.) 52.0 7

R-squared = 99.0385 percentR-squared (adjusted for d.f.) = 93.2692 percentStandard Error of Est. = 0.707107Mean absolute error = 0.25Durbin-Watson statistic = 2.5Lag 1 residual autocorrelation = -0.375

department of mathematics and computer science

66

Analysis of a Design – Only Main EffectsAnalysis Summary----------------File name: <Untitled>

Estimated effects for Yield----------------------------------------------------------------------average = 5.5 +/- 0.484123A:A = -3.0 +/- 0.968246B:B = 1.0 +/- 0.968246C:C = 3.5 +/- 0.968246----------------------------------------------------------------------Standard errors are based on total error with 4 d.f.

Effect estimates remain the same!

department of mathematics and computer science

67

Analysis of a Design – Only Main EffectsAnalysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A 18.0 1 18.0 9.60 0.0363B:B 2.0 1 2.0 1.07 0.3601C:C 24.5 1 24.5 13.07 0.0225Total error 7.5 4 1.875--------------------------------------------------------------------------------Total (corr.) 52.0 7

R-squared = 85.5769 percentR-squared (adjusted for d.f.) = 74.7596 percentStandard Error of Est. = 1.36931Mean absolute error = 0.8125Durbin-Watson statistic = 2.16667 (P=0.3180)Lag 1 residual autocorrelation = -0.125

department of mathematics and computer science

68

Analysis of a Design with Blocks

Block A B C Yield

(1) 1 - - - 5

ab 1 + + - 1

ac 1 + - + 6

bc 1 - + + 9

a 2 + - - 2

b 2 - + - 7

c 2 - - + 7

abc 2 + + + 7

department of mathematics and computer science

69

Analysis of a Design with Blocks – With 2-way Interactions

Analysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A 18.0 1 18.0B:B 2.0 1 2.0C:C 24.5 1 24.5AB 2.0 1 2.0AC 4.5 1 4.5BC 0.5 1 0.5blocks 0.5 1 0.5Total error 0.0 0--------------------------------------------------------------------------------Total (corr.) 52.0 7

R-squared = 100.0 percentR-squared (adjusted for d.f.) = 100.0 percent

Saturated design: 0 df for the error term → no testing possible

department of mathematics and computer science

70

Analysis of a Design with Blocks – Only Main Effects

Analysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A 18.0 1 18.0 7.71 0.0691B:B 2.0 1 2.0 0.86 0.4228C:C 24.5 1 24.5 10.50 0.0478blocks 0.5 1 0.5 0.21 0.6749Total error 7.0 3 2.33333--------------------------------------------------------------------------------Total (corr.) 52.0 7

R-squared = 86.5385 percentR-squared (adjusted for d.f.) = 76.4423 percentStandard Error of Est. = 1.52753Mean absolute error = 0.75Durbin-Watson statistic = 3.21429 (P=0.0478)Lag 1 residual autocorrelation = -0.642857

department of mathematics and computer science

71

Analysis of a Fractional Design (I = -ABC)

A B C Yield

(1) - - - 5

ac + - + 6

bc - + + 9

ab + + - 1

department of mathematics and computer science

72

Analysis of a Fractional Design (I = -ABC)Analysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A-BC 12.25 1 12.25B:B-AC 0.25 1 0.25C:C-AB 20.25 1 20.25Total error 0.0 0--------------------------------------------------------------------------------Total (corr.) 32.75 3

R-squared = 100.0 percentR-squared (adjusted for d.f.) = 0.0 percent

Estimated effects for Yield----------------------------------------------------------------------average = 5.25A:A-BC = -3.5B:B-AC = -0.5C:C-AB = 4.5 ----------------------------------------------------------------------No degrees of freedom left to estimate standard errors.

department of mathematics and computer science

73

A B Yield

(1) - - 5

a + - 6

b - + 9

ab + + 1

0 0 8

0 0 8

0 0 7

Pure Error =3

2

1

1 1( )

3 1 3ii

y y

Analysis of a Design with Centre Points

department of mathematics and computer science

74

Analysis of a Design with Centre Points

Analysis of Variance for Yield--------------------------------------------------------------------------------Source Sum of Squares Df Mean Square F-Ratio P-Value--------------------------------------------------------------------------------A:A 12.25 1 12.25 36.75 0.0261B:B 0.25 1 0.25 0.75 0.4778AB 20.25 1 20.25 60.75 0.0161Lack-of-fit 10.0119 1 10.0119 30.04 0.0317Pure error 0.666667 2 0.333333--------------------------------------------------------------------------------Total (corr.) 43.4286 6

R-squared = 75.4112 percentR-squared (adjusted for d.f.) = 50.8224 percentStandard Error of Est. = 0.57735Mean absolute error = 1.18367Durbin-Watson statistic = 0.801839 (P=0.1157)Lag 1 residual autocorrelation = 0.524964

P-Value < 0.05→

Lack-of-fit!

department of mathematics and computer science

75

Software• Statgraphics: menu Special -> Experimental Design

• StatLab: http://www.win.tue.nl/statlab2/

• Design Wizard (illustrates blocks and fractions): http://www.win.tue.nl/statlab2/designApplet.html

• Box (simple optimization illustration): http://www.win.tue.nl/~marko/box/box.html

department of mathematics and computer science

76

Literature• J. Trygg and S. Wold, Introduction to Experimental

Design – What is it? Why and Where is it Useful?, homepage of chemometrics, editorial August 2002: www.acc.umu.se/~tnkjtg/Chemometrics/editorial/aug2002.html

• Introduction from moresteam.com: www.moresteam.com/toolbox/t408.cfm

• V. Czitrom, One-Factor-at-a-Time Versus Designed Experiments, American Statistician 53 (1999), 126-131

• Thumbnail Handbook for Factorial DOE, StatEase