lec6.PDF

STAT3010: Lecture 6

1

Multiple Comparisons Procedures- Contd (Section 9.5, Page 425)

Recall: Last class we looked at the Scheffe Multiple Comparison Procedure. These calculations take some time (especially when you have a lot more than 3 treatments!), and are generally performed using statistical software on a computer, so lets try it with SAS.

SAS CODE:options ps=62 ls=80;data scheffe;

input Drug $ time; cards;1 301 351 401 251 352 252 202 302 252 303 153 203 253 203 20run;

proc anova;class drug;model time=drug;means drug/scheffe; run;

STAT3010: Lecture 6

2

SAS OUTPUT:The SAS System

The ANOVA ProcedureClass Level Information

Class Levels ValuesDrug 3 1 2 3

Number of Observations Read 15Number of Observations Used 15

The ANOVA Procedure Dependent Variable: time

Sum ofSource DF Squares Mean Square F Value Pr > F Model 2 423.3333333 211.6666667 10.16 0.0026 Error 12 250.0000000 20.8333333 Corrected Total 14 673.3333333

R-Square Coeff Var Root MSE time Mean 0.628713 17.33299 4.564355 26.33333

Source DF Anova SS Mean Square F Value Pr > F Drug 2 423.3333333 211.6666667 10.16 0.0026

The ANOVA Procedure Scheffe's Test for time

NOTE: This test controls the Type I experimentwise error rate.

Alpha 0.05Error Degrees of Freedom 12Error Mean Square 20.83333Critical Value of F 3.88529Minimum Significant Difference 8.047

Means with the same letter are not significantly different.

Scheffe Grouping Mean N DrugA 33.000 5 1A

B A 26.000 5 2B B 20.000 5 3

STAT3010: Lecture 6

3

The Tukey Procedure

The Tukey procedure is also called the Studentized Range test. The Tukey procedure is appropriate for pairwise comparisons, but it doesnt handle general contrasts. However, this procedure has better statistical power than the Scheffe procedure.

Before I outline the Tukey procedure, I need to explain the concept of the comparisons.you have to compare in the right order.

The first comparison of treatments uses the largest sample mean compared with the smallest sample mean, if this test is significant (there is a difference), then the next comparison of treatments deals with the largest sample mean compared with the second to smallest sample mean, if this test is significant (there is a difference), then the next comparison of treatments deals with the largest sample mean with the third to smallest

STAT3010: Lecture 6

4

sample mean, and so on. Once you reach a non-significant difference (there is no difference), then the test stops.

Outline of the Tukey Procedure:

1. Set up the hypothesis:

2. Compute the test statistic:

3. Decision Rule:

4. Conclusion.

Example 9.9: Recall Example 9.3;

Summary Statistics by Treatment

51n 52n 53n 331x 262x 203x

7.51s 2.42s 5.33s

STAT3010: Lecture 6

5

Drug A versus Drug C:

1. Hypothesis:

2. Test Statistic:

3. Decision:

4. Conclusion:

Now, because the first test was significant, we proceed to the next test (largest sample mean compared to the second smallest sample mean).

Drug A versus Drug B:

1. Hypothesis:

2. Test Statistic:

3. Decision:

4. Conclusion:

STAT3010: Lecture 6

6

SAS CODE:options ps=62 ls=80;data tukey;

input drug time; cards;1 301 351 401 251 352 252 202 302 252 303 153 203 253 203 20run; proc anova;class drug;model time=drug;means drug/tukey; run;

SAS OUTPUT: The SAS System

The ANOVA Procedure Class Level Information

Class Levels Valuesdrug 3 1 2 3


STAT3010: Lecture 6

7

The ANOVA Procedure Dependent Variable: time

Sum ofSource DF Squares Mean Square F Value Pr > FModel 2 423.3333333 211.6666667 10.16 0.0026 Error 12 250.0000000 20.8333333 Corrected Total 14 673.3333333

R-Square Coeff Var Root MSE time Mean 0.628713 17.33299 4.564355 26.33333

Source DF Anova SS Mean Square F Value Pr > F drug 2 423.3333333 211.6666667 10.16 0.0026

The ANOVA Procedure

Tukey's Studentized Range (HSD) Test for time

NOTE: This test controls the Type I experimentwise error rate, but it generally has a higher Type II error rate than REGWQ.

Alpha 0.05 Error Degrees of Freedom 12 Error Mean Square 20.83333 Critical Value of Studentized Range 3.77278 Minimum Significant Difference 7.7012

Means with the same letter are not significantly different.

Tukey Grouping Mean N drug

A 33.000 5 1A

B A 26.000 5 2B B 20.000 5 3

STAT3010: Lecture 6

8

Contrasts

When we create an ANOVA test, in the ideal situation, specific questions regarding comparisons among the means are posed before the data are collected. Lets look at an example:

This is a randomized comparative experiment to compare three methods for teaching reading. Our response variable is COMP, a measure of reading comprehension that was measured by a test taken after the instruction was completed.

Group COMP Group COMP Group COMPBasal 41 DRTA 31 Strat 53

Basal 41 DRTA 40 Strat 47





















The following is the summary statistics (proc means) and ANOVA table (proc anova) from SAS:

The SAS System The MEANS Procedure

Variable N Mean Std Dev Minimum Maximum Basal 22 41.0454545 5.6355781 32.0000000 54.0000000 DRTA 22 46.7272727 7.3884196 30.0000000 57.0000000 Strat 22 44.2727273 5.7667505 33.0000000 53.0000000

STAT3010: Lecture 6

9

The SAS System The ANOVA Procedure

Class Level Information

Class Levels Values trt 3 Basal DRTA Strat


The SAS System

The ANOVA Procedure

Dependent Variable: COMPSum of

Source DF Squares Mean Square F Value Pr > F

Model 2 357.303030 178.651515 4.48 0.0152 Error 63 2511.681818 39.867965 Corrected Total 65 2868.984848

R-Square Coeff Var Root MSE score Mean 0.124540 14.34531 6.314108 44.01515

Source DF Anova SS Mean Square F Value Pr > F trt 2 357.3030303 178.6515152 4.48 0.0152

This above ANOVA shows:

Lets say the researchers are now investigating a specific theory about reading comprehension. The instruction for the Basal group was the standard method commonly used in schools. The DRTA and Strat groups received innovative methods of teaching that were designed to increase the reading comprehension of the children. The DRTA and Strat methods were not identical, but they both involved teaching the students to use similar comprehension strategies in their reading. Based on this supposition, the relevant contrast is:

STAT3010: Lecture 6

10

This above hypothesis compares the average of the two innovative methods (DRTA and Strat) with the standard method (Basal). The alternative is one sided because the researchers are interested in demonstrating that the new methods are better than the old.

Notice that the combination of population means is 0. These combinations of means are called contrasts. We use , the Greek letter psi, for contrasts among population means:

Here are the relevant formulas for carrying out a contrast

Contrasts A contrast is a combination of population means of the form

iia where the coefficients ia have sum 0. The corresponding sample contrast is

ii xacThe standard error of c is

i

ipc n

asSE2

where ps is the root MSE. To test the null hypothesis 0:oH

use the t statistic

cSEct

with degrees of freedom DFE that are associated with ps . The alternative hypothesis can be one-sided or two sided.

STAT3010: Lecture 6

11

The sample contrast that estimates

with standard error

The t statistic is

lec6.PDF

Documents

Transcript of lec6.PDF