Oneway ANOVA - Overview

Analysis of Variance: Example

Learning ANOVA through an example

• All students were given a math test.

• Ahead of time, thestudents wererandomly assigned to one of three experimentalgroups (but they did not know about it).

• After the first math test, the teacher behaved differentlywith members of the three different experimental groups.

• Data from Section 42 of Success at Statistics by Pyrczak

Creating different conditions in the groups

• Regardless of their actual performanceon the test, the teacher …

• Gave massive amounts of praise for any correct answers to students in Group A.

• Gave moderate amounts of praise for any correct answers to students in Group B.

• Gave no praise for correct answers, just their score, to students in Group C.

Then the variable of interest was measured

• The next day, atthe end of the mathlesson, the teacher gave another test.

• Scores for all the students were recorded, aswell as the amount of praise they had received for correct answers the day before.

• The researchers thought that the earlier praise might have an effect on their scores on the second test.

• ANOVA’s F-ratio will tell us if that’s true.

F test is a ratio of variance BETWEEN groupsand variance WITHIN groups

• On the top: difference between groups, which includes systematic and random components.

• On the bottom: difference within groups, which includes only the random component.

• When the systematic component is large, the groups differ from each other, and F > 1.00

effects treatment no withsdifference

effects treatmentany including sdifferenceF

Stating Hypotheses

• H0: The amount of praise given has no impact on the math post-test.

• HA: Groups who receive different amounts of praise will have different mean scores.

Scores on Test 2 for 18 students

Group X

A 7

A 6

A 5

A 8

A 3

A 7

B 4

B 6

B 4

B 7

B 5

B 7

C 3

C 2

C 1

C 3

C 4

C 1

ΣX 83

Mean 4.6111

Compare scores to M = 4.61

Part II: Variability: distance from the mean of all the scores on the test

Group X Mtotal X-M (X-M)2

A 7 4.6111 2.3889 5.7068

A 6 4.6111 1.3889 1.9290

A 5 4.6111 0.3889 0.1512

A 8 4.6111 3.3889 11.4846

A 3 4.6111 -1.6111 2.5957

A 7 4.6111 2.3889 5.7068

B 4 4.6111 -0.6111 0.3735

B 6 4.6111 1.3889 1.9290

B 4 4.6111 -0.6111 0.3735

B 7 4.6111 2.3889 5.7068

B 5 4.6111 0.3889 0.1512

B 7 4.6111 2.3889 5.7068

C 3 4.6111 -1.6111 2.5957

C 2 4.6111 -2.6111 6.8179

C 1 4.6111 -3.6111 13.0401

C 3 4.6111 -1.6111 2.5957

C 4 4.6111 -0.6111 0.3735

C 1 4.6111 -3.6111 13.0401

G 83 80.27778 SSTOTAL

Mean 4.6111 4.459877 Variance

Just as in Chapter 3, the differences are squaredΣ(X-M)2 = Sum of Squares = SSTOTAL

Sum of squares: all scores = SSTOTAL = 80.278

What about the impact of praise?

• The mean for all thestudents is 4.61.

• Do all three groupsof students havesimilar means?

• H0: The amount of praise given has no impact on the math post-test.

• HA: Groups who receive different amounts of praise will have different mean scores.

3210 : H

Compute mean score in the groups

• Mean of Group A: MA=6.00

• Mean of Group B: MB=5.50

• Mean of Group C: MC=2.33

Group X

A 7

A 6

A 5

A 8

A 3

A 7

Mean 6

Group X

B 4

B 6

B 4

B 7

B 5

B 7

Mean 5.5

Group X

C 3

C 2

C 1

C 3

C 4

C 1

Mean 2.333

Compare each student’s score to the mean score for his or her own group

MA=6.00 MB=5.50 MC=2.33

SSA=16.00 SSB=9.50 SSC=7.333

Variability within each group is random:all within group had same amount of praise

Group X MA (X-MA)2

A 7 6 1

A 6 6 0

A 5 6 1

A 8 6 4

A 3 6 9

A 7 6 1

Mean 6 SSA 16.000

Group X MB (X-MB)2

B 4 5.5 2.25

B 6 5.5 0.25

B 4 5.5 2.25

B 7 5.5 2.25

B 5 5.5 0.25

B 7 5.5 2.25

Mean 5.5 SSB 9.500

Group X MC (X-MC)2

C 3 2.333 0.445

C 2 2.333 0.111

C 1 2.333 1.777

C 3 2.333 0.445

C 4 2.333 2.779

C 1 2.333 1.777

Mean 2.333 SSC 7.333

Variability within each group is random:all within group had same amount of praise

To find the amount of random variability,add the SS from all the groups together.

Within Sum of squares

SSwithin=16+9.5+7.33

SSwithin=32.833

SSA=16.00 SSB=9.50 SSC=7.333

• SSTOTAL is all the variability in the Sample

• Some of it is systematic variability between groups related to

the treatment, level of praise by the teacher

• Some of it is random within groups, due to the many differences

among students besides the praise level

• SSTOTAL = SSWITHIN + SSBETWEEN

Part V: Analysis of Variance:Partitioning variability into components

Variability between groups is due to the teacher’s level of praise• The means of the groups are not the same

• MA=6.00 MB=5.50 MC=2.33

• SSBETWEEN represents the variability due to the different praise level treatments

• SSTOTAL and SSWITHIN have been computed

• SSTOTAL = 80.28 and SSWITHIN = 32.83

• SSBETWEEN = SSTOTAL – SSWITHIN

• SSBETWEEN = 80.28 – 32.83

• SSBETWEEN = 47.45

• Is the SSBETWEEN large

relative to SSWITHIN ?

• If SSBETWEEN is large

relative to the SSWITHIN

then the treatment (teacher praise) had an effect.

• If SSBETWEEN is large, REJECT the null hypothesis.

• The F-statistic is a ratio of those two components

of variability, adjusted for sample size.

Part VI: Asking the research question a new way: as a ratio between variances

effects treatment no y withvariabilit

effects treatmentany includingy variabilitF

Compute degrees of freedom for SstotalSswithinand SSbetween

Each kind of SS has its own df

• Total degrees of freedom for SSTOTAL

dftotal= N – 1 (N is the total number of cases)dftotal = 18 – 1 = 17

• Between-treatments degrees of freedom for SSBETWEEN

dfbetween= k – 1 (k is the number of groups)dfbetween= 3 – 1 = 2

• Within-groups degrees of freedom for SSWITHIN

dfwithin= N – kdfwithin= 18 – 3 = 15

“Average” the SSwithin and SSbetween over their df

These are called “Mean Squares”

Equations for Mean Squares & F

• The between and within sums of squares are divided by their df to create the appropriate variance

• These are called the Mean Squares• The SS is averaged (mean)

across df

• The F-ratio test statistic is the ratio of MSbetween to MSwithin

within

withinwithin

df

SSMS

between

betweenbetween

df

SSMS

within

between

MS

MSF

Computing the Mean Squares

• SSBETWEEN = 47.45dfbetween= 3 – 1 = 2

• SSWITHIN = 32.833dfwithin = 18 – 3 = 15

75.232

45.47

between

betweenbetween

df

SSMS

189.215

833.32

within

withinwithin

df

SSMS

Computing F for the example

F = 23.725 / 2.189F = 10.849849.10

189.2

735.23

within

between

MS

MSF

within

between

MS

MSF

Testing hypotheses with F

• When the p-value for F is less than the alpha you chose for your test, then you can Reject H0

• There are critical values for F that define a rejection region – but they vary by both types of df and (outside of intro statistics courses) no one knows any of them by heart.

• In this class: we use p-value only, from the F Distribution calculator.

849.10189.2

75.23

within

between

MS

MSF

Testing hypotheses with F

• The p-value of F = 10.849 for df = 2, 15 is p=.0012

• Using = .05

• Since the p-value is less than (<) the alpha level, we Reject the null hypothesis.

• Some groups had different levels of performance on the test due to the level of the teacher’s praise.

849.10189.2

75.23

within

between

MS

MSF

ANOVA table – a tool for computing F

• The SS and df columns add up to the total

• In each row, SS divided by df equals MS

• In the final column, F is MSB divided by MSW

Source SS df MS F

Between 47.45 2 23.725 10.84

Within 32.83 15 2.189

Total 80.28 17

1. Fill in the blanks.2. How many subjects were in the study?3. How many groups were in the study?

Review of the ANOVA test

• Hypotheses and significance level are stated

• Sum of Squared differences from the mean of all the scores is computed = SStotal

• Sum of Squared differences from the mean of each group is computed = SSwithin

• Sum of Squared differences between groups is computed by subtraction = SSbetween

• Degrees of Freedom df are computed for each SS

• Mean Squares MSbetween and MSwithin are computed.

• F ratio is computed and its p –value determined.

• Decision is made regarding the null hypothesis.

Oneway ANOVA - Overview

Education

Transcript of Oneway ANOVA - Overview