Oneway ANOVA - Overview
-
Upload
sr-edith-bogue -
Category
Education
-
view
220 -
download
0
description
Transcript of Oneway ANOVA - Overview
Analysis of Variance: Example
Learning ANOVA through an example
• All students were given a math test.
• Ahead of time, thestudents wererandomly assigned to one of three experimentalgroups (but they did not know about it).
• After the first math test, the teacher behaved differentlywith members of the three different experimental groups.
• Data from Section 42 of Success at Statistics by Pyrczak
Creating different conditions in the groups
• Regardless of their actual performanceon the test, the teacher …
• Gave massive amounts of praise for any correct answers to students in Group A.
• Gave moderate amounts of praise for any correct answers to students in Group B.
• Gave no praise for correct answers, just their score, to students in Group C.
Then the variable of interest was measured
• The next day, atthe end of the mathlesson, the teacher gave another test.
• Scores for all the students were recorded, aswell as the amount of praise they had received for correct answers the day before.
• The researchers thought that the earlier praise might have an effect on their scores on the second test.
• ANOVA’s F-ratio will tell us if that’s true.
F test is a ratio of variance BETWEEN groupsand variance WITHIN groups
• On the top: difference between groups, which includes systematic and random components.
• On the bottom: difference within groups, which includes only the random component.
• When the systematic component is large, the groups differ from each other, and F > 1.00
effects treatment no withsdifference
effects treatmentany including sdifferenceF
Stating Hypotheses
• H0: The amount of praise given has no impact on the math post-test.
• HA: Groups who receive different amounts of praise will have different mean scores.
Scores on Test 2 for 18 students
Group X
A 7
A 6
A 5
A 8
A 3
A 7
B 4
B 6
B 4
B 7
B 5
B 7
C 3
C 2
C 1
C 3
C 4
C 1
ΣX 83
Mean 4.6111
Compare scores to M = 4.61
Part II: Variability: distance from the mean of all the scores on the test
Group X Mtotal X-M (X-M)2
A 7 4.6111 2.3889 5.7068
A 6 4.6111 1.3889 1.9290
A 5 4.6111 0.3889 0.1512
A 8 4.6111 3.3889 11.4846
A 3 4.6111 -1.6111 2.5957
A 7 4.6111 2.3889 5.7068
B 4 4.6111 -0.6111 0.3735
B 6 4.6111 1.3889 1.9290
B 4 4.6111 -0.6111 0.3735
B 7 4.6111 2.3889 5.7068
B 5 4.6111 0.3889 0.1512
B 7 4.6111 2.3889 5.7068
C 3 4.6111 -1.6111 2.5957
C 2 4.6111 -2.6111 6.8179
C 1 4.6111 -3.6111 13.0401
C 3 4.6111 -1.6111 2.5957
C 4 4.6111 -0.6111 0.3735
C 1 4.6111 -3.6111 13.0401
G 83 80.27778 SSTOTAL
Mean 4.6111 4.459877 Variance
Just as in Chapter 3, the differences are squaredΣ(X-M)2 = Sum of Squares = SSTOTAL
Sum of squares: all scores = SSTOTAL = 80.278
What about the impact of praise?
• The mean for all thestudents is 4.61.
• Do all three groupsof students havesimilar means?
• H0: The amount of praise given has no impact on the math post-test.
• HA: Groups who receive different amounts of praise will have different mean scores.
3210 : H
Compute mean score in the groups
• Mean of Group A: MA=6.00
• Mean of Group B: MB=5.50
• Mean of Group C: MC=2.33
Group X
A 7
A 6
A 5
A 8
A 3
A 7
Mean 6
Group X
B 4
B 6
B 4
B 7
B 5
B 7
Mean 5.5
Group X
C 3
C 2
C 1
C 3
C 4
C 1
Mean 2.333
Compare each student’s score to the mean score for his or her own group
MA=6.00 MB=5.50 MC=2.33
SSA=16.00 SSB=9.50 SSC=7.333
Variability within each group is random:all within group had same amount of praise
Group X MA (X-MA)2
A 7 6 1
A 6 6 0
A 5 6 1
A 8 6 4
A 3 6 9
A 7 6 1
Mean 6 SSA 16.000
Group X MB (X-MB)2
B 4 5.5 2.25
B 6 5.5 0.25
B 4 5.5 2.25
B 7 5.5 2.25
B 5 5.5 0.25
B 7 5.5 2.25
Mean 5.5 SSB 9.500
Group X MC (X-MC)2
C 3 2.333 0.445
C 2 2.333 0.111
C 1 2.333 1.777
C 3 2.333 0.445
C 4 2.333 2.779
C 1 2.333 1.777
Mean 2.333 SSC 7.333
Variability within each group is random:all within group had same amount of praise
To find the amount of random variability,add the SS from all the groups together.
Within Sum of squares
SSwithin=16+9.5+7.33
SSwithin=32.833
SSA=16.00 SSB=9.50 SSC=7.333
• SSTOTAL is all the variability in the Sample
• Some of it is systematic variability between groups related to
the treatment, level of praise by the teacher
• Some of it is random within groups, due to the many differences
among students besides the praise level
• SSTOTAL = SSWITHIN + SSBETWEEN
Part V: Analysis of Variance:Partitioning variability into components
Variability between groups is due to the teacher’s level of praise• The means of the groups are not the same
• MA=6.00 MB=5.50 MC=2.33
• SSBETWEEN represents the variability due to the different praise level treatments
• SSTOTAL and SSWITHIN have been computed
• SSTOTAL = 80.28 and SSWITHIN = 32.83
• SSBETWEEN = SSTOTAL – SSWITHIN
• SSBETWEEN = 80.28 – 32.83
• SSBETWEEN = 47.45
• Is the SSBETWEEN large
relative to SSWITHIN ?
• If SSBETWEEN is large
relative to the SSWITHIN
then the treatment (teacher praise) had an effect.
• If SSBETWEEN is large, REJECT the null hypothesis.
• The F-statistic is a ratio of those two components
of variability, adjusted for sample size.
Part VI: Asking the research question a new way: as a ratio between variances
effects treatment no y withvariabilit
effects treatmentany includingy variabilitF
Compute degrees of freedom for SstotalSswithinand SSbetween
Each kind of SS has its own df
• Total degrees of freedom for SSTOTAL
dftotal= N – 1 (N is the total number of cases)dftotal = 18 – 1 = 17
• Between-treatments degrees of freedom for SSBETWEEN
dfbetween= k – 1 (k is the number of groups)dfbetween= 3 – 1 = 2
• Within-groups degrees of freedom for SSWITHIN
dfwithin= N – kdfwithin= 18 – 3 = 15
“Average” the SSwithin and SSbetween over their df
These are called “Mean Squares”
Equations for Mean Squares & F
• The between and within sums of squares are divided by their df to create the appropriate variance
• These are called the Mean Squares• The SS is averaged (mean)
across df
• The F-ratio test statistic is the ratio of MSbetween to MSwithin
within
withinwithin
df
SSMS
between
betweenbetween
df
SSMS
within
between
MS
MSF
Computing the Mean Squares
• SSBETWEEN = 47.45dfbetween= 3 – 1 = 2
• SSWITHIN = 32.833dfwithin = 18 – 3 = 15
75.232
45.47
between
betweenbetween
df
SSMS
189.215
833.32
within
withinwithin
df
SSMS
Computing F for the example
F = 23.725 / 2.189F = 10.849849.10
189.2
735.23
within
between
MS
MSF
within
between
MS
MSF
Testing hypotheses with F
• When the p-value for F is less than the alpha you chose for your test, then you can Reject H0
• There are critical values for F that define a rejection region – but they vary by both types of df and (outside of intro statistics courses) no one knows any of them by heart.
• In this class: we use p-value only, from the F Distribution calculator.
849.10189.2
75.23
within
between
MS
MSF
Testing hypotheses with F
• The p-value of F = 10.849 for df = 2, 15 is p=.0012
• Using = .05
• Since the p-value is less than (<) the alpha level, we Reject the null hypothesis.
• Some groups had different levels of performance on the test due to the level of the teacher’s praise.
849.10189.2
75.23
within
between
MS
MSF
ANOVA table – a tool for computing F
• The SS and df columns add up to the total
• In each row, SS divided by df equals MS
• In the final column, F is MSB divided by MSW
Source SS df MS F
Between 47.45 2 23.725 10.84
Within 32.83 15 2.189
Total 80.28 17
1. Fill in the blanks.2. How many subjects were in the study?3. How many groups were in the study?
Review of the ANOVA test
• Hypotheses and significance level are stated
• Sum of Squared differences from the mean of all the scores is computed = SStotal
• Sum of Squared differences from the mean of each group is computed = SSwithin
• Sum of Squared differences between groups is computed by subtraction = SSbetween
• Degrees of Freedom df are computed for each SS
• Mean Squares MSbetween and MSwithin are computed.
• F ratio is computed and its p –value determined.
• Decision is made regarding the null hypothesis.