Lecture slides stats1.13.l21.air

37
Statistics One Lecture 21 Assumptions Revisited 1

description

Lecture slides stats1.13.l21.air

Transcript of Lecture slides stats1.13.l21.air

Page 1: Lecture slides stats1.13.l21.air

Statistics One

Lecture 21 Assumptions Revisited

1

Page 2: Lecture slides stats1.13.l21.air

Two segments

•  Assumptions •  Transformations

2

Page 3: Lecture slides stats1.13.l21.air

Lecture 21 ~ Segment 1

Assumptions

3

Page 4: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Correlation •  Normal distribution in X and Y •  Linear relationship between X and Y •  Homoscedasticity

•  Plot histograms and scatterplots •  Print summary statistics

Page 5: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Regression •  Normal distribution in Y •  Linear relationship between X and Y •  Homoscedasticity •  Correlations among predictor variables all < .80 •  No multicolinearity

•  Plot histograms and scatterplots and residuals •  Print summary statistics

Page 6: Lecture slides stats1.13.l21.air

Review of main assumptions

•  t-tests •  Normal distribution in Y (DV) •  Homogeneity of variance •  Equivalent sample size

•  Levene’s test

Page 7: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Between groups and Factorial ANOVA •  Normal distribution in Y (DV) •  Homogeneity of variance •  Equivalent sample size

•  Levene’s test

Page 8: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Repeated measures ANOVA •  Normal distribution in Y (DV) •  Sphericity assumption •  Homogeneity of variance •  Homogeneity of covariance

•  Equivalent sample size

•  Levene’s test •  Mauchly’s test

Page 9: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Chi-square •  Independence •  Adequate expected cell counts

Page 10: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Two primary constraints of the assumptions •  Normal distribution in Y (DV) •  Linear relationship between predictor variables

and outcome variable (see Lecture 23)

Page 11: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Normal distribution in Y (DV) •  How to test •  Histograms and summary statistics •  Q-Q plots •  Empirical tests, such as D’Agostino’s K2 test

Page 12: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Normal distribution in Y (DV) •  How to test •  Histograms and summary statistics

•  Look for extreme skew and/or kurtosis and/or outliers (for example, cases +/- 3 SDs from the mean)

•  Rule of thumb is (skew > 3) and/or (kurtosis > 10) indicates a non-normal distribution

Page 13: Lecture slides stats1.13.l21.air

Positive skew with outliers

Page 14: Lecture slides stats1.13.l21.air

Positive skew

Page 15: Lecture slides stats1.13.l21.air

Review of main assumptions

•  Normal distribution in Y (DV) •  How to test •  Q-Q plot

•  A plot of the sorted values from the data set against the expected values of the corresponding quantiles from the standard normal distribution.

•  If the distribution is normal then the plotted points should approximately lie on a straight line.

Page 16: Lecture slides stats1.13.l21.air

Normal distribution

Page 17: Lecture slides stats1.13.l21.air

Positive skew

Page 18: Lecture slides stats1.13.l21.air

Segment summary

•  The inferential statistics covered in this course involve several assumptions

•  Two primary assumptions – Normal distribution in the outcome Y (DV) – Linear relationship between predictors and Y

Page 19: Lecture slides stats1.13.l21.air

Segment summary

•  Normal distribution in Y (DV) •  How to test •  Histograms and summary statistics •  Q-Q plots •  Empirical tests, such as D’Agostino’s K2 test

Page 20: Lecture slides stats1.13.l21.air

END SEGMENT

20

Page 21: Lecture slides stats1.13.l21.air

Lecture 21 ~ Segment 2

Transformations

21

Page 22: Lecture slides stats1.13.l21.air

Transformations •  If a distribution is not normal then it is sometimes possible

to transform the data in an attempt to make the distribution normal

•  A transformation is a single function applied to all data points in the distribution (for example, square root)

•  The rank order of cases should remain the same but the distance between cases may change

Page 23: Lecture slides stats1.13.l21.air

Transformations

•  Most common transformations for positive skew •  Square root •  Logarithm •  Inverse

Page 24: Lecture slides stats1.13.l21.air

Transformations

Page 25: Lecture slides stats1.13.l21.air

Transformations

•  Most common transformations for negative skew •  Reflect and Square root •  Reflect and Logarithm •  Reflect and Inverse

Page 26: Lecture slides stats1.13.l21.air

Transformations

Page 27: Lecture slides stats1.13.l21.air

Positive skew

Skew = 3.05 Kurtosis = 9.26

Page 28: Lecture slides stats1.13.l21.air

Square root transform

Skew = 2.86 Kurtosis = 7.91

Page 29: Lecture slides stats1.13.l21.air

Log transform

Skew = 2.69 Kurtosis = 6.75

Page 30: Lecture slides stats1.13.l21.air

Negative skew

Skew = -3.36 Kurtosis = 11.73

Page 31: Lecture slides stats1.13.l21.air

Reflect and square root

Skew = 2.58 Kurtosis = 6.62

Page 32: Lecture slides stats1.13.l21.air

Reflect and log

Skew = 1.59 Kurtosis = 2.92

Page 33: Lecture slides stats1.13.l21.air

Segment summary

•  If a distribution is not normal then it is sometimes possible to transform the data in an attempt to make the distribution normal

Page 34: Lecture slides stats1.13.l21.air

Segment summary

•  Most common transformations for positive skew •  Square root •  Logarithm •  Inverse

Page 35: Lecture slides stats1.13.l21.air

Segment summary

•  Most common transformations for negative skew •  Reflect and Square root •  Reflect and Logarithm •  Reflect and Inverse

Page 36: Lecture slides stats1.13.l21.air

END SEGMENT

36

Page 37: Lecture slides stats1.13.l21.air

END LECTURE 21

37