lecture 11lecturecontent.s3.amazonaws.com/pdf/14751.pdf · Learning Objectives In this lecture...

Hypothesis Testing: Comparing Parameters (Means, Proportions, Incidence Rates) Between More Than 2 Populations With One Test

Lecture 11

Section A: (Hypothesis Testing) Comparing Means Between More Than Two Populations: Analysis of Variance (ANOVA)

2

Learning Objectives

In this lecture section you will learn to interpret a p-value from a hypothesis test for (any) mean differences between more than two populations

The method for getting the p-value is called the Analysis of Variance (frequently called ANOVA)

3

4

Example 1

Researchers were interested in the relationship between smoking and mid-expiratory flow (FEF), a measure of pulmonary health. The researchers recruited study subjects and classified them into one of six smoking categories1

Nonsmokers (NS) Passive smokers (PS) Non-inhaling smokers (NI) Light smokers (LS) Moderate smokers (MS) Heavy smokers (HS)

1 White, J.R., Froeb, H.F. Small-Airways Dysfunction in Non-Smokers Chronically Exposed to Tobacco Smoke, New England Journal of Medicine 302: 13 (1980)

5

Example 1

To start, the researchers were interested in whether there were any statistically significant differences in pulmonary outcomes (FEV1, FEF, etc..) between the six underlying populations

One strategy would be to perform lots of two-sample t-tests (for each possible two-group comparison)

In this example, there would be 15 such comparisons ! NS to PS, NS to NI, and so on . . .

6

Example 1

Analysis of Variance (One-Way ANOVA) is an extension of the two-sample (unpaired) t-test to compare means between more than two populations with one test

General idea behind ANOVA, comparing means for k-groups (k > 2):

Ho : µ1 = µ2 = …… µkHA : At least one mean different

7

Example 1

Smoking and FEF From a pool of over 5,200 potential participants, a random

sample of 200 men and 200 women was drawn from each smoking group (except for the non-inhalers, where 50 men and 50 women were selected)

FEF measurements were taken on each of the subjects

8

Example 1

Table 1

9

Example 1

Data Summary, Males (FEF 25- 75%)

Group Mean FEF SD FEF

(L/s) (L/s) n

NS 3.78 0.79 200

PS 3.30 0.77 200

NI 3.32 0.86 50

LS 3.23 0.78 200

MS 2.73 0.81 200

HS 2.59 0.82 200

10

Example 1

ANOVA comparing FEF values for males by smoking group

Ho: µNS=µPS=µNI=µLS=µMS=µHS

HA: At least two groups have different means

The p-value from ANOVA is very small: p < 0.01

Conclusion?

11

Example 1

Overall summary of pulmonary comparisons, from abstract (based on follow-up two sample comparisons)

12

Example 2

FEV1 and three medical centers 2

Data was collected on 60 patients with coronary artery disease at 3 difference medical centers (Johns Hopkins, Ranchos Los Amigos Medical Center, St. Louis University School of Medicine)

Purpose of study to investigate effects of carbon monoxide exposure on these patients

Prior to analyzing CO effects data, researchers wished to compared the respiratory health of these patients across the three medical centers

2 Pagano M, Gauvreau K. Principles of Biostatistics. Second Edition. Duxbury Press (2000).

13

Boxplots

FEV1 Values by Center1.

52

2.5

33.

54

FEV1

(L/s

)

JH RLA SL

CHD Patients At 3 Medical CentersBoxplot of FEV1 Measurements

(n=21) (n=16) (n=23)

14

Boxplots

ANOVA

Conclusions?

15

Example 2

ANOVA approach, conceptually

Assume the null hypothesis is true Compute a measure of discrepancy between what was observed

in the sample compared to what is expected under the null (in ANOVA this is called a “F-statistic”

Compared this measure of discrepancy (F-statistic) to the distribution of such measures because of random sampling variability, when the null is true

Convert to a p-value

16

Example 2

For this example, the F-statistic is 3.12. To get a p-value, this observed value is compared to a F-distribution with 2 numerator, and 57 denominator degrees of freedom

17

Example 2

For this example, the F-statistic is 3.12. To get a p-value, this observed value is compared to a F-distribution with 2 numerator, and 57 denominator degrees of freedom

Example 3: Academic Physician Salaries3

From abstract

3 Jagsi R, et al. Gender Differences in the Salaries of Physician Researchers. Journal of the American Medical Association (2012); 307(22); 2410-2417.

18

Example

Article Table

19

Example 4: Academic Physician Salaries

Article Table

20

21

Summary

Section B: (Hypothesis Testing) Comparing Proportions Between More Than Two Populations: Chi-Square Tests

22

Learning Objectives

In this lecture set you will learn to interpret a p-value for a hypothesis test comparing proportions between more than two populations

The method for getting the p-value is an extension of the chi square test shown in lecture 10, and is also called a chi square test

23

24

Example 1

Health care indicators by immigrant status1

1 Huang, Z., et al. Health Status and Health Service Access and Use Among Children in U.S. Immigrant Families, American Journal of Public Health 96: 4 (2006)

25

Example 1

Example: health care indicators by immigrant status1

1 Huang, Z., et al. Health Status and Health Service Access and Use Among Children in U.S. Immigrant Families, American Journal of Public Health 96: 4 (2006)

26

Example 1

Zoom in

27

Example 1

Chi Square Approach

28

Example 1

Zoom in

29

Example 2

Out of pocket spending, medication adherence: dialysis patients, 12 countries2

2 Hirth R, et al. Out-Of-Pocket Spending And Medication Adherence Among Dialysis Patients In Twelve Countries. Health Affairs (2008). 27 (1).

30

Example 2

Characteristics of dialysis patients, 12 countries (six shown)

31

Example 2

Percentage of minorities, compared across the 12 countries (only 6 countries shown)

32

Example 2

Percentage of minorities, compared across the 12 countries (only 6 countries shown)

33

Example 2

Distribution of income categories

34

Example 2

Distribution of income categories

35

Example 2

ANOVA as well (previously lecture section infiltration!)


From abstract

3 Jagsi R, et al. Gender Differences in the Salaries of Physician Researchers. Journal of the American Medical Association (2012); 307(22); 2410-2417.

36


Table: characteristics of physicians by sex

37



38



39

Summary

40

Section C: (Hypothesis Testing) Comparing Survival Curves Between More Than Two Populations: Log-rank Tests

41

Learning Objectives

In this lecture set you will learn interpret a p-value for a hypothesis test comparing survival curves (and hence incidence rates) between more than two populations

The method for getting the p-value is an extension of the log rank test shown in lecture 10, and is also called a log rank test

42

Maternal Vitamin Supplementation and Infant Mortality

1 Katz J, West K et al. Maternal low-dose vitamin A or β-carotene supplementation has no effect on fetal loss and early infant mortality: a randomized cluster trial in Nepal. American Journal of Clinical Nutrition (2000) Vol. 71, No. 6, 1570-1576.

Example 11

43

05.1deaths/day 00039.0deaths/day 00041.0

ˆˆˆ placebo

vitAvitA RI

RIRIR

00.1deaths/day 00039.0deaths/day 00039.0

ˆˆˆ placebo

BCBC RI

RIRIR


Example 1

44

0.90

0.95

1.00

Prop

ortio

n St

ill Al

ive

0 50 100 150 200Follow-Up Time Since Enrollment in Study (days)

treat = Beta C treat = Placebotreat = Vit A

Kaplan-Meier survival estimates


Example 1

45

0.00

0.05

0.10

Prop

ortio

n Th

at H

ad D

ied

0 50 100 150 200Follow-Up Time Since Enrollment in Study (days)

treat = Beta C treat = Placebotreat = Vit A

Kaplan-Meier failure estimates

Log rank test

Interpretation?

Example 1

46

47

Examples 2

Return to Work Following Injury: The Role of Economic, Social, and Job-Related Factors2

“The main dependent variable in the analysis is the time (in days) from injury to the first time the study patient returned to work. Kaplan-Meier estimates of the cumulative proportion of patients returning to work were computed. These estimates take into account how long patients were followed as well as when they returned to work. A log-rank test was used to test the association between the cumulative probability of RTW and each of the risk factors considered one at a time”

2 MacKenzie E, et al Return to Work Following Injury:The Role of Economic, Social, and Job-Related Factors America Journal of Public Health 88; 11 (1998)

48

Example 2

Kaplan Meier estimates of )(ˆ1 tS

49

Example 3

Post partum ART therapy for children born to HIV positive women3

3 Nielson-Saines K, et al. Three Postpartum Antiretroviral Regimens to Prevent Intrapartum HIV Infection. New England Journal of Medicine (2012). 366 (25).

50

Example 3

Post partum ART therapy for children born to HIV positive women

“Kaplan–Meier curves for intrapartum transmission differed significantly (P = 0.03 for the overall comparison). Transmission rates were highest in the zidovudine-alone group (3.4% at 4 to 6 weeks vs. 1.6% in the two-drug group and 1.4% in the three-drug group; 4.8% at 3 months vs. 2.2% in the two-drug group and 2.4% in the three-drug group).”

51

Example 3

Post partum ART therapy for children born to HIV positive women

52

Summary

lecture 11lecturecontent.s3.amazonaws.com/pdf/14751.pdf · Learning Objectives In this lecture...

Documents

Transcript of lecture 11lecturecontent.s3.amazonaws.com/pdf/14751.pdf · Learning Objectives In this lecture...