Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project...

33
Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital & Univeristy of Toronto

Transcript of Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project...

Page 1: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Contextual Analysis: Applying the Statistical Concepts to Real Data

Seminar IIIMODE-PTD ProjectJuly 2007

Patricia O’Campo Ph.D.St. Michael’s Hospital &Univeristy of Toronto

Page 2: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Project Areas Four University – Government health

departments partnerships; 8 study sites total (5 suburban, 3 urban)

Baltimore City, MD Baltimore County, MD Prince Georges County, MD Montgomery County, MD 16 pooled cities, MI Durham County, NC Wake County, NC Philadelphia, PA

Page 3: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Overview How was an index of neighborhood

deprivation developed?

Is the Neighborhood Deprivation Index associated with Preterm Birth in a single site?

Is the Neighborhood Deprivation Index associated with Preterm Birth across multiple sites?

What are some resources available for those wanting to learn more about multilevel modeling?

Page 4: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood characteristics

Existing domains in the literature

Poverty/income Racial/ethnic

composition Education Employment Occupation Housing/crowding Residential stability

Limitations:

Use of theory rare in selecting area factors

Inconsistent operationalization of neighborhood domains

Poverty alone is not a good proxy for other neighborhood factors

Page 5: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Data source & Specification of “neighborhood”

Neighborhood=Census tract

Data come from 2000 Census

Census tracts are small (~4000 residents), relatively permanent statistical subdivisions of counties, designed to be fairly homogeneous united with respect to socio-demographic characteristics and living conditions

Page 6: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Construction of neighborhood deprivation index (NDI)

Identified 20 census variables in 7 domains used consistently to approximate neighborhood-level environments for inclusion in deprivation index

One education variable Two employment variables Five housing variables Four occupation variables Five poverty variables One racial composition variable Two residential stability variables

Page 7: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood index: Data reduction

Census tract data from the eight study sites were used in principal components analyses (PCA) to create the index

PCA is a data reduction technique that transforms several correlated variables into a smaller set of uncorrelated variables or components

The first component explains as much of the overall variance as possible

Page 8: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood index: Data reduction

Across study areas, loadings on first principal component ranged from -0.041 to 0.295, with a mean loading of 0.211

Item loadings (>.25) were used to retain 8 items in the index

Final item loadings were used to weight each variable’s contribution to the summary score for each census tract

The index was standardized to have a mean of 0 and a standard deviation of 1.

Page 9: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood deprivation index

The neighborhood deprivation index accounted for 67% of the total variance

Consistency of loading for each item across sites suggests that variables function similarly across geography, despite heterogeneity in demographics and economic status

Page 10: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Principal Components Analysis

8 variables were retained for the neighborhood deprivation

% males in managerial or professional occup % crowded housing % HH in poverty % female headed HH % HH on public assistance % HH earnings <$30k % < HS educ % unemployed

Page 11: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Socio-demographic variability across geographic areas

0

20

40

60

80

% < 30K % NH White

Baltimore City, MD Baltimore County, MD

Montgomery County, MD Prince Georges County, MD

16 Cities, MI Durham County, NC

Wake County, NC Philadelphia, PA

Page 12: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Distribution of deprivation scores across eight study sites

Page 13: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Individual-level Data from Vital Records (1999-2001)

Individual-level variables Categories

Preterm birth <=37 weeks (preterm)*>=38 weeks

Maternal Age < 2020-24 25-29 30-3435+ years of age

Maternal Race Non-Hispanic BlackNon-Hispanic White

Maternal Education < 20 years & < High school > 20 years & < High school High school or equiv > High school

*preterm infants had to weigh less than 3888 grams

Page 14: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Baltimore City: Preterm Birth %

9.13

15.97

0

2

4

6

8

10

12

14

16

18

NH White (n=5707)

NH Black (n=18038)

Page 15: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Baltimore City Demographics: Age

0

5

10

15

20

25

30

35

<20 20-24 25-29 30-34 35+

NH White (n=5707)NH Black (n=18038)

Page 16: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Baltimore City Demographics: Education

0

10

20

30

40

50

60

<20 years & < HS

>= 20 & < HS

HS or equiv

> HS

NH White (n=5707)

NH Black (n=18038)

Page 17: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood Deprivation Index (NDI)

NH White (174 Census tracts)

Median -0.267 25% -0.629 75% 0.202

Min -1.34 Max 2.59

NH Black (191 Census tracts)

Median 0.742 25% 0.082 75% 1.191

Min -1.27 Max 2.59

Page 18: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Methods: Data Analysis

Race-stratified models

1. Multi-level (random intercept) logistic regression models with NDI as only predictor

2. Multi-level (random intercept) logistic regression models with NDI plus the potential confounders maternal age and education

Statistical methods were presented in detail in the previous DataSpeak session on “Methods for Understanding and Interpreting Multilevel Analysis” (June 07)

Page 19: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood deprivation odds ratios comparing the 1st to the 5th quintile

Odds ratiosX Data

1 1.5 2 2.5 3 3.51 1.5 2 2.5 3 3.5

Unadjusted modelsAdjusted models

Non-Hispanic Black women

Non-Hispanic White women 2.79

1.87

1.37

1.23

Page 20: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Percent preterm among non-Hispanic White and non-Hispanic Black women by site

02468

1012141618

White Black

Baltimore City, MD Baltimore County, MDMontgomery County, MD Prince Georges County, MD16 Cities, MI Durham County, NCWake County, NC Philadelphia, PA

%

Page 21: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Percent age 30-34 by race and site

0

10

20

30

40

50

White Black

Baltimore City, MD Baltimore County, MD Montgomery County, MD

Prince Georges County, MD 16 Cities, MI Durham County, NC

Wake County, NC Philadelphia, PA

%

Page 22: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Percent with more than high school or equivalent education by race and site

0

20

40

60

80

100

White Black Baltimore City, MD Baltimore County, MDMontgomery County, MD Prince Georges County, MD16 Cities, MI Durham County, NCWake County, NC Philadelphia, PA

%

Page 23: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood Deprivation Index Values by Site

Page 24: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood deprivation odds ratios for the outcome preterm birth

White-unadj White-adj Black-unadj Black-adj

Philadelphia 1.68*

(1.45, 1.92)

1.52*

(1.28, 1.75)

1.40* (1.27, 1.53)

1.26* (1.13, 1.39)

Wake 1.82*

(1.48, 2.16)

1.52*

(1.15, 1.88)

1.33*

(1.02, 1.64)

1.20 (0.89, 1.51)

Durham 2.18* (1.69, 2.68)

1.68*(1.14, 2.23)

1.48* (1.24, 1.71)

1.20 (0.97, 1.43)

16 Cities 1.64* (1.51, 1.77)

1.48* (1.32, 1.63)

1.14*(1.06, 1.22)

1.05

(0.95, 1.16)

PG Cnty 2.07* (1.29, 2.85)

1.68 (0.90, 2.46)

1.20 (0.97, 1.43)

1.14

(0.90, 1.37)

Montgomery Cnty

1.82*

(1.38, 2.26)

1.56*

(1.09, 2.02)

0.86

(0.36, 1.35)

0.88 (0.38, 1.37)

Baltimore County,

3.14* (2.72, 3.56)

2.24* (1.77, 2.71)

1.37

(0.74, 1.99)

1.40 (0.78, 2.03)

Baltimore City, MD

2.79*

(2.30, 3.08)

1.87* (1.42, 2.31)

1.37*

(1.24, 1.50)

1.23* (1.08, 1.39)

Adjusted models include maternal age and maternal education

Page 25: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood-level deprivation and preterm birth: similarity across sites

The Cochran’s Q statistic for homogeneity was calculated across sites to test the null hypothesis that the neighborhood deprivation betas are similar across sites

Cochran’s Q was not significant for both non-Hispanic White women (p=0.83) and non-Hispanic Black women (p=0.50).

Page 26: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Neighborhood-level deprivation and preterm birth: similarity across sites

Because the hypothesis of homogeneity was not rejected, we calculated a summary statistic representing the effect of neighborhood deprivation and preterm birth across sites.

The summary effect for non-Hispanic White women was 1.57 (95% CI: 1.41 to 1.74) and for non-Hispanic Black women it was 1.15 (95% CI: 1.08 to 1.23).

Page 27: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Are the associations similar across geographic area?

White Non-Hispanic Women Ages 20+

-3.5

-3.0

-2.5

-2.0

-1.5

-1.0

-10.0 -5.0 0.0 5.0 10.0 15.0

Deprivation Score

Est

imat

ed L

og

-Od

ds

Black Non-Hispanic Women Ages 20+

-3.5

-3.0

-2.5

-2.0

-1.5

-1.0

-10.0 -5.0 0.0 5.0 10.0 15.0

Deprivation Score

Estimated Log-Odds of Preterm Delivery by Deprivation Score and By Site

Baltimore County Baltimore City Montgomery Co

PG County Wake Co Michigan Cities

Philadelphia Durham Co Combined

(adjusted for maternal age and education)

White Non-Hispanic Women Ages 20+

-3.5

-3.0

-2.5

-2.0

-1.5

-1.0

-10.0 -5.0 0.0 5.0 10.0 15.0

Deprivation Score

Est

imat

ed L

og

-Od

ds

Black Non-Hispanic Women Ages 20+

-3.5

-3.0

-2.5

-2.0

-1.5

-1.0

-10.0 -5.0 0.0 5.0 10.0 15.0

Deprivation Score

Estimated Log-Odds of Preterm Delivery by Deprivation Score and By Site

Baltimore County Baltimore City Montgomery Co

PG County Wake Co Michigan Cities

Philadelphia Durham Co Combined

(adjusted for maternal age and education)

Page 28: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Summary of deprivation and preterm birth in multiple sites

Despite the diversity of geographic settings, there was homogeneity in the effect of neighborhood deprivation on preterm birth across sites for both non-Hispanic Blacks and non-Hispanic Whites.

The weaker effect for non-Hispanic Blacks versus Whites was explored but could not be explained with the data we had.

Page 29: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Additional project analyses

Examine separate components of the deprivation index

Examine other exposures such as smoking and maternal age

Examine other outcomes such as small for gestational age

Page 30: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Selected examples

Extensive resource listing on the Dataspeak website

Books Luke DA. Multilevel Modeling: Quantitative

Applications in the Social Sciences. July 2004 Sage Publications Inc.

Raudenbush, S. W., and A. S. Bryk.. Heirarchical linear models: Applications and data analysis methods. 2002 Thousand Oaks, CA: Sage.

Page 31: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Selected examples

Websites for General Information & software

Centre for Multilevel Modeling, University of Bristol: http://www.mlwin.com/

For SAS users: http://gseweb.harvard.edu/~faculty/singer/

Page 32: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Selected examples

Websites for General Information & software (cont’d)

Muthen and Muthen, MPLUS http://www.statmodel.com/

For SPSS users: http://www.unc.edu/~painter/SPSSMixed/SPSSMixedModels.PDF

Page 33: Contextual Analysis: Applying the Statistical Concepts to Real Data Seminar III MODE-PTD Project July 2007 Patricia O’Campo Ph.D. St. Michael’s Hospital.

Contact Information

Patricia O’Campo, PhDSt. Michael’s Hospital & University of Toronto

E-mail: [email protected]