Issues in the Implementation of Longitudinal Growth Models for Student Achievement Joseph Stevens...

Issues in the Implementation of Longitudinal Growth Models for Student AchievementJoseph StevensUniversity of Oregon

Keith ZvochUniversity of Nevada-Las VegasNevada Center for Evaluation and Assessment

Presented at the Conference on Longitudinal Modeling of Student Achievement, University of Maryland, College Park, November, 2005.

© Stevens, 2005

Contact Information:

170 College of Education

5267 University of Oregon

Eugene, OR 97403

(541) 346-2445

[email protected]

Presentation available at:

http://www.uoregon.edu/~stevensj/issues.ppt

mailto:[email protected]


No Child Left Behind NCLB requires the use of a cross-sectional, case

study design for the study of achievement and school effectiveness (Stevens, 2005)

How best to model student achievement and regularities in achievement by teacher or school?

Increasing interest in and development of longitudinal methods and modeling over several decades

Purposes Draw attention to certain research design issues in

the study of school effectiveness

Review things we know about the analysis of change

Describe issues in the design and use of longitudinal models and some empirical results

Discuss issues in the potential use of longitudinal modeling for accountability purposes

Design Issues

Status versus growth (Stevens, 2000)

State mandated test data on TerraNova Survey Plus for all middle schools in six NM School Districts from 1999-2001

36 Middle Schools; 5,544 students tested in grades 6, 7, and 8

Growth

18161412108

Sta

tus

690

680

670

660

650

640

630

620

Analysis of Change “Investigators who ask questions regarding gain

scores would ordinarily be better advised to frame their questions in other ways.” Cronbach & Furby (1970)

“Problems in measuring change abound and the virtues in doing so are hard to find.” Linn & Slinde (1977)

Longitudinal research has been recognized as the “sine qua non of evaluation in nonexperimental settings” (Marco, 1974)

The Analysis of Change Height example Cross sectional comparisons do not measure

change effectively/accurately Cross sectional comparisons of cohorts

produce different results than analysis of change

Common to confuse longitudinal research hypotheses and language with cross sectional designs and results

Grade

876

19

99

-00

Ma

th A

chie

vem

en

t

680

670

660

650

640

Grade

876

Ma

th A

chie

vem

en

t

680

670

660

650

640

Cohort

Longitudinal

Grade

876

Re

ad

ing

Ach

ieve

me

nt

680

670

660

650

640

Cohort

Longitudinal

Analysis of Change Change versus variability (Nesselroade,

1991) Sources of individual differences (Cattell, 1966):

Stable individual differences Intraindividual variability - short-term, relatively

reversible change or fluctuation Intraindividual changes - long-term, relatively not

reversible

Interindividual differences in intraindividual growth

Student Growth Trajectories

Analysis of Change

What covariates are systematically related to interindividual differences in growth?

Aggregated Growth Teachers Schools Districts States

School Growth Trajectories

Design Issues

What are some design issues arising from the analysis of change literature?

Measurement Occasions Initial Starting Point Choice of initial starting point impacts model

results

Time

4321

Ach

ieve

me

nt

10.0

9.0

8.0

7.0

6.0

5.0

4.0

3.0

2.0

1.0

0.0

Time

4321

Ach

ieve

me

nt

10

9

8

7

6

5

4

3

2

1

0

Design Issues

Number of Measurement Occasions Pre-post, two wave studies most common More is better

“Two waves of data are better than one, but maybe not much better” (Rogosa, 1995)

Shape of growth function Reliability of estimation

Time

41

Ach

ieve

me

nt

10

9

8

7

6

5

4

3

2

1

0

Two Waves

Time

42.51

Ach

ieve

me

nt

10

9

8

7

6

5

4

3

2

1

0

Three Waves

Time

4321

Ach

ieve

me

nt

10

9

8

7

6

5

4

3

2

1

0

Four Waves

Design Issues

Lag between treatment and outcome Treatment effects take time to realize The more distal the outcome measurement from

the treatment impact, the less the effect

Design Issues

Size or Length of Measurement Intervals Use shortest interval possible until temporal effect

established Intervals too short, cost, reactivity to

measurement, too little treatment effect Intervals too long, attrition, distal treatment

effects, inability to model growth function accurately

0

2

4

6

8

10

1 7

0

2

4

6

8

10

1 2 3 4 5 6 7

Design Issues Attrition

Incomplete data How does attrition bias results?

Mobility At the student level, accountability for some,

opportunity to learn At the school level, accountability for all,

opportunity to teach Cohort Stability

Effects of AttritionZvoch & Stevens, 2005

An investigation of sample exclusion and student attrition effects on the longitudinal study of middle school mathematics performance

Mathematics achievement on state mandated TerraNova

24 middle schools Analytic Method: 2 and 3-level HLM growth

models

Effects of Attrition

One longitudinal cohort: Two samples Complete cohort sample: All students who

participated in accountability testing in 1998-99 (N = 6,098)

District accountability sample: A subset of the complete cohort sample – same school for three years, complete test data, no modified test administrations (N = 3,334)

Student Demographic Characteristics by Analytic Sample

_________________________________________________________________

Accountability Sample Complete Cohort Sample

(N = 3,334) (N = 6,098)

_________________________________________________________________

Student Characteristic Frequency Percent Frequency Percent

_________________________________________________________________

Female 1,710 51.3 3,016 49.5

Non-Anglo 1,797 53.9 3,536 58.0

English Language Learner 397 11.9 1,121 18.4

Free Lunch Recipient 1,170 35.1 2,628 43.1

Special Education 101 3.0 1,092 17.9

__________________________________________________________________

Note. Chi-square tests comparing the accountability sample (N = 3,334) with the group of students lost from the complete cohort sample (N = 2,764) on student background characteristics revealed that males and students in special populations were statistically over-represented in the group of excluded students (p < .01 in each comparison).

Two-Level Conditional Model Relating Individual Characteristics to Mathematics Achievement by Sample

__________________________________________________________________________ Accountability Sample Complete Cohort Sample (N = 3,334) (N = 6,098)___________________________________________________________________________

Fixed Effect Estimate SE t Estimate SE t___________________________________________________________________________Mean Achievement, β00 661.68 0.55 1200.18*** 648.60 0.44 1472.05***Biological Sex, β01 -0.11 1.10 -0.10 0.90 0.88 1.02Minority Status, β02 -12.42 1.25 -9.79*** -12.16 1.00 -12.09***Free Lunch Status, β03 -13.60 1.27 -10.75*** -14.21 0.98 -14.44***ELL Status, β04 -22.12 1.70 -12.99*** -21.78 1.27 -17.13***SPED Status, β05 -21.85 3.66 -5.97*** -34.97 1.31 -26.62***___________________________________________________________________________Mean Growth, β10 18.97 0.24 77.85*** 17.54 0.22 80.14***Biological Sex, β11 -2.96 0.51 -5.82*** -2.75 0.45 -6.16***Minority Status, β12 -0.78 0.59 -1.31 -2.12 0.52 -4.06***Free Lunch Status, β13 -0.85 0.61 -1.37 -1.69 0.53 -3.22**ELL Status, β14 -0.18 0.88 -0.20 -0.01 0.68 -0.01SPED Status, β15 1.10 1.51 0.73 -2.81 0.71 -

3.98***Initial Achievement Status, β16 0.06 0.01 4.42*** 0.02 0.01 2.25*___________________________________________________________________________*** p < .001; **p < .01; * p < .05

Three-Level Unconditional Model for Mathematics Achievement by Sample ______________________________________________________________________________

Accountability Sample Complete Cohort Sample

(N = 3,334) (N = 5,168) ______________________________________________________________________________

Fixed Effect Estimate SE t Estimate SE t______________________________________________________________________________ School Mean Achievement, γ000 659.19 2.97 222.29*** 648.96 3.09 209.84***School Mean Growth, γ100 18.44 0.92 20.12*** 17.64 0.87 20.16***______________________________________________________________________________

Variance VarianceRandom Effect Component df 2 Component df 2______________________________________________________________________________ Individual Achievement, r0ij 766.01 3310 8978.37*** 1132.02 4548 22708.86***Individual Growth, r1ij 27.98 3310 3858.62*** 44.04 4548 5736.01*** Level-1 Error, etij 313.12 361.59School Mean Achievement, u00j 202.73 23 704.61*** 222.42 23 855.44*** School Mean Growth, u10j 18.68 23 354.14*** 17.01 23 347.30***

Percentage of Variation Between Schools Achievement Status, π0ij 20.7 16.4Achievement Growth, π1ij 40.0 27.9______________________________________________________________________________Note. In this analysis, students who transferred schools within the district were dropped from the complete

cohort sample as these students could not be uniquely assigned to one school location (N = 930). *** p < .001

Effects of Attrition

Were cross-sample changes in school performance associated with the percentage of students from special populations excluded from the district accountability sample?

Figure 1. Cross-Sample School Achievement Mean Change in Mathematics as a Function of the Proportion of Students from Special Populations Excluded from the Accountability Sample

Figure 2. Cross-Sample School Growth Rate Change in Mathematics as a Function of the Proportion of Students

from Special Populations Excluded from the Accountability Sample

Conclusions Mathematics performance estimates differed

across two sample conditions District and school achievement higher and

student performance more similar in the restricted sample

Cross-sample school changes in student achievement closely related to the proportion of students from special student populations excluded

Cohort StabilityZvoch & Stevens, in press

An investigation of stability of cohorts from one year to the next

Mean achievement status and growth of students across cohorts

Changes in the achievement status and growth of students between student cohorts

Predictors of school achievement outcomes

Cohort Data StructureYear

Grade 99-00 00-01 01-02 02-03 03-04

3 C1 C2 C3

4 C1 C2 C3

5 C1 C2 C3

Note. Cohort 1 (N = 3,325), Cohort 2 (N = 3,347), Cohort 3 (N = 3,322); School N = 79

School Performance Indices Accountability Model

Outcome Focus

Across or Within Cohorts (Current

Performance)

Between Cohorts

(Improvement Over Time)

Status School Mean Achievement,

Percent Proficient

School Mean Achievement/

Proficiency Change

Growth School Mean Growth

School Mean Growth Change

Figure 1. Cross-cohort relationship between school mean achievement and school

mean growth in mathematics

High Mean Low Growth

High Mean High Growth

Low Mean Low Growth

Low Mean High Growth

Cohort Stability

Do the cross-cohort estimates of school performance vary with schools’ social context?

Figure 2. Cross-cohort school mean growth in mathematics as a function of the

percentage of free lunch recipients

0.00 0.25 0.50 0.75 1.00

Percentage of Free Lunch Recipients

590

600

610

620

630

640

650

Figure 3. Cross-cohort school mean achievement in mathematics as a function of the

percentage of free lunch recipients

0.00 0.25 0.50 0.75 1.00

Percentage of Free Lunch Recipients

5

10

15

20

25

Cohort Stability

To what degree do estimates of the mean achievement status and achievement growth of schools change with each successive student cohort?

580

590

600

610

620

630

640

650

660

cohort 1 cohort 2 cohort 3

Student Cohort

Scho

ol M

ean

Ach

ieve

men

t

Figure 4. School mean achievement in mathematics as a function of student cohort

0

5

10

15

20

25

30

cohort 1 cohort 2 cohort 3

Student Cohort

Scho

ol M

ean

Gro

wth

Figure 6. School mean growth in mathematics as a function of student cohort

Cohort Stability

Do the between-cohort estimates of school improvement covary with cohort enrollment size?

Figure 7. Relationship between cohort-to-cohort changes in school mean growth and cohort enrollment size by school

Conclusions Cross-cohort performance differed by

outcome: mean achievement status or growth Cohort-to-cohort changes in student

performance also varied by outcome: change in school mean achievement or change in school mean growth

Across cohorts, schools’ social context was associated only with student achievement levels, not achievement growth

Changes in school performance were closely related to cohort enrollment size

Alternative Statistical Models for Longitudinal Analysis

Difference or Gain scores Residuals Growth Curve Models Latent Growth Curves Mixture Models Autoregressive Models

Alternative Statistical Models

Strengths and weaknesses of each model Results differ based on model used Models answer different questions Model complexity versus transparency

Measurement Issues

Carryover Effects (Learning, Sensitization, etc.) Need for Parallel Test forms

Measurement Issues: Scaling and Equating Do scales change over time?

Structure/dimensionality Units Standardization or scaling may mask change

Equating Vertical equating, usually one point in time across

grades/cohorts Need for longitudinal equating What spans can reasonably be equated? What content can reasonably be equated?

Measurement Issues: Reliability Regression to the mean in two wave studies Reliability of Difference Scores Measured Variables versus Latent Variables

Reliability Generalization Over time Interactions of time and other characteristics

Measurement Issues: Validity

Construct Equivalence over time Temporal Invariance

Measurement invariance Structural invariance

LANG

Spell

u3

.60

LangMech

u2

.76

Lang

u1

.85

READ

Vocab

u5

.82

Read

u4

.86

MATH

MathComp

u7

Math

u6

CONTENT

SocialStudies

u9

Science

u8

.83.81.91 .79

.89

.88 .98

.92.83 .87

Measurement Issues

Measurement instruments that are designed to measure cognitive growth (Collins, 1991)

True Developmental Scales


Messick (1989): Validity and consequences of alternative approaches

Pattern Matching (Shadish, Cook, & Campbell, 2002)

Riechardt (2000): study of plausible threats to validity of treatment effects

Measurement Issues: Validity Stevens (2005)

Three level HLM curvilinear growth models applied to state data.

23,469 sixth grade children took the state mandated TerraNova in 1999-00

Study includes the 23,296 sixth graders (99.3%) who took the mathematics subtest

These students were matched longitudinally to 7th, 8th, and 9th grade records for the years 2001, 2002, and 2003

Sample included only those students who were in their middle school for 2 or 3 years (17,596; 75.5% of students).

Schools with less than 5 students were also excluded (13 schools with a total of 24 students), resulting in an analytic sample of 242 schools (94% of schools) with 17,572 students.

This sample differs from the population in having about 1% more White and Hispanic, 1% fewer Native American, 1% fewer LEP and Special Education, and 2% fewer bilingual students.

Mathematics Achievement Predicted by Individual Characteristics _______________________________________________________________

Fixed Effect Coefficient SE t df _ p___ School Mean Achievement, γ000 663.54 1.28 513.86 241 < .001 White Student, γ010 14.62 0.77 18.88 241 < .001 LEP, γ020 -16.00 1.19 -13.50 241 < .001 Title 1 Student, γ030 -11.10 1.44 -7.71 241 < .001 Special Education, γ040 -33.09 1.88 -17.62 241 < .001 Modified Test, γ050 -16.83 2.63 -6.40 241 < .001 Free Lunch Student, γ060 -7.75 1.13 -6.85 241 < .001 Gender, γ070 -1.21 0.59 -2.03 241 .042

School Linear Growth, γ100 19.40 0.70 27.88 241 < .001 White Student, γ110 -1.20 0.64 -1.86 241 .062 LEP, γ120 0.70 1.13 0.60 241 .547 Title 1 Student, γ130 -2.58 0.95 -2.72 241 .007 Special Education, γ140 -2.16 1.67 -1.29 241 .196 Modified Test, γ150 -2.43 2.47 -0.99 241 .325 Free Lunch Student, γ160 -0.75 1.03 -0.73 241 .466 Gender, γ170 -4.68 0.59 -7.98 241 < .001_______________________________________________________________

Mathematics Achievement Predicted by Individual Characteristics (continued)______________________________________________________________Fixed Effect Coefficient SE t df p______________________________________________________________School Curvilinear Growth, γ200 -2.09 0.21 -9.78241 < .001 White Student, γ210 0.48 0.20 2.35 241 .019 LEP, γ220 -0.10 0.36 -0.27 241 .790 Title 1 Student, γ230 0.61 0.28 2.17 241 .030 Special Education, γ240 0.61 0.50 1.22 241 .224 Modified Test, γ250 -0.10 0.75 -0.14 241 .890 Free Lunch Student, γ260 0.26 0.33 0.79 241 .427 Gender, γ270 1.05 0.19 5.64 241 < .001______________________________________________________________School Level Level-1 Level-2 VarianceVariance Component Explained ______________________________________________________________

Mean Achievement, u00 242.78 184.89 23.8%Linear Growth, u10 41.46 30.68 26.0%Curvilinear Growth, u10 2.94 2.60 11.6%______________________________________________________________

Pattern matching: relationships between alternative measures of school effectiveness and confounding variables NCLB Proficiency (percent proficient or above using state determined cutpoint)

State rating of schools (weighted combination of proficiency score, attendance, dropout rates)

HLM Empirical Bayes (EB) intercept estimates

HLM EB slope estimates

r2 = .38, p < .001 r2 = .44, p < .001

School Rating

4.54.03.53.02.52.01.51.0.5

Pe

rce

nt

of

Stu

de

nts

Re

ceiv

ing

Fre

e L

un

ch

1.0

.8

.6

.4

.2

0.0

NCLB Proficiency

100806040200

Pe

rce

nt

of

Stu

de

nts

Re

ceiv

ing

Fre

e L

un

ch

1.0

.8

.6

.4

.2

0.0

r2 = .26, p < .001 r2 = .06, p < .001

EB Intercept Estimates

700680660640620600

Pe

rce

nt

of

Stu

de

nts

Re

ceiv

ing

Fre

e L

un

ch

1.0

.8

.6

.4

.2

0.0

EB Slope Estimates

3020100

Pe

rce

nt

of

Stu

de

nts

Re

ceiv

ing

Fre

e L

un

ch

1.0

.8

.6

.4

.2

0.0


Pattern Matching: Relation to schooling effects If schooling policies and practice impact student

learning they should emerge as correlates of growth

Growth measures are more sensitive to the effects of schooling than status measures (Bryk & Raudenbush, 1989)

Zvoch & Stevens (2005)

Study Purpose: To examine correlates of status and growth in mathematics achievement over a three year period.

Individual math achievement scores on the TerraNova were used from a longitudinal sample of middle school students in the sixth grade in 1998-99, seventh grade in 1999-00, and eighth grade in 2000-01

Study conducted in one urban school district in NM: 24 middle schools, over 20,000 students; 51% female, 49% male 47% Hispanic, 44% Anglo, 3% Native American, 3%

African American, 2% Asian, and 1% Other 17% of students were classified as ELL 17% special education 40% of students receive a free or reduced price lunch

Percent Free-Lunch (M = .49, SD = .28)

Mean Educational Level of Mathematics Staff (M = 17.61, SD = .58)

Mathematics Curricula (0 = Traditional Program, 1 = NSF Reform Curricula, 9 of the 24 middle schools (38%)

Pattern of results differed depending on whether status scores or growth scores were examined

Fixed Effect Coefficient SE t _________________________________________________________________ School Mean Achievement, γ000 650.18 1.51 429.77***

Percent Free Lunch, γ001 -0.22 0.05 -4.70***

Math Teacher Education, γ002 0.93 2.00 0.470.47

Math Curricula, γ003 -0.32 2.55 -0.13-0.13 School Mean Growth, γ100 18.96 0.83 22.75***

Percent Free Lunch, γ101 -0.02 0.02 -0.92-0.92 Math Teacher Education, γ102 3.29 1.08 3.06**

Math Curricula, γ103 -3.00 1.40 -2.14*

_________________________________________________________________Zvoch, K., & Stevens, J. J. (in press). Longitudinal effects of school

context and practice on mathematics achievement. Journal of Educational Research.

Using Longitudinal Models for Accountability SystemsDesign: Number of measurement occasions

Single occasion case studies Annual measurement interval, size of interval

Initial starting point, prior achievement

Some states cannot track students over time

Using Longitudinal Models for Accountability SystemsMeasurement: Need for new assessments designed to

measure cognitive growth Assessments sometimes vertically equated,

seldom longitudinally equated; Need for true developmental scales,

measurement invariance over time Construct equivalence over time

Using Longitudinal Models for Accountability Systems Attrition, Mobility, Cohort Effects

Need for further study How are school estimates biased? Can statistical adjustments be used?

School size, disaggregated group sizes and stability or bias

Summary and Conclusions Different designs, measures, and methods of

analysis are likely to provide different evaluations of student growth and school effectiveness

However, despite difficulties of longitudinal modeling, cross-sectional designs can not address fundamental issues of growth, change, and learning

Need for the inclusion of policy and practice variables

Importance of empirically validating assessment instruments and accountability systems

Bibliography

Bryk, A. S., & Raudenbush, S. W. (1987). Application of hierarchical linear models to assessing change. Psychological Bulletin, 101, 147-158.

Cattell, R. B. (1966). Patterns of change: Measurement in relation to state dimension, trait change, lability and process concepts. In R. B. Cattell (Ed.), Handbook of multivariate experimental psychology. Chicago: Rand McNally.

Collins, L. (1991). Measurement in longitudinal research. In L. M. Collins & J. L. Horn (Eds.), Best methods for the analysis of change: Recent advances, unanswered questions, future directions. Washington, DC: American Psychological Association.

Collins, L. M., & Horn, J. L. (1991). Best methods for the analysis of change: Recent advances, unanswered questions, future directions. Washington, DC: American Psychological Association.

Collins, L. M., & Sayer, A. (2001). New methods for the analysis of change. Washington, DC: American Psychological Association.

Cronbach, L. J., & Furby, L. (1970). How we should measure change—or should we? Psychological Bulletin, 74, 68-80.

CTB/McGraw-Hill (1997). TerraNova Technical Bulletin 1. Monterey, CA: Author.

Duncan, S. C., & Duncan, T. E. (1995). Modeling the processes of development via latent variable growth curve methodology. Structural Equation

Modeling: A Multidisciplinary Journal, 2(3), 187-213.

Ferrer, E., Hamagami, F., & McArdle, J. J. (2004). Teacher’s corner: Modeling latent growth curves with incomplete data using different types of structural equation modeling and multilevel software. Structural Equation Modeling: A Multidisciplinary Journal, 11(3), 452-483.

Linn, R.L., & Haug, C. (2002). Stability of school-building accountability scores and gains. Educational Evaluation and Policy Analysis, 24 (1), 29-36.

Linn, R. L., & Slinde, J. A. (1977). The determination of the significance of change between pre- and posttesting periods. Review of Educational Research, 47(1), 121-150.

Marco, G. L. (1974). A comparison of selected school effectiveness measures based on longitudinal data. Journal of Educational Measurement, 11, 225-234.

Meredith, W. & Horn, J. (2001). The role of factorial invariance in modeling growth and change. In L. M. Collins & A. Sayer (Eds.), New methods for the analysis of change. Washington, DC: American Psychological Association.

Messick, S. (1989). Validity. In R.L. Linn (Ed.), Educational Measurement (3rd Ed., pp. 13-103). New York: MacMillan.

Messick, S. (1994). The interplay of evidence and consequences in the validation of performance assessments. Educational Researcher, 23, 13-23.

Nesselroade, J. R. (1991). Interindividual differences in intraindividual change. In L. M. Collins & J. L. Horn (Eds.), Best methods for the analysis of change: Recent advances, unanswered questions, future directions. Washington, DC: American Psychological Association.

Raudenbush, S. W. (2001). Comparing personal trajectories and drawing causal inferences from longitudinal data. Annual Review of Psychology, 52, 501-525.

Raudenbush, S.W., Bryk, A.S., Cheong, Y.F., & Congdon, R.T. (2001). HLM 5: Hierarchical linear and nonlinear modeling. Chicago: Scientific Software International.

Raudenbush, S. W., & Willms, J. D. (1995). The estimation of school effects, Journal of Educational and Behavioral Statistics, 20 (4), pp. 307-35.

Reichardt, C. S. (2000). A typology of strategies for ruling out threats to validity. In L. Bickman (Ed.), Research design: Donald Campbell’s legacy (Vol. 2, pp. 89-115). Thousand Oaks, CA: Sage.

Rogosa, D. (1995). Myths about longitudinal research. In J.M. Gottman (Ed.). The analysis of change. Mahwah, NJ: Erlbaum.

Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and quasi-experimental designs for generalized causal inference. Boston, MA: Houghton Mifflin Company.

Stevens, J. J. (2000). Educational Accountability Systems: Issues and Recommendations for New Mexico. Research report prepared for the New Mexico State Department of Education.

Stevens, J. J. (2005). The study of school effectiveness as a problem in research design. In R. Lissitz (Ed.), Value-added models in education: Theory and applications. Maple Grove, MN: JAM Press.

Stevens, J. J., Estrada, S., & Parkes, J. (2000). Measurement issues in the design of state accountability systems. Paper presented at the annual meeting of the AERA, New Orleans, LA.

Stone, C.A., & Lane, S. (2003). Consequences of a state accountability program: Examining relationships between school performance gains and teacher, student, and school variables. Applied Measurement in Education, 16, 1-26.

Teddlie, C., Reynolds, D., & Sammons, P. (2000). The methodology and scientific properties of school effectiveness research. In C. Teddlie, & D. Reynolds (Eds.), The International handbook of school effectiveness research. New York: FalmerPress.

Willett, J. B., & Sayer, A. G. (1994). Using covariance structure analysis to detect correlates and predictors of individual change over time, Psychological Bulletin, 116(2), 363-381.

Willett, J. B., Singer, J. D., & Martin, N. C. (1998). The design and analysis of longitudinal studies of development and psychopathology in context: Statistical models and methodological recommendations, Development and Psychopathology, 10, 395-426.

Zvoch, K., & Stevens, J. J. (in press). Successive Student Cohorts and Longitudinal Growth Models: An Investigation of Elementary School Mathematics Performance. Educational Policy Analysis Archives.

Zvoch, K., & Stevens, J. J. (in press). Longitudinal effects of school context and practice on mathematics achievement. Journal of Educational Research.

Zvoch, K., & Stevens, J. J. (2003). A multilevel, longitudinal analysis of middle school math and language achievement. Educational Policy Analysis Archives, 11 (20). (Available at: http://epaa.asu.edu/epaa/v11n20/).

Zvoch, K., & Stevens, J. J. (2005). Sample exclusion and student attrition effects in the longitudinal study of middle school mathematics performance. Educational Assessment, 10(2), 105-123.

Issues in the Implementation of Longitudinal Growth Models for Student AchievementJoseph Stevens & Keith Zvoch

170 College of Education, 5267 University of Oregon

Eugene, OR 97403, (541) 346-2445

[email protected]

Presentation available at:


Issues in the Implementation of Longitudinal Growth Models for Student Achievement Joseph Stevens...

Documents

Transcript of Issues in the Implementation of Longitudinal Growth Models for Student Achievement Joseph Stevens...