1. Statistical power in educational settings Workshop at Wellcome seminar on educational research,...

Statistical power in educational settings

Workshop at Wellcome seminar on educational research, May 2008

Dylan Wiliam

Institute of Education, University of London

www.dylanwiliam.net

The argument…Premise 1 Learning is insensitive to instruction Measures of learning even more so So even small system-wide gains in learning are educationally important

Premise 2 Education systems are inherently multi-levelled Taking account of clustering in data lowers statistical power Educational experiments are inherently weak

Conclusion RCTs in education frequently need to be very large, and therefore expensive

Learning is slow…

6 7 8 9 10 11 12

Age (years)

Facility

Source: Leverhulme Numeracy Research Programme

860+570=?

…especially for deep learning…Achievement in decimals by age

1 2 3 4 5 6

Level achieved

Proportion

Age 12

Age 13

Age 14

Age 15

Hart, 1981

…and measures are insensitive…Annual growth in school attainment (STEP)

5 6 7 8 9 10 11 12 13 14 15

Annual increase (standard deviations)

ReadingWritingListeningSoc. Stud.ScienceMath

Sequential tests of educational progress (ETS, 1957)

…and measures are insensitive…Annual growth in school attainment (STEP)

5 6 7 8 9 10 11 12 13 14 15

Annual increase (standard deviations)

ReadingWritingListeningSoc. Stud.ScienceMath

NAEPTIMSS

…so small gains in learning are worthwhileAverage rate of progress of cohorts is 0.3 standard deviations per year

Average cost of one year’s education for a cohort in England is £3bn

An effect size of 0.05 sd might be regarded as “small”

But system-wide, is worth £6bn

…but hard to detect…Statistical power: The likelihood that a statistical test will reject a false null hypothesisDepends on

The level set for statistical significance The size of the difference between compared groups (effect size) The sensitivity of the measures

Clustering reduces statistical power, but is an inherent feature of educational settings, and especially for school-wide interventionsTeacher qualityAbility grouping

…especially in educational settings

(Konstantopoulos,2006)

p = #studentsn = #classrooms = effect sizec= classroom clustering

s= school clustering

So…The most important question is not “Are RCTs good?” but “When are RCTs good?”

How should we answer?

Institute of Education Sciences (USA)Five goals

1. identify existing programs, practices, and policies that may have an impact on student outcomes and the factors that may mediate or moderate the effects of these programs, practices, and policies;

2. develop programs, practices, and policies that are theoretically and empirically based;

3. evaluate the efficacy of fully developed programs, practices, and policies;

4. evaluate the impact of programs, practices, and policies implemented at scale;

5. develop and/or validate data and measurement systems and tools.

1. Statistical power in educational settings Workshop at Wellcome seminar on educational research,...

Documents

Transcript of 1. Statistical power in educational settings Workshop at Wellcome seminar on educational research,...

Wellcome News 68

Dylan Wiliam October 2013 - Taunton Teaching Alliance · Dylan Wiliam is emeritus professor of educational assessment at the ... as Denis Lawton wrote: ... she proposed a seven-step

Wellcome NEWS

curriculum vitae WILIAM ABIEZER CRUZ NARVAEZ.docx

wiliam chittick on psychiatry

PowerPoint Presentation - Lexicon Reading Center …...“When implemented well, formative assessment can double the speed of students’ learning ” Dylan Wiliam Educational Leadership

Acknowledgements - Wellcome

Wellcome History 47

Salzburg Seminar Talk 2014 alt - Dylan Wiliam Center · The!educational!achievementof!acountry’s!population!is!akey!determinantof!its! ... factthatthe!class!sizes!in!private ...

Black & Wiliam (2009) Proof

The - Wellcome

Wiliam Sharpe

Playing By The Rules Wiliam

Chimney Sweeper by Wiliam Blake

Sustaining formative assessment with teacher learning ... Wiliam Presentation Slides.pdf · assessment with teacher learning communities Dylan Wiliam ... The formative assessment

Contextual Considerations: Revision of the Wiliam and ...

How can assessment support learning? Keynote address to Network Connections Pittsburgh, PA; February 9th, 2006 Dylan Wiliam, Educational Testing Service.

Wellcome News 66

Wellcome History 46

Computer Organisation ansd Archtecture - Wiliam