Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of...

43
Measurement for Improvement Melissa Chabran Associate, Networked Improvement Science Carnegie Foundation for the Advancement of Teaching Jon Norman Associate, Improvement Analytics & Measurement Carnegie Foundation for the Advancement of Teaching April 4, 2018

Transcript of Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of...

Page 1: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measurement for Improvement Melissa Chabran

Associate, Networked Improvement Science Carnegie Foundation for the Advancement of Teaching

Jon Norman

Associate, Improvement Analytics & Measurement Carnegie Foundation for the Advancement of Teaching

April 4, 2018

Page 2: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

2

Spread and Scale

Understand the Problem and the

System that Produces It

Test and Build Evidence

Generate Ideas for Change

Focus Collective Efforts

Improvement Science Cheat Sheet

Page 3: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

3

Spread and Scale

Understand the Problem and the

System that Produces It

Test and Build Evidence

Generate Ideas for Change

Focus Collective Efforts

Improvement Science Cheat Sheet

Page 4: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

4

Spread and Scale

Understand the Problem and the

System that Produces It

Test and Build Evidence

Generate Ideas for Change

Focus Collective Efforts

Improvement Science Cheat Sheet

Page 5: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Session Objectives

1. The Purpose of Measurement: Explore how measurement for improvement is different than measurement used for accountability or research.

2. System of Measures: Motivate the need for a “system of measures” for improvement that includes outcome, driver, process and balancing measures.

3. Case: Show how a measurement system supports improvement efforts.

5

Page 6: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

6 6

1997 2003

6

2014

Page 7: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

“We are increasingly realizing how critical measurement is for the improvement we seek, yet how counterproductive it can sometimes be to mix measurement for accountability with measurement for improvement. Considered one by one, measurement for each purpose can be good and very important. If done poorly, it can be bad. If the measurements are mixed together in inappropriate ways, they can indeed be harmful or destructive, with the mixed purposes interfering with one another.” Leif I. Solberg, MD

Lessons from Healthcare

7

Page 8: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

What is measured:

Needs to be closely aligned to the actual work and specific to the processes and outcomes you hope to change.

How & when it is measured:

Needs to be embedded in the daily workflow. Must produce data accessible in a timely manner.

Social processes

shaping use:

Requires transparency, trust, low stakes, and the safety to take risks. Language needs to be meaningful to those doing the work.

To provide usable information for improvement, we need to consider:

8

Page 9: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

9

Page 10: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

How often? Frequently as practice occurs Usually collected once a year (after the fact)

Typically once or twice per study (after the fact)

Testing your theory

Sequential tests No theory to test One large test

Sample size

“Just enough” data, small sequential samples

Obtain 100% of available, relevant data

“Just in case” data

10

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

Page 11: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

How often? Frequently as practice occurs Usually collected once a year (after the fact)

Typically once or twice per study (after the fact)

Testing your theory

Sequential tests No theory to test One large test

Sample size

“Just enough” data, small sequential samples

Obtain 100% of available, relevant data

“Just in case” data

Social Conditions of Use?

Data shared in a low-stakes, safe environment conducive

to change.

Publically available. Formal collection process to assure appearances of

neutrality and objectivity.

Meets scientific standards that are held in the field.

11

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

Page 12: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

How often? Frequently as practice occurs Usually collected once a year (after the fact)

Typically once or twice per study (after the fact)

Testing your theory

Sequential tests No theory to test One large test

Sample size

“Just enough” data, small sequential samples

Obtain 100% of available, relevant data

“Just in case” data

Social Conditions of Use?

Data shared in a low-stakes, safe environment conducive

to change.

Publically available. Formal collection process to assure appearances of

neutrality and objectivity.

Meets scientific standards that are held in the field.

Key limitation for improvement:

Does not illuminate WHY

the outcomes occur or what should be done to change

them

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

12

Page 13: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

How often? Frequently as practice occurs Usually collected once a year (after the fact)

Typically once or twice per study (after the fact)

Testing your theory

Sequential tests No theory to test One large test

Sample size “Just enough” data, small sequential samples

Obtain 100% of available, relevant data

“Just in case” data

Social Conditions of Use?

Data shared in a low-stakes, safe environment conducive

to change.

Publically available. Formal collection process to

assure appearances of neutrality and objectivity.

Meets scientific standards that are held in the field.

Key limitation for improvement:

IMPRACTICAL to

administer; not designed to inform changes in practice

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

13

Page 14: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

ASPECT IMPROVEMENT ACCOUNTABILITY RESEARCH

Why? Develop and evaluate changes in practice

Identify exemplary or problematic performers

(teachers, schools, districts)

Develop and test theories about the relationships

between conceptual variables

What? Outcomes and processes that are the object of change

End of the line outcomes Latent variables

How often? Frequently as practice occurs Usually collected once a year (after the fact)

Typically once or twice per study (after the fact)

Testing your theory

Sequential tests No theory to test One large test

Sample size “Just enough” data, small sequential samples

Obtain 100% of available, relevant data

“Just in case” data

Social Conditions of Use?

Data shared in a low-stakes, safe environment conducive

to change.

Publically available. Formal collection process to

assure appearances of neutrality and objectivity.

Meets scientific standards that are held in the field.

Measurement for improvement supports the

ongoing refinement of knowing what works for whom and under what

conditions

Lief Solberg, Gordon Mosser and Sharon McDonald Journal on Quality Improvement vol. 23, no. 3, (March 1997), 135-147.

14

Page 15: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

A System of Measures to Inform Improvement

Page 16: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Schools That Lead – Shue-Medill – Milford Central Academy

Dana Diesel Wallace

Sofi Frankowski

Nancy Carnvale

Michele Savage

16

Page 17: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

83% 80%

17% 20%

0%

20%

40%

60%

80%

100%

MCA Shue-Medill

Proportion of Graduates Retained in Grade 9 (2015)

Retained in Gr 9

Successful Gr 9

17

Page 18: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balancing Measures

PDSA = Learning Cycle Measure

O B

D1 D2

System of Feedback

PDSA 5

PDSA 6

PDSA 7

PDSA 8

P6 P5 P4

Visual adapted from Improvement Science Consulting

Daily/Weekly

Monthly

Yearly

Monthly

P3 P2 P1

PDSA 1

PDSA 2

PDSA 3

PDSA 4

18

Page 19: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

O B

System of Feedback

Visual adapted from Improvement Science Consulting

Monthly

Yearly

How is the system performing?

What is the result?

19

Page 20: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

20

83% 80%

17% 20%

0%

20%

40%

60%

80%

100%

MCA Shue-Medill

Proportion of Graduates Retained in Grade 9 (2015)

Retained in Gr 9

Successful Gr 9

Page 21: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B Monthly

Aim:

O: % of Graduates Repeating Grade 9

System of Feedback: STL-Delaware Case

21

Page 22: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B Monthly

Aim:

O (Lagging): % of Graduates Repeating Grade 9

O (Leading): % of Students on Watch Lists

System of Feedback: STL-Delaware Case

22

Page 23: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

10.1% 13.0% 12.2%

9.7%

8.3% 10.1%

0%

5%

10%

15%

20%

Q1 Q2 Q3 Q4

Percent Students on Math Watch List, 15-16 and 16-17

'15-'16

'16-'1711.2%

13.7% 12.9% 12.5% 12.1%

11.7%

0%

5%

10%

15%

20%

Q1 Q2 Q3 Q4

Percent Students on ELA Watch List, 15-16 and 16-17

'15-'16

'16-'17

Leading Outcome Measures: 4 Early Warning Indicators

3.4%

10.7% 10.3% 8.9%

3.0%

9.4%

0%

5%

10%

15%

20%

MP1 MP2 MP3 MP4

Percent Students on Attendance Watch List, 15-16 and 16-17

'15-'16

'16-'17

1.4% 3.4% 4.0%

1.1% 1.1%

1.3% 0%

5%

10%

15%

20%

MP1 MP2 MP3 MP4

Percent Students on Discipline Watch List, 15-16 and 16-17

'15-'16

'16-'17

23

Page 24: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

O B

D1 D2

System of Feedback

Visual adapted from Improvement Science Consulting

Monthly

Yearly

Monthly

Are we making progress on the intermediate

outcomes?

24

Page 25: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B Monthly

Aim:

O (Lagging): % of Graduates Repeating Grade 9

O (Leading): % of Students on Watch Lists

D2: Student Mindsets

System of Feedback: STL-Delaware Case

D1: Instructional Practices

25

Page 26: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Student Mindsets That Promote Resilience Being a "math person" or not is something about you that you really can't change. Some people are good at math and other people aren't.

15.4 34.3

84.6 65.7

0%

20%

40%

60%

80%

100%

Sept Jan

AgreeDisagree

Driver Measure: Growth Mindset Student Survey Responses in Class A,

September and January

26

Page 27: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

O B

D1 D2

System of Feedback

P6 P5 P4

Visual adapted from Improvement Science Consulting

Monthly

Yearly

Monthly

P3 P2 P1

Are the parts of your system performing

as planned?

27

Page 28: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B

System of Feedback: STL-Delaware Case

Monthly

Aim:

O (Lagging): % of Graduates Repeating Grade 9

O (Leading): % of Students on Watch Lists

D2: Student Mindsets D1: Instructional Practices

Retesting Process P3 P2

28

Page 29: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Process Measure: Percent of Targeted Students Revising Tests,

Brandy Cooper, Grade 6 Math

25%

9%

17%

0%

20% 16%

38%

57%

0%

20%

40%

60%

80%

T1 (16) T2 (11) T3 (23) T4 (8) T5 (10) T6 (19) T7 (13) T8 (21)

Perc

ent o

f Stu

dent

s

Tests (Number of Targeted Students)

Request to Retest

Median = 17%

29

Page 30: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Process Measure: Average Original and Revised Grades

for Students Revising Tests, Brandy Cooper, Grade 6 Math

68

60

67

92

54 58

79

73

92 96

90

77

40

50

60

70

80

90

100

Test 1 Test 2 Test 3 Test 5 Test 6 Test 7

Aver

age

Test

Sco

re Average

OriginalScore

AverageRevisedScore

30

Page 31: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

O B

D1 D2

System of Feedback

PDSA 5

PDSA 6

PDSA 7

PDSA 8

P6 P5 P4

Adapted from Improvement Science Consulting

Daily/Weekly

Monthly

Yearly

Monthly

P3 P2 P1

PDSA 1

PDSA 2

PDSA 3

PDSA 4

31

Page 32: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B

System of Feedback: STL-Delaware Case

Monthly

Aim:

O (Lagging): % of Graduates Repeating Grade 9

O (Leading): % of Students on Watch Lists

D2: Student Mindsets

Retesting Process P3 P2

PDSA 1

PDSA 2

PDSA 3

PDSA 4

1. Will students use a form to make revisions?

2. How can I get more students to use form?

# Forms picked up

Reasons why students don’t

use form

D1: Instructional Practices

32

Page 33: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

O B

D1 D2

System of Feedback

PDSA 5

PDSA 6

PDSA 7

PDSA 8

P6 P5 P4

Visual adapted from Improvement Science Consulting

Daily/Weekly

Monthly

Yearly

Monthly

P3 P2 P1

PDSA 1

PDSA 2

PDSA 3

PDSA 4

What’s happening to the parts of the system

we aren’t currently focused on?

33

Page 34: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

PDSA = Learning Cycle Measure

B

System of Feedback: STL-Delaware Case

Adapted from Schools That Lead

Monthly

Aim:

O (Lagging): % of Graduates Repeating Grade 9

O (Leading): % of Students on Watch Lists

D2: Student Mindsets

Retesting Process P3 P2

Quality of Work

D1: Instructional Practices

34

Page 35: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Outcome Measure(s)

Process Measures

Driver Measures

Balancing Measure(s)

Get movement here by

35

Page 36: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Quality of First Attempt

Primary Drivers

Secondary Drivers

Change Ideas

Student Behavioral Supports

Student Mindsets That

Promote Resilience

Others…

Revising Work & Assessments

Feedback on Student Work

Grading Policies

Request to Retest Form

Etc. Adult

Instructional Practices & Mindsets

AIM: By May 2017,

halve the proportion

(from 1 in 5 to 1 in 10) of

students who get held back in

9th grade

Etc.

Leading Outcome

Lagging Outcome

Driver Process

Balancing

36

Page 37: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

In the Future?

Quality of First Attempt

Primary Drivers

Secondary Drivers

Change Ideas

Student Behavioral Supports

Student Mindsets That

Promote Resilience

Others…

Revising Work & Assessments

Feedback on Student Work

Grading Policies

Request to Retest Form

Etc. Adult

Instructional Practices & Mindsets

AIM: By May 2017,

halve the proportion

(from 1 in 5 to 1 in 10) of

students who get held back in

9th grade

Etc.

Systematic collection Dec 2016

Started tracking early 2016

Common network measure.

First survey September 2015

Charter. Summer 2015

37

Page 38: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Activity

Page 39: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Activity: Developing a Set of Measures

Scenario: You are part of an improvement team at an elementary school. The students at your school are struggling with fractions. You launch an improvement project with your school team to tackle this problem.

Goal: The goal of the project is to increase students’ understanding of fractions.

Your Task: Develop a system of measures for this project: – Outcome Measures: 1-2 measures – Process or Driver Measures: 2-3 measures – Balancing Measures: 1 measure

39

Page 40: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

O B

D1 D2

FRACTIONS MEASUREMENT SYSTEM

P6 P5 P4

Visual adapted from Improvement Science Consulting

Daily/Weekly

Monthly

Yearly

Monthly

P3 P2 P1

Prof Dev for Ts Instructional Materials

Students’ Understanding of

Fractions

Interactions w Math Coach Use of Math Manipulatives

40

Page 41: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Measure Types O = Outcome Measures D = Driver Measures P = Process Measures B = Balance Measures

O B

D1 D2

FRACTIONS MEASUREMENT SYSTEM

P6 P5 P4

Visual adapted from Improvement Science Consulting

Daily/Weekly

Monthly

Yearly

Monthly

P3 P2 P1

Prof Dev for Ts Instructional Materials

Students’ Understanding of

Fractions

Interactions w Math Coach Use of Math Manipulatives

M1 M2

M6

M4

M3

M5

41

Page 42: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.

Key Takeaways

Data help us understand our systems and whether changes we make lead to improvement

Measurement for improvement is different than measurement for accountability and research

A family or system of measures, which includes outcome, driver, process, and balancing measures is needed to guide improvement efforts

42

Page 43: Measurement for Improvementsummit.carnegiefoundation.org/session_materials/C1...“system of measures” for improvement that includes outcome, driver, process and balancing measures.