Why Scale -- 1

• Summarising data
  – Allows description of developing competence
• Construct validation
  – Dealing with many items
    • rotated test forms
  – Check how reasonable it is to summarise data (through sums, or weighted sums)

What do we want to achieve in our measurement?

Locate students on a line of developing proficiency that describes what they know and can do.

So, we need to make sure that
• Our measures are accurate (reliability);
• Our measures are indeed tapping into the skills we set out to measure (validity);
• Our measures are “invariant” even if different tests are used.

Properties of an Ideal Approach

• Scores we obtain are meaningful.
  – [Figure: students Ann, Bill and Cath] What can each of these students do?
• Scores are independent of the sample of items used.
  – If a different set of items is used, we will get the same results.

Using Raw Scores?

• Can raw scores provide the properties of an ideal measurement?

• Differences between scores are not easily interpretable.

• Difficult to link item scores to person scores.

Equating raw scores - 2

[Figure: axis "Score on the easy test", from 0 to 100%]

Link Raw Scores on Items and Persons

[Figure: task difficulties (single digit addition, multi-step arithmetic, word problems, arithmetic with vulgar fractions) shown alongside object scores]

Item Response Theory (IRT)

• Item response theory helps us address the shortcomings of raw scores
  – If item response data fit an IRT (Rasch) model, measurement is at its most powerful level.
    • Person abilities and item difficulties are calibrated on the same scale.
    • Meanings can be constructed to describe scores.
    • Student scores are independent of the particular set of items in the test.
  – IRT provides tools to assess the extent to which good measurement properties are achieved.

• IRT models give the probability of success of a person on items.

• IRT models are not deterministic, but probabilistic.

• Given the item difficulty and person ability, one can compute the probability of success for each person on each item.
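The last bullet can be sketched in a few lines of Python. This is only an illustration, assuming the dichotomous Rasch model with abilities and difficulties expressed in logits. The slides name the model but do not write it out, and the function name `rasch_probability` and all numeric values are hypothetical.

```python
import math

def rasch_probability(ability, difficulty):
    """Probability of success under the (assumed) dichotomous Rasch model.

    Both arguments are on the same logit scale.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

# Hypothetical abilities and item difficulties, for illustration only.
abilities = {"Ann": 1.0, "Bill": 0.0, "Cath": -1.0}
difficulties = {"item 1": -0.5, "item 2": 0.5}

# One probability for each person on each item.
for name, theta in abilities.items():
    for item, delta in difficulties.items():
        print(f"{name} on {item}: {rasch_probability(theta, delta):.2f}")
```

The model never says a student will or will not succeed; it only gives a chance of success, which is what makes it probabilistic rather than deterministic.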

Building a Model

[Figure: probability of success plotted against achievement, from very low to very high, for a middle difficulty task]

Item Characteristic Curve

Item Difficulty -- 1

[Figure: item characteristic curve; horizontal axis from -4 to 4]

Variation in item difficulty

[Figure: item characteristic curves labelled 1, 2 and 3; horizontal axis from -4 to 4]
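The curves in these slides can be approximated numerically. The sketch below assumes a Rasch item characteristic curve and three hypothetical item difficulties (-1, 0 and 1 logits). The actual difficulties of items 1, 2 and 3 are not given in the slides.

```python
import math

def icc(ability, difficulty):
    """Item characteristic curve under the (assumed) Rasch model."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

# Hypothetical difficulties for three items, in logits.
item_difficulties = {1: -1.0, 2: 0.0, 3: 1.0}

# Tabulate each curve over the same axis range as the slides (-4 to 4).
print("ability " + " ".join(f"item{i:>2}" for i in item_difficulties))
for theta in range(-4, 5):
    row = " ".join(f"{icc(theta, d):6.2f}" for d in item_difficulties.values())
    print(f"{theta:>7} {row}")
```

Under this assumption, changing an item's difficulty shifts its curve along the axis without changing its shape, and each curve passes through 0.5 where ability equals the item's difficulty.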

Estimating Student Ability


 3           |           |
             |           |
            X|           |
            X|           |
           XX|           |
 2         XX|           |9 22
          XXX|           |
          XXX|           |6 16
        XXXXX|           |8 11 27 29
 1      XXXXX|           |
      XXXXXXX|*          |31
      XXXXXXX|*          |2 30
    XXXXXXXXX|* * *      |13
   XXXXXXXXXX|* * * * *  |19
 0    XXXXXXX|* * * * * *|5 32
     XXXXXXXX|* * * * *  |7 15 28
      XXXXXXX|*          |4 14 21
     XXXXXXXX|* *        |3 17 20 23
    XXXXXXXXX|           |10 18 24
-1     XXXXXX|           |
         XXXX|*          |1
         XXXX|           |
           XX|           |12 26
-2        XXX|           |25
           XX|           |
            X|           |
            X|           |
            X|           |
-3          X|           |
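As a rough illustration of how a student's location on such a scale might be estimated, the sketch below uses maximum likelihood with Newton-Raphson steps. It assumes the Rasch model and item difficulties that are already known. The slides do not say which estimation method was used, and the function names and numeric values here are hypothetical.

```python
import math

def rasch_probability(ability, difficulty):
    """Probability of success under the (assumed) Rasch model, in logits."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

def estimate_ability(responses, difficulties, max_iter=50, tol=1e-8):
    """Maximum likelihood ability estimate for 0/1 responses.

    Item difficulties are treated as known. Zero and perfect scores
    have no finite maximum likelihood estimate, so they are rejected.
    """
    raw_score = sum(responses)
    if raw_score == 0 or raw_score == len(responses):
        raise ValueError("no finite estimate for a zero or perfect score")
    theta = 0.0
    for _ in range(max_iter):
        probs = [rasch_probability(theta, d) for d in difficulties]
        gradient = raw_score - sum(probs)               # observed minus expected score
        information = sum(p * (1.0 - p) for p in probs)
        step = gradient / information                   # Newton-Raphson update
        theta += step
        if abs(step) < tol:
            break
    return theta

# Hypothetical item difficulties and response patterns.
difficulties = [-2.0, -1.0, 0.0, 1.0, 2.0]
print(round(estimate_ability([1, 1, 1, 0, 0], difficulties), 2))
print(round(estimate_ability([1, 0, 1, 1, 0], difficulties), 2))
```

Both response patterns have the same raw score on the same items, so under this model they receive the same ability estimate.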

Tasks at level 1 require mainly recall of knowledge, with little interpretation or reasoning.

Tasks at level 3 require doing mathematics in a somewhat "passive" way, such as manipulating expressions, carrying out computations, verifying propositions, etc., when the modelling has been done, the strategies given, the propositions stated, or the needed information is explicit.

Tasks at level 5 require doing mathematics in an active way: finding suitable strategies, selecting information, posing problems, constructing explanations and so on.

The distance between the locations of items and students fully describes students’ chances of success on the item.
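One way to make this concrete is to write the model out. The slides do not state the formula, so the following assumes the standard dichotomous Rasch model. The symbols are assumptions introduced here: \theta_n for the location of student n, \delta_i for the location of item i, and X_{ni} for the scored response (1 for success).

```latex
P(X_{ni} = 1) = \frac{\exp(\theta_n - \delta_i)}{1 + \exp(\theta_n - \delta_i)},
\qquad
\ln \frac{P(X_{ni} = 1)}{1 - P(X_{ni} = 1)} = \theta_n - \delta_i
```

Under this assumption the log-odds of success equal the distance \theta_n - \delta_i, so any student and item separated by the same distance have the same chance of success, wherever they sit on the scale.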

This property permits the use of described scales

Why a Rasch Model?
