On the statistical meaning of the item parameters in IRT models - … · 2018-09-12 ·...

Ernesto San Martín (LIES) Statistical meaning of IRT parametersAgosto 2018, Juiz de Fora, Brasil 1

On the statistical meaning of the item parame-ters in IRT modelsCONBRATRI VI

Ernesto San Martín1Faculty of Mathematics, Pontificia Universidad Católica de Chile, Chile2The Economics School of Louvain, Université catholique de Louvain, Belgium3Laboratorio Interdisciplinario de Estadística Social, LIES, Pontificia Universidad Católica de Chile, Chile

Agosto 2018, Juiz de Fora, Brasil

What means to model aeducational phenomenon?

We observe the response patterns of 50 students on 7 items:

It1 It2 It3 It4 It5 It6 It7Student 1 1 0 0 1 1 0 1Student 2 0 1 1 0 1 1 0Student 3 1 1 1 1 0 1 0Student 4 0 0 1 0 0 0 1Student 5 0 0 0 1 1 1 1Student 6 1 0 0 1 1 0 1Student 7 0 1 1 0 0 1 0Student 8 1 1 0 0 1 1 0Student 9 0 0 1 1 0 0 0Student 10 1 0 0 1 1 1 1

We observe the response patterns of 50 students on 7 items:

It1 It2 It3 It4 It5 It6 It7Student 1 1 0 0 1 1 0 1Student 2 0 1 1 0 1 1 0Student 3 1 1 1 1 0 1 0Student 4 0 0 1 0 0 0 1Student 5 0 0 0 1 1 1 1Student 6 1 0 0 1 1 0 1Student 7 0 1 1 0 0 1 0Student 8 1 1 0 0 1 1 0Student 9 0 0 1 1 0 0 0Student 10 1 0 0 1 1 1 1

We intend to describe the behavior of items.

We intend to describe the behavior of students.

In particular, we intend to identify test fraud as answer copyingbetween two examinees.

We can face these questions using psychometric models.

Detecting answer copyingand IRT models

IRT models: student’s answer on an item depends on both anindividual characteristic (ability) and item characteristics(difficulty, discrimination, guessing).

3PL model: P(Ypi = 1) = ci + (1− ci )exp(αiθp − βi )

1+ exp(αiθp − βi ).

If αi = 1 then we obtain the 1PL-G model.If ci = 0 then we obtain the 2PL model.If αi = 1 and ci = 0 then we obtain the 1PL model.

There are other extensions: the 4PL Model . . .

−4 −2 0 2 4

beta=0beta=1beta=−1

●●

●●●●●●●●●●●●●●●●

●●●

●●

●●●

10 20 30 40

DE=0.31; CIT=0.37

Puntaje Total

Zopluoglu (2016) analyzes the statistical behavior of answer-copying indexes underdifferent IRT models (dichotomous and polychotomous)

He remarks that copying indexes share the same rationale, but the computation ofthe probability of choosing the alternative k of the item j for the person p is doneusing either an IRT model or the CTT framework.

The objective of his contribution is to analyze the type I error behavior and thestatistical power of copying indexes.

Data: 40 items of mathematics applied to 67896 students.

Adjust 1PL, 2PL and 3PL using IRTPRO (which provides MML estimators for theparameters that characterize the items, although it also provides Bayesianestimates).

●●

● ●●

●●

−4 −3 −2 −1 0 1

Con datos de Zopluoglu (2016)

Dificultades 1PL

●●

● ●

1PL vs 2PL (0.81)1PL vs 3PL (0.86)

Study 1: manipulate sample sizes, amount of copy, type of copy, levels of itemsdifficulties.

Item difficulties: easy and medium difficulties . . . “overall test difficulty wasmanipulated through the b parameters for the dichotomous IRT models” (p.595).

Remark: Given the high correlations between the estimators of the difficulties, it ispossible to keep this part of the design comparable with respect to the different IRTmodels . . .

¿or not? ¿why?

For easy test difficulty conditions, test difficulty was around .80 for typical simulated responsedata which was similar to the real dataset. For medium test difficulty conditions, the b parametersfor dichotomous IRT models [. . . ] were manipulated such that test difficulty was around .50for typical simulated response data. This was accomplished by adding a constant of 1.52 forthe 1PL, 1.56 for the 2PL, and 1.53 for the 3PL model to the b parameters used for the easyconditions (pp.595-596.

Remark: Given the high correlations between the estimators of the difficulties, it ispossible to keep this part of the design comparable with respect to the different IRTmodels . . . ¿or not? ¿why?

Copying types:

Random copying: it is assumed that the student who copies, copies responsesfrom a student-source in a random manner, so that all items are equally likelyto be copied.

Difficulty weighted copying: the items are ordered from the easiest to the moredifficult. The probability that each item is copied is proportional to their rank.

Outcome of interest: the AUC is used as a measure ofclassification accuracy for how well an index separates theanswer-copying and honest pairs of students.

Results:

Amount of copy is the most important factor that affect the classification’sperformance.

The difficulty level of items and the copy types has a negligible effect.

It was expected that the indexes would be reduced to some degree given thepresence of guessing, but it doesn’t.

Critical review?

Initial question: What means to model an educationalphenomenon?

It means to provide structural definitions of educational concepts

either by introducing explicit structural definitions,

or by introducing statistical models the parameters of which“represent some properties of the population under analysis”(Fisher, 1922).

Critical review?

On the statistical meaning of the item parameters in IRT models - … · 2018-09-12 ·...

Documents

Transcript of On the statistical meaning of the item parameters in IRT models - … · 2018-09-12 ·...

Log parameters & meaning - BIMMERPOST

An easy procedure to determine Magic Formula parameters ...Procedure to determine Magic Formula parameters 691 meaning of the tyre model parameters. Consequently, we need not select

IRT RESUMES - ed

IRT Fixed Parameter Calibration and Other Approaches to Maintaining Item Parameters on a Common Ability Scale Seonghoon Kim, PhD Keimyung University Email:

[IRT] Item Response Theory - Survey Design · Title irt — Introduction to IRT models DescriptionRemarks and examplesReferencesAlso see Description Item response theory (IRT) is

IRT to IPIP.pdf

Unidimensional and Multidimensional IRT Modeling · PDF fileUnidimensional and Multidimensional IRT Modeling with the mirt Package ... 3PL, 2PL, 1PL, and Rasch model ... parameters

IRT Evaluation 280508initialridertraining.eu/docs/IRTConf_PekkaRanta_Tampere...1 IRT Conference IRT e-ChiCoaching An Evaluation of the Potential of e-Coaching for Riders Senior Researcher

IRT in PIDD’s - Indications & Applications 2019 presentations/irt-indictions...IRT in PIDD’s - Indications & Applications - André van Niekerk Paediatric Pulmonologist UP & Netcare

CAT5508 2 001 003 - 5508(2) TA-2 - Accent Bearings · 2016. 4. 1. · irt 1212-1 irt 1216-1 irt 1222-1 irt 1216-1 irt 1220-1 irt 1215-2 irt 1220-2 irt 1225-2 irt 1215-2 irt 1225-2

Product specification FlexTrack IRT 501 … specification - Robot user documentation 3HAC024534-001 Product manual - FlexTrack IRT 501-66 IRT 501-66R IRT 501-90 IRT 501-90R 3HAW050008590

IRT Model Specifications and Scale Characteristics · PDF fileIRT Model Specifications and Scale Characteristics ... A.K.A. “the Rasch model ... • Parameters in an IRT model are

IRT 424 DTP IRT 425 DTP IRT 428 DTP - Hedson · (IRT 464 DTP OPER MANUAL INT) utom paragraferna 5, 11:2 och 12. Per montaggio su guida vedere il manuale 713683. (IRT 3-20_4-20 Rail

IRT Research Overview

Laney IRT Studio

IRT - A2V Mécatronique · Instruction Manual 2000 & 4000 - Pro˜le Add-on IRT qUALITY IN MOTION IRT qUALITY IN MOTION

IRT 4520 IRT 4020 ThermoScan - roteskreuz.at · IRT 4520 IRT 4020 Type 6022 Type 6023 IRT 4520/4020 EK KURTZ DESIGN 12.02.03-800-32 ThermoScan 226 T 6022 I / O mem 6022351_IRT_CE_S1

IRT-350 IR-Thermometer IRT-350 IR thermometer IRT-350 ... · IRT-350 IR-Thermometer Bedienungsanleitung Seite 2 - 21 Page 22 - 41 Page 42 - 61 Pagina 62 - 81 Version 03/14 Operating

Visual Irt

IRT 4520/4020 EK KURTZ DESIGN 12.02.03 IRT 4520 IRT · PDF fileIRT 4520 IRT 4020 Type 6022 Type 6023 IRT 4520/4020 EK KURTZ DESIGN 12.02.03 7 ThermoScan-7 226 pe: 6022 I / O mem 6022434_IRT_AP_S1