Copyright © 2016 by Educational Testing Service. All...
Transcript of Copyright © 2016 by Educational Testing Service. All...
![Page 1: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/1.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 2: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/2.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 3: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/3.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved. Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 4: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/4.jpg)
The aim is “measuring a construct,” framed in trait or
behavioral psychology.
Usually a single measure is desired.
Each task (item) is a self-contained situation that evokes
a response that provides evidence about the construct.
Each response is evaluated to provide an item score.
A test score accumulates evidence over items, usually
summing item scores, sometimes through a latent-
variable model such as item response theory (IRT).
The Standard Ed Measurement Paradigm
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 5: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/5.jpg)
A snippet of SimCityEDU: Pollution Challenge!
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 6: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/6.jpg)
A snippet of data from SimCityEDU
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 7: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/7.jpg)
• Probability-based reasoning.
• Building models that suited an inferential problem cast
in psychological theory, with germane data.
• Seeing reliability, validity, comparability,
generalizability, and fairness not just as measurement
issues, but “social values that have meaning and force
outside of measurement wherever evaluative
judgments and decisions are made.”
(Messick, 1994)
Insights in the Development of
Psychometrics / Educational Measurement
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 8: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/8.jpg)
The Standard Ed Measurement Paradigm
Psychology
• Measurement paradigm: observation & control (150 years)
is a layer over the Examination paradigm (2000 years!)
• Not much focus on cognitive or learning processes.
Data
• Human ratings of performances hide complexity, & don’t scale.
• “Objective scoring” does scale and can be automated, but at cost of
constraining observational situations and performances.
Models
• Galton, Cattell, Spearman, Thurstone, etc. were tackling problems
jointly in psychology, observation methods, modeling, and statistics.
• Early data mining: Regression, correlation, cluster analysis, factor
analysis, path diagrams.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 9: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/9.jpg)
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
Probability isn’t really about numbers;
it’s about the structure of reasoning.
Glenn Shafer (quoted in Pearl, 1988)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 10: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/10.jpg)
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
q Xj
Classical Test Theory
𝑿𝒋 = 𝜽 + 𝒆
𝑿𝒋~𝑵 𝜽,𝝈𝒆𝟐
XN
X1
:
:
Conditional independence
among Xs given q.posited
Item response theory (IRT) has
same structure at the level of items
rather than tests.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 11: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/11.jpg)
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
q1 X2
Factor analysis
𝑿𝒋 = 𝒃𝒋𝟏𝜽𝒌 +⋯+ 𝒃𝒋𝑲𝜽𝑲 + 𝒆𝒋
X3
X1
𝑿𝒋~𝑵 𝐁𝛉, 𝝈𝒋𝟐
q2
q3 Xj
XN
:
:
Both discovery and guided
exploration of underlying,
psychologically-relevant, structure to
“explain” patterns in data.
Same basic idea as current
exploratory use of Bayes nets,
multidimensional scaling, Gaussian
mixture cluster analysis.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 12: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/12.jpg)
𝒑 𝜽
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
q Xj
Bayesian inference
Prior: 𝒑 𝜽Likelihood: 𝒑 𝑿𝟏 𝜽
XN
X1
:
:
𝒑 𝑿𝟏 𝜽
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 13: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/13.jpg)
𝒑 𝜽
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
q Xj
Bayesian inference
Posterior:
𝒑 𝜽 𝑋𝟏 ∝ 𝒑 𝑿𝟏 𝜽 𝒑 𝜽
XN
X1
:
:
X1
𝒑 𝜽 𝑋𝟏
𝒑 𝑿𝟏 𝜽
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 14: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/14.jpg)
𝒑 𝜽
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
q Xj
• Modularity
• Real-time inference
• Real-time decisions
XN
X1
:
:
𝒑 𝑿𝟏 𝜽
𝒑 𝑿𝒋 𝜽
𝒑 𝑿𝑵 𝜽
q
q
q
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 15: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/15.jpg)
The Standard Ed Measurement Paradigm
Probability-Based Reasoning
• Metric for quantifying evidence.
• Common framework for synthesizing different
observations for different people.
• Tools to investigate how well do the patterns the
model can express accord with the patterns that
are in the data.
• Conceptual framework will lend itself to inference
beyond SEMP.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 16: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/16.jpg)
The Standard Ed Measurement Paradigm
Social Values
Validity, reliability, comparability,
[generalizability], and fairness are not just
measurement issues, but social values that
have meaning and force outside of
measurement wherever evaluative judgments
and decisions are made.
Messick, 1994
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 17: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/17.jpg)
Most approaches to curriculum, instruction, and
assessment are based on theories and models that
have not kept pace with modern knowledge of how
people learn.
They are based on implicit and limited conceptions of
learning that tend to be fragmented, outdated, and
poorly delineated for subject-matter domains.
Jim Pellegrino (2016)
Situative / Sociocognitive Psychology
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 18: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/18.jpg)
Confluence of ideas & research across domains –
• e.g., learning sciences; domain-based learning;
sociolinguistics; “new literacy”; anthropology; cognitive,
situated, social, neuro psychology.
Situative / Sociocognitive Psychology
Person Acting in Situation
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 19: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/19.jpg)
Human-level activity,
persons acting within
situations--the actions,
events, and activities
we experience as
individuals.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 20: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/20.jpg)
SOCIO-COGNITIVE
Extrapersonal, or between-
persons, patterns: Regularities in
interactions of people in
communities, affinity spaces.
Language; cultural models;
schemas for classrooms;
scientific models. (LCS patterns)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 21: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/21.jpg)
SOCIO-COGNITIVE
Within-person processes give rise
to individuals’ actions. Must both
relate to LCS patterns and adapt to
suit unique situations.
Resources to assemble particular
patterns to understand, create, &
act in particular kinds of situations.
KLI, CI theory, ACT-R; Lave,
Hutchins, Engeström; Language as
a complex adaptive system.Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 22: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/22.jpg)
Data live at this level.
We try to make sense of them
in terms of what we learn &
conjecture about the layers
above and below.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 23: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/23.jpg)
Patterns vary to various degrees with social
milieu – over time, by culture, with language,
by neighborhood, classroom, etc.
Implications for validity & fairness. Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 24: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/24.jpg)
Situative / Sociocognitive Psychology
Person acting in situation.
• What is important to notice?
• What does it mean?
• What will happen next?
• What kinds of things can I say / do next?
• How can I create / negotiate situations?
What does this imply for assessment?
• A great change in psychology and implied task environments…
which changes what the variables and distributions mean.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 25: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/25.jpg)
Q: How do we think of constructs (hence, latent variables)?
A: Tendencies / capabilities / manners of perceiving,
processing, and acting in certain kinds of situations—
constellations of certain kinds of resources.
But now realizing that resources are…
• Idiosyncratic, but similarities due to practices and LCS
patterns that structure situations.
• Contingent, and local in time and associations among people.
• Initially strongly connected to contexts of learning.
What is the range of a model’s “as if” usefulness?
For what purposes?
Implications for Psychometric Models
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 26: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/26.jpg)
• Continuous activity.
– We must characterize evidence, not “score responses.”
• Examinee actions change the situation.
• Changing proficiencies (esp. learning).
• Multiple proficiencies.
• Conditional dependence.
• Different proficiency / observable combinations.
• Multiple modalities.
• Interaction among examinees (e.g., collaboration).
Implications for Environments & Models
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 27: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/27.jpg)
The structure of assessment arguments
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 28: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/28.jpg)
Warrant sinceso
Claim about
student
Data
concerning
student
performance
Concerns features of
(possibly evolving)
context as seen from the
view of the assessor – in
particular, those seen as
relevant to targets of
inference.
Evaluation of performance
seeks evidence of
attunement to features of
targeted LCS patterns.
Depends on features of
task situation, since actions
in situations are evaluated
in light of targeted practices
/ LCS patterns.
More on this shortly.
Student acting in
assessment
situation
Data
concerning
task situation
Warrant in terms of
resources and LCS
patterns
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 29: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/29.jpg)
Student acting in
assessment
situation
Alternative
explanations
unless
Warrant sinceso
Claim about
student
Data
concerning
student
performance
Data
concerning
task situation
Social / cultural contextualization of assessment
Copyright © 2016 by Educational Testing Service. All rights reserved.
Interpretation and
argument from the
assessor’s perspective
![Page 30: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/30.jpg)
Student acting in
assessment
situation
Alternative
explanations
unless
Warrant sinceso
Claim about
student
Data
concerning
student
performance
Data
concerning
task situation
Social / cultural contextualization of assessment
LCS milieu of each student
Copyright © 2016 by Educational Testing Service. All rights reserved.
Interpretation and action from
the student’s perspective
![Page 31: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/31.jpg)
Student acting in
assessment
situation
Alternative
explanations
unless
Warrant sinceso
Claim about
student
Data
concerning
student
performance
Data
concerning
task situation
Social / cultural contextualization of assessment
LCS milieu of each student
Other information
concerning student
vis a vis assessment
situation
Copyright © 2016 by Educational Testing Service. All rights reserved.
Can extend assessment
procedures & arguments to
better reconcile perspectives:
• Adapt tasks.
• Adapt interpretation of task
features.
• Adapt interpretation of
student actions.
Improves validity & fairness.
![Page 32: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/32.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved.
This argument
structure suffices for
pre-constructed static
tasks.
![Page 33: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/33.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved.
Extending the argument structure to interactive tasks such as simulations:
• Situation changes in response to student actions (and often other reasons).
• Interpretation may need to account for certain past actions, situation features.
![Page 34: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/34.jpg)
Instantiating an assessment argument
in objects and processes
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 35: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/35.jpg)
X1q3
q4
q5
X2
X7
X1q3
X1
q3
q4
X2
X1
X2
X1q3
q4 X7
Copyright © 2016 by Educational Testing Service. All rights reserved.
More measurement model /
“psychometricy” elements.
![Page 36: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/36.jpg)
Copyright © 2016 by Educational Testing Service. All rights reserved.
More “learning analyticy”
elements.
![Page 37: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/37.jpg)
New Forms of Assessment
Sao Pedro, Gobert, Toto, & Paquette, AERA 2015
SimCityEDU. GlassLab Packet Tracer. Cisco Networking Academy
Behrens & DiCerbo, 2013
Tetralogues. Khan & Suendermann-Oeft, ETS
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 38: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/38.jpg)
Constructs
• Systems thinking, Interactional speaking, Troubleshooting,
Cross-cultural communication, Inquiry, Collaboration.
Activity Models (née Task Models)
• Simulation spaces, Trialogue w avatars, Inquiry space.
Situations & interactions designed to evoke evidence.
Work Product(s)
• Log files, videos, artifacts, speech/chats, artifacts/designs.
Psychometric Models
• SMVs tuned to theory, data, interaction, & purpose.
OVs allow different particulars same construct-driven theory.
Evidence Identification…
New Forms of Assessment
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 39: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/39.jpg)
“Computational Psychometrics”
Evidence for constructs from low-level data.
Hierarchies of chain of evidentiary reasoning (can be up & down, theory-aided.)
from von Davier, Khan, & Kerr
Psychometric
models
Machine
Learning,
Theory,
Data
mining
Supervised,
Semisupervised,
Unsupervised
Theory
Design
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 40: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/40.jpg)
Hierarchical Inference in Evidence Identification
Khan & Kerr (2014, 2015)
Psychometric
models
SimCityEDU: Pollution Challenge!
Locations, times, and durations of
clicks, hovers, drag & drops, etc.
Construct is levels on a systems-thinking
learning progression variable – reflects kinds of
things people can do in kinds of situations. Model
change at the level of challenges.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 41: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/41.jpg)
Hierarchical Inference in Evidence Identification
Khan & Kerr (2014, 2015)
Psychometric
models
SimCityEDU: Pollution Challenge!
Locations, times, durations and objects of “verb
clauses”– verbs like “rezone,” “bulldoze,” “query
map.” Log file contents. (+ system actions)
Construct is levels on a systems-thinking
learning progression variable – reflects kinds of
things people can do in kinds of situations. Model
change at the level of challenges.
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 42: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/42.jpg)
Hierarchical Inference in Evidence Identification
Khan & Kerr (2014, 2015)
Psychometric
models
SimCityEDU: Pollution Challenge!
Construct is levels on a systems-thinking
learning progression variable – reflects kinds of
things people can do in kinds of situations. Model
change at the level of challenges.
Copyright © 2016 by Educational Testing Service. All rights reserved.
Strategic action sequences; e.g., build
new low-pollution plant before bulldozing
high-pollution one.
![Page 43: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/43.jpg)
Hierarchical Inference in Evidence Identification
Khan & Kerr (2014, 2015)
Psychometric
models
SimCityEDU: Pollution Challenge!
Construct is levels on a systems-thinking
learning progression variable – reflects kinds of
things people can do in kinds of situations. Model
change at the level of challenges.
Copyright © 2016 by Educational Testing Service. All rights reserved.
Summary functions of counts of these actions
and system-state variables are input variables
into a dynamic Bayes net – hidden Markov model
with respect to level on learning progression.
![Page 44: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/44.jpg)
Hierarchical Inference in Evidence Identification
Khan & Kerr (2014, 2015)
Psychometric
models
SimCityEDUConstruct is levels on a systems-thinking
learning progression variable – reflects kinds of
things people can do in kinds of situations. Model
change at the level of challenges.
Bertling & Castellano (2016)
Copyright © 2016 by Educational Testing Service. All rights reserved.
Summary functions of counts of these actions
and system-state variables are input variables
into a dynamic Bayes net – hidden Markov model
with respect to level on learning progression.
![Page 45: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/45.jpg)
qR
qW
qS
qL
Student Model Task Model Fragment Library
X1
X2
q
RR
X3
X4
X5
X6
Task 1
Task 2
Task 3
Task 4
q
RR
q
RR
q
RR
q
RW
q
RW
q
RL
q
RS
q
RR
State vector.Tracks relevant
features of
situations and past
actions.
…
Evidence-bearing
opportunity detectors. Agents monitor state vector for
EBOs. [beyond “tasks”]
When a particular EBO occurs, evidence
identification routine evaluates evidence,
and “scoring engine” docks Bayes net
fragment with proficiency model to
update probability distribution for qs.
Flow of Activity
Psychometric
objects and
processes
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 46: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/46.jpg)
A Couple Quick Examples
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 47: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/47.jpg)
Modular Bayes net for Evaluating a
Casual-Loop Diagram
Shute et al. (2010)
Pervasive Student Model Variables
Ephemeral Observable variables
from an evidence-bearing
opportunity
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 48: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/48.jpg)
Conversation Mapping in Trialogue Assessment
Framework for using NLP with chat with avatars, to
monitor and CREATE evidence-bearing opportunities.
LaMar & Bergner (2015)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 49: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/49.jpg)
Business-Process
Modeling to
Identify Computer-
Network
Troubleshooting
Patterns of
Experts and
Novices
Cisco Networking Academy’s
Packet Tracer tasks.
Tiago Calico (2016)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 50: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/50.jpg)
Using network
theory to improve
task design and
scoring
Zhu, Shu, & von Davier (2016)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 51: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/51.jpg)
A Hidden Markov Model for
Collaboration
LaMar & Bergner (2015)
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 52: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/52.jpg)
Validity, reliability, comparability,
[generalizability], and fairness are not just
measurement issues, but social values that
have meaning and force outside of
measurement wherever evaluative judgments
and decisions are made.
Messick, 1994
Social values, revisited
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 53: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/53.jpg)
Conclusion – Key Ideas
• Probability-based reasoning.
• Situative / Sociocognitive psychological perspective.
• “Assessment as measurement” is nested inside
“model-based reasoning” is nested inside
“assessment as argument” is nested inside
“social context.”
• Validity, reliability, comparability, generalizability, fairness
– Probability models help address them rigorously.
• Dialectic between design and discovery.
• “Computational psychometrics”: Synergy of
psychometrics, learning analytics, data mining.
Copyright © 2015 Educational Testing Service. All rights reserved.
53
Copyright © 2016 by Educational Testing Service. All rights reserved.
![Page 54: Copyright © 2016 by Educational Testing Service. All ...lak16.solaresearch.org/wp-content/uploads/2016/05/Mislevy_LAK.Ke… · The Standard Ed Measurement Paradigm Psychology •](https://reader034.fdocuments.in/reader034/viewer/2022042309/5ed716b92da71b386924012c/html5/thumbnails/54.jpg)
Thank you.
Copyright © 2016 by Educational Testing Service. All rights reserved.