
The interaction plateau

CPI 494, April 9, 2009

Kurt VanLehn

1

2

Schematic of a natural language tutoring system, AutoTutor

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → Remediation: T: Hint or prompt → back to T: Elicit. T: Tell → Step end, only if out of hints.]

3

Schematic of other natural language tutors, e.g., Atlas, Circsim-Tutor, Kermit-SE

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → Remediation: a sub-dialogue (T: What is…? S: I don’t know. T: Well, what is…? S: … T: …) → back to T: Elicit. T: Tell → Step end, only if out of hints.]

Often called a KCD: Knowledge construction dialogue
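In control-flow terms, a KCD is a scripted sequence of increasingly pointed questions, with the answer told outright only as a last resort. The Python below is a minimal, hypothetical sketch of that structure, assuming a crude keyword matcher; it is not code from Atlas, Circsim-Tutor, or Kermit-SE, and every name in it (KCDQuestion, run_kcd, matches) is illustrative.

# Minimal sketch of a knowledge construction dialogue (KCD), assumed for
# illustration only; not code from Atlas, Circsim-Tutor, or Kermit-SE.
# When the student misses a step, the tutor walks through scripted
# simpler questions and tells the answer only as a last resort.

from dataclasses import dataclass, field

@dataclass
class KCDQuestion:
    prompt: str                                   # what the tutor asks
    expected: set = field(default_factory=set)    # keywords counted as correct
    tell: str = ""                                # bottom-out statement

def matches(answer: str, expected: set) -> bool:
    # Crude keyword overlap, standing in for real answer assessment.
    return bool(set(answer.lower().split()) & expected)

def run_kcd(questions, ask):
    # Ask each scripted question; re-prompt once, then tell.
    for q in questions:
        if matches(ask(q.prompt), q.expected):
            continue                              # correct: next question
        if not matches(ask("Well, " + q.prompt), q.expected):
            print("T:", q.tell)                   # out of questions: tell

if __name__ == "__main__":
    script = [KCDQuestion(
        "what is the net force on the pumpkin after it leaves your hand?",
        expected={"gravity", "weight", "vertical"},
        tell="The only force on it is gravity, which is vertical.")]
    run_kcd(script, ask=lambda prompt: input("T: " + prompt + "\nS: "))

A real KCD would nest these questions several levels deep and branch on the kind of wrong answer, but the skeleton (elicit, re-elicit, then tell) is the same one shown in the schematic above.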

4

Hypothesized ranking of tutoring, most effective first

A. Expert human tutors

B. Ordinary human tutors

C. Natural language tutoring systems

D. Step-based tutoring systems

E. Answer-based tutoring systems

F. No tutoring

5

Hypothesized effect sizes

[Bar chart: learning gains (effect size, 0 to 2.5) for no tutoring, answer-based tutoring, step-based tutoring, natural language tutoring, ordinary human tutors, and expert human tutors]

6

Hypothesized effect sizes

[Same bar chart of hypothesized learning gains, with annotation: Bloom’s (1984) 2-sigma, i.e., 4 weeks of human tutoring vs. classroom; classroom instruction is the baseline]
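As a reminder of what these gains mean: the slides report standardized effect sizes, i.e., differences between condition means measured in standard deviation units. The slides do not spell out the formula; the usual Cohen-style definition is:

d = \frac{\bar{x}_{\text{tutored}} - \bar{x}_{\text{comparison}}}{s_{\text{comparison}}}

So Bloom’s 2-sigma result says the average tutored student scored roughly two standard deviations above the average classroom student.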

7

Hypothesized effect sizes

[Same bar chart of hypothesized learning gains, classroom baseline, with annotation: Kulik (1984) meta-analysis of CAI vs. classroom, 0.4 sigma]

8

Hypothesized effect sizes

[Same bar chart of hypothesized learning gains, classroom baseline, with annotation: many intelligent tutoring systems, e.g., Andes (VanLehn et al., 2005), Carnegie Learning’s tutors…]

9

My main claim: There is an interaction plateau

[Chart: expected vs. observed learning gains (0 to 2.5) across the tutoring types above]

10

A problem and its steps

Suppose you are running in a straight line at constant speed. You throw a pumpkin straight up. Where will it land?

1. Initially, you and the pumpkin have the same horizontal velocity.
2. Your throw exerts a net force vertically on the pumpkin.
3. Thus causing a vertical acceleration.
4. Which leaves the horizontal velocity unaffected.
5. So when the pumpkin falls, it has traveled the same distance horizontally as you have.
6. Thus, it lands in your hands.

11

A dialogue between a human tutor (T) and human student (S)

Suppose you are running in a straight line at constant speed. You throw a pumpkin straight up. Where will it land?

S: Behind me.
T: Hmm. Let’s think about that. Before you toss the pumpkin and are just carrying it, do you and the pumpkin have the same speed?
S: Yes.
T: Good. When you toss it up, is the net force on it exactly vertical?
S: I’m not sure.
T: You exert a force on the pumpkin, right?
Etc.

12

Schematic of dialogue about a single step

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → Remediation: T: Hint, or prompt, or explain, or analogy, or … → back to T: Elicit. T: Tell → Step end.]

13

Comparisons of expert to novice human tutors

[Same flowchart of one step, with the remediation repertoire (hint, prompt, explain, analogy, …) marked for both novice and expert tutors]

Experts may have a wider variety

14

Schematic of an ITS handling of a single step

[Flowchart of one step: Step start → S: Correct → Step end. If S: Incorrect → T: Hint → back to the step. T: Tell → Step end, only if out of hints.]
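Because the ITS version of the loop is so constrained (attempt, flag the error, give the next hint, tell when the hints run out), it can be written down directly. The Python below is a minimal sketch of that control flow under assumed hooks (get_attempt, is_correct); it is not code from Andes or any other system named in these slides.

# Minimal sketch (assumed, not from Andes or any cited ITS) of step-based
# remediation: each wrong attempt yields the next hint in a fixed sequence,
# ending with a "bottom-out" hint that simply tells the step.

def tutor_one_step(step_id, hints, bottom_out, get_attempt, is_correct):
    """Run the remediation loop for a single step.

    hints       -- ordered hint strings, vaguest first
    bottom_out  -- final hint that states the step outright ("T: Tell")
    get_attempt -- callable returning the student's next attempt at the step
    is_correct  -- callable judging an attempt (menu choice, equation, etc.)
    """
    hint_index = 0
    while True:
        attempt = get_attempt(step_id)
        if is_correct(step_id, attempt):
            return True                      # step end: student got it
        if hint_index < len(hints):
            print("T:", hints[hint_index])   # give the next hint and re-elicit
            hint_index += 1
        else:
            print("T:", bottom_out)          # out of hints: tell the step
            return False                     # step end: tutor had to tell

if __name__ == "__main__":
    tutor_one_step(
        step_id="horizontal-velocity",
        hints=["What happens to the pumpkin's horizontal velocity?",
               "Is there any horizontal force on the pumpkin?"],
        bottom_out="With no horizontal force, the horizontal velocity is unchanged.",
        get_attempt=lambda s: input("S: "),
        is_correct=lambda s, a: "unchanged" in a.lower() or "same" in a.lower(),
    )

The hint list is ordered from vaguest to most specific; the bottom-out hint is the “T: Tell” arc in the schematic.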

15

Major differences

Low-interaction tutoring (e.g., CAI)
– Remediation on answer only

Step-based interaction (e.g., ITS)
– Remediation on each step
– Hint sequence, with final “bottom out” hint

Natural tutoring (e.g., human tutoring)
– Remediation on each step, substep, inference…
– Natural language dialogues
– Many tutorial tactics

16

Conditions (VanLehn, Graesser et al., 2007)

Natural tutoring
– Expert human tutors
  » Typed
  » Spoken
– Natural language dialogue computer tutors
  » Why2-AutoTutor (Graesser et al.)
  » Why2-Atlas (VanLehn et al.)

Step-based interaction
– Canned text remediation

Low interaction
– Textbook

17

Human tutors (a form of natural tutoring)

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → T: Hint, or prompt, or explain, or analogy, or … → back to T: Elicit. T: Tell → Step end.]

18

Why2-Atlas (a form of natural tutoring)

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → a knowledge construction dialogue → back to T: Elicit. T: Tell → Step end.]

19

Why2-AutoTutor (a form of natural tutoring)

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → T: Hint or prompt → back to T: Elicit. T: Tell → Step end.]

20

Canned-text remediation (a form of step-based interaction)

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → canned text → back to T: Elicit. T: Tell → Step end.]

21

Experiment 1: Intermediate students & instruction

[Bar chart: adjusted post-test scores (0 to 1) on multiple-choice and essay tests, for human tutors (N=18), Why2-Atlas (N=22), Why2-AutoTutor (N=24), and canned text remediation (N=22)]

22

Experiment 1: Intermediate students & instruction

[Same bar chart as above]

No reliable differences

23

Experiment 2: AutoTutor > Textbook = Nothing

[Bar chart: adjusted post-test scores (0 to 1) on multiple-choice and essay tests, for AutoTutor, Textbook, and Nothing]

Reliably different

24

Experiments 1 & 2 (VanLehn, Graesser et al., 2007)

[Bar chart: adjusted post-test scores for read-only textbook studying, step-based computer tutoring, Why2-AutoTutor, Why2-Atlas, and human tutoring]

No significant differences

25

Experiment 3: Intermediate students & instruction

[Bar chart: adjusted post-test scores (0 to 1) on multiple choice, near-transfer essay, far-transfer essay, retention multiple choice, and retention essay, for Why2-AutoTutor (N=32) and canned text remediation (N=30)]

Deeper assessments

26

Experiment 3: Intermediate students & instruction

[Same bar chart as above]

No reliable differences

27

Experiment 4: Novice students & intermediate instruction

[Bar chart: adjusted post-test scores (0 to 1) on multiple-choice and essay tests, for spoken human tutoring (N=14), typed human tutoring (N=20), and canned text remediation (N=20)]

Relearning

28

Experiment 4: Novice students & intermediate instruction

[Same bar chart as above]

All differences reliable

29

Experiment 5: Novice students & intermediate (but shorter) instruction

[Bar chart: adjusted post-test scores (0 to 1) on multiple-choice, near essay, and far essay tests, for spoken human tutoring (N=21), Why2-Atlas (N=21), Why2-AutoTutor (N=21), and canned text remediation (N=19); annotation: Relearning]

30

Experiment 5: Novice students & intermediate instruction

[Same bar chart as above]

No reliable differences

31

Experiment 5: Low-pretest students only

[Bar chart: adjusted post-test scores (0 to 1) on multiple-choice, near essay, and far essay tests, for spoken human tutoring (N=9), Why2-Atlas (N=7), Why2-AutoTutor (N=10), and canned text remediation (N=11)]

Aptitude-treatment interaction?

32

Experiment 5: Low-pretest students only

[Same bar chart as above]

Spoken human tutoring > canned text remediation

33

Experiments 6 and 7: Novice students & novice instruction

[Bar chart: scores (0 to 1) on multiple-choice, fill-in-the-blank, and essay tests, for Why2-AutoTutor, canned text remediation (Experiment 6), text only, Why2-Atlas, and canned text remediation (Experiment 7)]

Was the intermediate text over the novice students’ heads?

34

Experiments 6 and 7: Novice students & novice instruction

[Same bar chart as above]

No reliable differences

35

Interpretation

[Diagram: Experiments 1 & 4, Experiments 3 & 5, and Experiments 6 & 7 placed by student population (intermediates vs. novices, each split into high-pretest and low-pretest) against content complexity.
Legend: can follow the reasoning only with the tutor’s help (ZPD): predict Tutoring > Canned text remediation; can follow the reasoning without any help: predict Tutoring = Canned text remediation.]

36

Original research questions

Can natural language tutorial dialog add pedagogical value?
– Yes, when students must study content that is too complex to be understood by reading alone

How feasible is a deep linguistic tutoring system?
– We built it. It’s fast enough to use.

Can deep linguistic and dialog techniques add pedagogical value?

37

When content is too complex to learn by reading alone: Deep > Shallow?

[Same bar chart as the low-pretest subset of Experiment 5: spoken human tutoring (N=9), Why2-Atlas (N=7), Why2-AutoTutor (N=10), and canned text remediation (N=11)]

Why2-Atlas is not clearly better than Why2-AutoTutor

38

When to use deep vs. shallow?

Sentence understanding: shallow = LSA, Rainbow, Rappel; deep = Carmel (parser, semantics, …). Use both.
Essay/discourse understanding: shallow = LSA; deep = abduction, Bnets. Use deep.
Dialog management: shallow = finite state networks; deep = reactive planning. Use a locally smart FSA.
Natural language generation: shallow = text; deep = plan-based. Use equivalent texts.
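To make the “shallow” column concrete, here is a deliberately simplified stand-in for shallow sentence understanding: classify the student’s sentence by bag-of-words cosine similarity against expected answers. LSA would first project these vectors into a low-dimensional space learned by SVD over a corpus, and Rainbow would use a trained classifier; this sketch, including its example sentences and threshold, is an assumption for illustration.

# Simplified stand-in for shallow sentence understanding: represent the
# student's sentence and each expected answer as bag-of-words vectors and
# pick the expected answer with the highest cosine similarity.
# (LSA would first project these vectors into a low-dimensional space
# learned by SVD over a corpus; that step is omitted here.)

import math
from collections import Counter

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def classify(student_sentence: str, expected_answers: dict, threshold: float = 0.3):
    """Return the best-matching expected answer label, or None if nothing is close."""
    student_vec = Counter(student_sentence.lower().split())
    scored = [(cosine(student_vec, Counter(text.lower().split())), label)
              for label, text in expected_answers.items()]
    best_score, best_label = max(scored)
    return best_label if best_score >= threshold else None

if __name__ == "__main__":
    expectations = {  # illustrative expected answers for the pumpkin problem
        "same-horizontal-velocity": "the pumpkin keeps the same horizontal velocity as the runner",
        "vertical-net-force": "the net force on the pumpkin is vertical",
    }
    print(classify("its horizontal velocity stays the same as mine", expectations))

In the deep column, Carmel’s parser and semantic interpretation would replace this matcher.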

39

Results from all 7 experiments (VanLehn, Graesser et al., 2007)

Why2: Atlas = AutoTutor

Why2 > Textbook
– No essays
– Content differences

Human tutoring = Why2 = Canned text remediation
– Except when novice students worked with instruction designed for intermediates; then Human tutoring > Canned text remediation

40

Other evidence for the interaction plateau (Evens & Michael, 2006)

[Bar chart: mean gain (0 to 6) for Reading (1993), Reading (1999), Reading (2002), Circsim (1999), Circsim-Tutor (1999), Circsim-Tutor (2002), Human tutors (1999), and Human tutors (1993)]

No significant differences

41

Other evidence for the interaction plateau (Reif & Scott, 1999)

[Bar chart: scores (0 to 100) for untutored, step-based tutoring, and human tutoring]

No significant differences

42

Other evidence for the interaction plateau (Chi, Roy & Hausmann, in press)

[Bar chart: adjusted deep post-test steps (%) for individuals + video, individuals + textbook, pairs + textbook, pairs + video, and human tutoring]

No significant differences

43

Still more studies where natural tutoring = step-based interaction

Human tutors
1. Human tutoring = human tutoring with only content-free prompting for step remediation (Chi et al., 2001)
2. Human tutoring = canned text during post-practice remediation (Katz et al., 2003)
3. Socratic human tutoring = didactic human tutoring (Rosé et al., 2001a)
4. Socratic human tutoring = didactic human tutoring (Johnson & Johnson, 1992)
5. Expert human tutoring = novice human tutoring (Chae, Kim & Glass, 2005)

Natural language tutoring systems
1. Andes-Atlas = Andes with canned text (Rosé et al., 2001b)
2. Kermit = Kermit with dialogue explanations (Weerasinghe & Mitrovic, 2006)

44

Hypothesis 1: Exactly how tutors remedy a step doesn’t matter much

[Flowchart of one step: Step start → T: Elicit → S: Correct → Step end. If S: Incorrect → remediation → back to T: Elicit. T: Tell → Step end. Annotation on the remediation box: what’s in here doesn’t matter much]

45

Main claim: There is an interaction plateau

[Chart: expected vs. observed learning gains (0 to 2.5) for low-interaction instruction, step-based instruction, and natural tutoring]

Hypothesis 1

46

Hypothesis 2: Cannot eliminate the step remediation loop

[Flowchart of one step as above, with the shortcut from S: Incorrect straight to T: Tell marked “must avoid this”, i.e., the step remediation loop cannot be eliminated]

47

Main claim: There is an interaction plateau

[Same chart of expected vs. observed learning gains]

Hypothesis 2

48

Conclusions

What does it take to make computer tutors as effective as human tutors?
– Step-based interaction
– Bloom’s 2-sigma results may have been due to weak control conditions (classroom instruction)
– Other evaluations have also used weak controls

When is natural language useful?
– For steps themselves (vs. menus, algebra…)
– NOT for feedback & hints (remediation) on steps

49

Future directions for tutoring systems research

Making step-based instruction ubiquitous
– Authoring & customizing
– Novel task domains

Increasing engagement

50

Final thought

Many people “just know” that more interaction produces more learning.

“It ain’t so much the things we don’t know that get us into trouble. It’s the things we know that just ain’t so.” – Josh Billings (aka. Henry Wheeler Shaw)