Putting assessment principles in practice · EG: Useful way to think about this is with a...
Transcript of Putting assessment principles in practice · EG: Useful way to think about this is with a...
Putting assessment principles in practice
The case of speaking assessment
Evelina Galaczi and Nahal Khabbazbashi IATEFL 2016
Birmingham, UK © Cambridge English Language Assessment 2016
Why am I testing?
Who am I testing?
What am I testing?
How am I testing?
How am I scoring?
How is my test
benefiting learners?
Six key questions
© Cambridge English Language Assessment 2016
What makes speaking one of the most difficult skills to assess?
© Cambridge English Language Assessment 2016
Our aim today
Explore…
• What am I testing?
• How am I testing? • How am I scoring?
© Cambridge English Language Assessment 2016
TEST TAKER CHARACTERISTICS
CONTEXT VALIDITY COGNITIVE VALIDITY
RESPONSE
SCORING VALIDITY
SCORE/GRADE
CRITERION-RELATED VALIDITY
CONSEQUENTIAL VALIDITY
(Weir 2005, O’Sullivan and Weir 2011) © Cambridge English Language Assessment 2016
Look at the six speaking tasks in your hand outs. Think of a way to categorise them according to what they seem to be testing.
© Cambridge English Language Assessment 2016
Interaction
Production
Mechanical Reproduction
© Cambridge English Language Assessment 2016
What is speaking?
COGNITIVE KNOWLEDGE & PROCESSING
(Levelt 1989; Field 2011) © Cambridge English Language Assessment 2016
What is speaking?
COGNITIVE KNOWLEDGE & PROCESSING
USED IN A SOCIAL CONTEXT
The Speaking Construct
© Cambridge English Language Assessment 2016
The mechanics of driving Core linguistic knowledge
A range of driving situations
A range of communicative situations
A driving metaphor
© Cambridge English Language Assessment 2016
Tasks and Formats
© Cambridge English Language Assessment 2016
What are the advantages and disadvantages of face-to-face and computer-based tests?
© Cambridge English Language Assessment 2016
‘Direct’ face-to-face tests
Examiner Test taker
Any issues?
Traditionally…
© Cambridge English Language Assessment 2016
Emerging issues
• Oral interview = natural conversation? • Unbalanced power distribution • Differences in examiner support
Examiner Test taker
© Cambridge English Language Assessment 2016
Examiner talk Examiner: ‘Tell me something about your family. Have you got any brothers or sisters?’
In speaking tests the interlocutor (examiner and/or test partner) is part of the task.
Examiner rephrases original – to easier closed Yes/No question
© Cambridge English Language Assessment 2016
Examiner talk
Examiner: And how often do you play tennis? Test-taker: At home? Or in Cambridge? Examiner : Uhm… in Cambridge. Test-taker: Cambridge no. Examiner : You don’t play? Test-taker: I don’t play tennis.
What is the effect of the examiner talk here?
© Cambridge English Language Assessment 2016
Examiner talk
Examiner : What is she like in the class? Test-taker : Well really she’s very quiet. Examiner : Quiet. Test-taker : Yeah. Examiner : And you’re noisier.
What is happening here?
© Cambridge English Language Assessment 2016
Reflect the classroom
More pair work
Broader range of language
More interactional
symmetry
Paired tests Strengths
© Cambridge English Language Assessment 2016
Paired tests Issues
• Role of the peer interlocutor(s) • Individual scores / joint co-construction
© Cambridge English Language Assessment 2016
• Speech rate • Variety of accent • Proficiency level • Personality • Acquaintanceship • Cultural background • Gender • Status, etc.
Note: both in individual and paired tests
The interlocutor effect
(O’Sullivan, 2002)
© Cambridge English Language Assessment 2016
Variability and speaking tests: what should we do about it?
• Uniformity of interviewer language across speaking tests through rater training through an examiner script
• A range of task types and a range of interactional channels: ̶ benefit from interactional paired/group format ̶ control possible limitations
• Use of analytical and global criteria and scales
© Cambridge English Language Assessment 2016
Is the variability inherent in interaction: construct-irrelevant variance (and therefore to be avoided) …
… or is it part of
the speaking construct?
© Cambridge English Language Assessment 2016
‘Variability related to different characteristics of conversational partners is all that happens in the real world. And so they are things that we should be interested in testing.’ (Swain, 2004)
© Cambridge English Language Assessment 2016
No ‘best’ way to test speaking
• each format has its unique advantages and limitations
• each format is suitable for different purposes
© Cambridge English Language Assessment 2016
TASK DIFFICULTY
What features contribute to task difficulty?
© Cambridge English Language Assessment 2016
Cambridge English: Key (CEFR level A2)
• Controlled • Concrete and factual • Familiar experience • Scaffolding
Now I’m going to ask you to talk together. Candidate B, here is some information about a bookshop. Candidate A, you don’t know anything about the bookshop, so ask Candidate B some questions about it. Now A, ask B your questions about the bookshop and B, you answer them. [Card for B] [Card for A]
© Cambridge English Language Assessment 2016
Cambridge English: First (CEFR level B2)
• Semi-controlled • Concrete/abstract • Familiar context • Some scaffolding
© Cambridge English Language Assessment 2016
Cambridge English: Proficiency (CEFR level C2)
• Semi-controlled/open-ended • Abstract • Limited scaffolding
SAMPLE A
© Cambridge English Language Assessment 2016
Task Difficulty: Level of control
Reading aloud/repeating utterances Picture story Describing a picture/graphic Presentation about a topic Interview Role-play Discussion (examiner-test taker) Discussion (test taker-test taker)
cont
rol
Dialogic
Monologic
© Cambridge English Language Assessment 2016
Task Difficulty: Nature of information
Personal
Familiar
Concrete
Non-personal
Unfamiliar
Abstract
© Cambridge English Language Assessment 2016
Task Difficulty: Genre and speech functions
Genre • E.g. descriptive, narrative, argument, persuasion
Speech functions • Informational functions (e.g. expressing opinions,
comparing) • Interactional functions (e.g. agreeing, asking for
opinion, persuading) • Interaction management functions (initiating a turn,
interrupting, assigning the conversational floor) © Cambridge English Language Assessment 2016
And also…
• Are the instructions clear? • Is the purpose of the task clear to the
learners? • Is there a range of task types? • Are the items/tasks in a justifiable order? • Is the timing for each part of the test
appropriate? • Are the marking criteria explicit for the
learners? © Cambridge English Language Assessment 2016
Scoring speaking tests
Raters and rating scales
© Cambridge English Language Assessment 2016
Assessment scales
Holistic • General impression • One overall mark
Analytic • Broken down into components • Several marks - one for each assessment
category
© Cambridge English Language Assessment 2016
Rating scales Strengths and limitations of types of scales?
© Cambridge English Language Assessment 2016
Holistic assessment scales Strengths
• Practicality: faster Limitations
• No useful diagnostic information (single score) • Assumption that all relevant aspects of speaking
ability develop at the same rate • Not always easy to interpret as raters do not
necessarily use same criteria to arrive at a score
© Cambridge English Language Assessment 2016
Analytic assessment scales Strengths
• Diagnostic information if scores reported separately • Could be more reliable (multiple scores) • Useful in rater training • Useful for L2 learners who have an uneven profile
Limitations • Time-consuming • Scores sometimes combined into a composite score
© Cambridge English Language Assessment 2016
Reducing rater subjectivity
• Use of explicit and detailed assessment criteria • Training • Double/multiple marking • Checks on marking
– Expert-judgement approach – Statistical analysis
• A range of response formats in test design
© Cambridge English Language Assessment 2016