Issues in Standardized and Custom-designed Assessment

38
Issues in Standardized and Custom-designed Assessment Presented by Gail Stewart & Andrea Strachan Moving Forward Together: Higher-Level Language Training Niagara Falls, Ontario March 25-28, 2008

Transcript of Issues in Standardized and Custom-designed Assessment

Page 1: Issues in Standardized and Custom-designed Assessment

Issues in Standardized and

Custom-designed

Assessment

Presented by

Gail Stewart & Andrea Strachan

Moving Forward Together: Higher-Level Language Training

Niagara Falls, Ontario

March 25-28, 2008

Page 2: Issues in Standardized and Custom-designed Assessment

Discussing the Issues

Gail Stewart

Page 3: Issues in Standardized and Custom-designed Assessment

Standardized Assessment

Some Common Associations

Large scale

Generic

Biased in favour of dominant culture

Graded on a curve

Multiple choice

Focus on what test takers cannot do

Negative impact on curriculum

Page 4: Issues in Standardized and Custom-designed Assessment

Assumptions from

Public Education System

Standardized Tests are norm referenced

Rank students as successes or failures

Page 5: Issues in Standardized and Custom-designed Assessment

Standardized

Conforming to an acknowledged

measure of comparison for

quantitative or qualitative value

ITP Nelson Canadian Dictionary

Page 6: Issues in Standardized and Custom-designed Assessment

Customized Assessment

Some Common Associations

Small scale

Graded against a criterion

Authentic tasks and items

Focus on what test takers can do

Positive impact on curriculum

Page 7: Issues in Standardized and Custom-designed Assessment

Assumptions from

Public Education System

Customized tests are criterion referenced

Students succeed or fail based on their

knowledge of the subject

Page 8: Issues in Standardized and Custom-designed Assessment

Customized Assessment can:

Be norm referenced

Include multiple-choice item types

Be administered on a large scale

Conform to rigid procedures and protocols

Be standardized

Page 9: Issues in Standardized and Custom-designed Assessment

Standardized Assessment can:

Be criterion referenced

Include subjective item types

Be administered on a small scale

Focus on what test takers can do

Have a positive effect on curriculum

Be customized

Page 10: Issues in Standardized and Custom-designed Assessment

Standardized

Administered and scored in a

consistent manner

Wikipedia

Page 11: Issues in Standardized and Custom-designed Assessment

Customized

Created according to

individual specifications

ITP Nelson Canadian Dictionary

Page 12: Issues in Standardized and Custom-designed Assessment

Standardized ~ Customized

Not mutually exclusive terms

Every good test should be:

standardized to some degree

customized to some degree

Page 13: Issues in Standardized and Custom-designed Assessment

Ideal Assessment

Custom-designed

Created according to individual needs and specifications

Standardized

Administered and scored in a consistent manner

Page 14: Issues in Standardized and Custom-designed Assessment

Authenticity

Development

Time

Maintenance

Money and

Resources

Relevance

Validity

Customization

Page 15: Issues in Standardized and Custom-designed Assessment

Customization requires

Careful needs analysis

Rigorous fieldwork and observation

Detailed specifications

Adaptation of authentic materials

Thorough review and consultation process

Careful selection of pilot sample

Ongoing commitment to maintenance

Page 16: Issues in Standardized and Custom-designed Assessment

Customization

To what degree is it necessary?

Depends on ~ Population

Purpose

Page 17: Issues in Standardized and Custom-designed Assessment

CLBA

Population and Purpose

Newcomers to Canada

Various international cultures

Various international languages

Adults ~ all ages

Limited exposure to Canadian culture

Purpose is placement in language programs

Page 18: Issues in Standardized and Custom-designed Assessment

CLBA Customization

Consideration of newcomer needs

CLB-based test specifications

Accessible, authentic, meaningful tasks

Appropriate range and variety of tasks

Tasks linked to CLB-based curriculum

Reviewed by newcomers and agencies

Pilot included various international cultures

Pilot included various first languages

Pilot included adults of various ages

Page 19: Issues in Standardized and Custom-designed Assessment

Additional Requirements for

Occupation-specific Customization

Occupational Analysis

Literature Review

Surveys and Interviews

Shadowing

Scope of Practice

Range of Functions

Levels of Language

Page 20: Issues in Standardized and Custom-designed Assessment

Validity

Development

Time

Maintenance

Money and

Resources

Reliability

Quality

Standardization

Page 21: Issues in Standardized and Custom-designed Assessment

Standardization requires

Detailed specifications

Strict protocols and procedures

Written rules and regulations

Large pilot sample

Rigorous data analysis

Quality control measures

Ongoing commitment to maintenance

Page 22: Issues in Standardized and Custom-designed Assessment

Standardization

To what degree is it necessary?

Depends on ~ Purpose

Stakes

Page 23: Issues in Standardized and Custom-designed Assessment

Purpose Stakes

Eligibility Highest

Certification

Outcomes

Progress

Placement

Diagnostic Lowest

Page 24: Issues in Standardized and Custom-designed Assessment

CLBA Standardization

Consistent administration procedure

Assessment Protocol

Assessor Training

Reliable scoring procedure

Task and Item Types

Marking Memoranda

Assessor Training

Page 25: Issues in Standardized and Custom-designed Assessment

The Dilemma of Institution X

Need High-stakes test of eligibility

Problem No time and no resources

Target English language ability

of profession Z

for practice in Ontario

Page 26: Issues in Standardized and Custom-designed Assessment

Reasons for Choosing TOEFL

Validated for high stakes

40-year track record

ETS reputation

Perceived as a test of general proficiency

Perceived as a one-size-fits-all

Other institutions are using it

Page 27: Issues in Standardized and Custom-designed Assessment

Customization ~ Standardization

Paradox

The more a test is customized,

the more appropriate it becomes for its intended population and purpose

the less appropriate it becomes as a test of general proficiency for varied purposes

Page 28: Issues in Standardized and Custom-designed Assessment

Original Intent of TOEFL

Population International TAs

Purpose Eligibility

Stakes High

Page 29: Issues in Standardized and Custom-designed Assessment

Actual Usage of the TOEFL

Purpose Eligibility

Stakes High

Population Determined by

users of the test

Page 30: Issues in Standardized and Custom-designed Assessment

Original New

CBT iBT

Item based Task based

Separate skills Integrated skills

Variety of content Academic content

General proficiency Targeted proficiency

Page 31: Issues in Standardized and Custom-designed Assessment

TOEFL Paradox

As the TOEFL is improved and refined,

it becomes more defensible and more valid

for its intended purpose.

But, it becomes even less defensible than before

as a general one-size-fits-all solution.

Page 32: Issues in Standardized and Custom-designed Assessment

Typical Response to New TOEFL

We use TOEFL scores to determine eligibility.

We notice that the TOEFL has changed.

The scoring scale looks different.

How do we equate these new scores

with our existing standard?

Page 33: Issues in Standardized and Custom-designed Assessment

A better way to respond …

We use TOEFL scores to determine eligibility.

We notice that the TOEFL has changed.

What changes have been made and why?

What does a score on the new TOEFL mean?

Is the new TOEFL iBT appropriate

for our intended purpose?

Page 34: Issues in Standardized and Custom-designed Assessment

The Relevant Question

Is a custom-designed assessment of

academic eligibility appropriate and

valid for other eligibility purposes?

Justified in fewer contexts

Page 35: Issues in Standardized and Custom-designed Assessment

Choosing an Existing

Assessment is Difficult

Seldom a perfect fit (face validity issues)

May be a valid test, but for what purpose?

Interpretation of scores in a new context

Standard-setting issues

Challenges and appeals

Even large-scale tests are more customized

(TOEFL example) ~ so what is the solution?

Page 36: Issues in Standardized and Custom-designed Assessment

Toward a Custom-design Model

Canada has a common standard (CLB 2000)

widely accepted

ESL curriculum foundation

assessment underpinning

Like institutions could pool resources

CLB occupational / needs analyses

CLB-based test specifications

Common design and format

Specific tasks and items

Share registration, administration, maintenance, assessor training, validation, scoring, reporting

Page 37: Issues in Standardized and Custom-designed Assessment

CLB-based Challenges

Curriculum document, not a test blueprint

Describes learners, not tasks

Tasks are illustrative in nature

Sample tasks are not a validated hierarchy

Many tasks are integrated

Descriptors require exemplars

Page 38: Issues in Standardized and Custom-designed Assessment

Meeting the Challenges

Success stories from the field:

Placement: CLBA, CLBPT, ELTPA, WLA

Eligibility: CELBAN, CLBA-I

MELA and IPGLA