CSCW FamilySearch Indexing


CSCW, SAN ANTONIO, TX, FEB 26, 2013

Derek Hansen, Patrick Schone, Douglas Corey, Matthew Reid, & Jake Gehring

QUALITY CONTROL MECHANISMS FOR CROWDSOURCING: PEER REVIEW, ARBITRATION, & EXPERTISE AT FAMILYSEARCH INDEXING

FamilySearch.org

FamilySearch Indexing (FSI)

FSI in the Broader Landscape

• Crowdsourcing Project: Aggregates discrete tasks completed by volunteers who replace professionals (Howe, 2006; Doan, et al., 2011)

• Human Computation System: Humans use a computational system to work on a problem that may someday be solvable by computers (Quinn & Bederson, 2011)

• Lightweight Peer Production: Largely anonymous contributors independently completing discrete, repetitive tasks provided by authorities (Haythornthwaite, 2009)

Design Challenge: Improve efficiency without sacrificing quality

[Chart: amount of scanned documents over time]

Quality Control Mechanisms

• 9 types of quality control for human computation systems (Quinn & Bederson, 2011)
• Redundancy
• Multi-level review
• Find-Fix-Verify pattern (Bernstein, et al., 2010)
• Weight proposed solutions by reputation of contributor (McCann, et al., 2003)
• Peer or expert oversight (Cosley, et al., 2005)
• Tournament selection approach (Sun, et al., 2011)

A-B-Arbitrate process (A-B-ARB)

[Diagram: A → B → ARB]

Currently Used Mechanism
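A minimal sketch of this flow, using a hypothetical dict-based record format and arbitrate() callable rather than FSI's actual implementation: A and B key the record independently, and only fields where their values disagree are routed to the arbitrator.

```python
# Sketch of the A-B-Arbitrate (A-B-ARB) flow. The record format and the
# arbitrate() callable are hypothetical stand-ins, not FSI's internals.

def resolve_ab_arb(a_record, b_record, arbitrate):
    """a_record, b_record: dicts mapping field name -> transcribed value.
    arbitrate(field, a_val, b_val) returns the final value (a human in FSI)."""
    final = {}
    for field, a_val in a_record.items():
        b_val = b_record.get(field)
        if a_val == b_val:
            final[field] = a_val                           # A and B agree
        else:
            final[field] = arbitrate(field, a_val, b_val)  # disagreement: arbitrate
    return final
```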

Peer review process (A-R-RARB)

[Diagram: A → R → RARB, with labels "Already Filled In" and "Optional?"]

Proposed Mechanism
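By contrast, a sketch of the proposed peer-review flow under the same hypothetical data model: the reviewer R starts from A's already-filled-in entries and submits corrections only where needed, and in the full A-R-RARB variant the fields R changed can be arbitrated once more.

```python
# Sketch of the peer-review flow (A-R, optionally A-R-RARB). The record format
# and the optional rarb() callable are hypothetical illustrations.

def resolve_a_r(a_record, r_corrections, rarb=None):
    """a_record: dict of field -> A's value, shown pre-filled to the reviewer.
    r_corrections: dict of field -> R's corrected value, only for changed fields.
    rarb: optional callable(field, a_val, r_val) -> final value."""
    final = dict(a_record)                  # start from A's work
    for field, r_val in r_corrections.items():
        if rarb is None:
            final[field] = r_val            # plain A-R: accept the review
        else:                               # A-R-RARB: re-arbitrate R's changes
            final[field] = rarb(field, a_record.get(field), r_val)
    return final
```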

Two-Act Play

Act I: Experience

What is the role of experience in quality and efficiency?

Historical data analysis using full US and Canadian Census records from 1920 and earlier

Act II: Quality Control

Is peer review or arbitration better in terms of quality and efficiency?

Field experiment using 2,000 images from the 1930 US Census Data & corresponding truth set

Act I: Experience

Quality is estimated based on A-B agreement (no truth set)

Efficiency is calculated using keystroke-logging data with idle time and outliers removed
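A rough sketch of how these two measures could be computed; the field-level agreement and the 60-second idle cutoff below are illustrative assumptions, not the paper's exact procedure.

```python
from collections import defaultdict

def ab_agreement_by_field(record_pairs):
    """record_pairs: iterable of (a_record, b_record) dicts over many records.
    Returns field name -> share of records where A's and B's values match."""
    agree, total = defaultdict(int), defaultdict(int)
    for a_rec, b_rec in record_pairs:
        for field, a_val in a_rec.items():
            total[field] += 1
            agree[field] += int(a_val == b_rec.get(field))
    return {field: agree[field] / total[field] for field in total}

def active_seconds(keystroke_times, idle_cutoff=60.0):
    """Time on task from keystroke timestamps (in seconds), dropping idle gaps
    longer than idle_cutoff; per-record outliers would be filtered afterwards."""
    times = sorted(keystroke_times)
    gaps = (later - earlier for earlier, later in zip(times, times[1:]))
    return sum(gap for gap in gaps if gap <= idle_cutoff)
```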

A-B agreement by field

A-B agreement by language (1871 Canadian Census)

English: Given Name 79.8%, Surname 66.4%
French: Given Name 62.7%, Surname 48.8%

A-B agreement by experience

Birth Place: All U.S. Censuses

[Chart: agreement rate by A experience (novice ↔ experienced) and B experience (novice ↔ experienced)]

A-B agreement by experience

Given Name: All U.S. Censuses

[Chart: agreement rate by A experience (novice ↔ experienced) and B experience (novice ↔ experienced)]

A-B agreement by experience

Surname: All U.S. Censuses

[Chart: agreement rate by A experience (novice ↔ experienced) and B experience (novice ↔ experienced)]

A-B agreement by experience

Gender: All U.S. Censuses

[Chart: agreement rate by A experience (novice ↔ experienced) and B experience (novice ↔ experienced)]

A-B agreement by experience

Birthplace: English-speaking Canadian Census

[Chart: agreement rate by A experience (novice ↔ experienced) and B experience (novice ↔ experienced)]

Time & keystrokes by experience

Summary & Implications of Act I

Experienced workers are faster and more accurate, and these gains continue even at high levels of experience

- Focus on retention

- Encourage both novices & experts to do more

- Develop interventions to speed up experience gains (e.g., send users common mistakes made by people at their experience level)

Summary & Implications of Act I

Contextual knowledge (e.g., Canadian placenames) and specialized skills (e.g., French language fluency) are needed for some tasks

- Recruit people with existing knowledge & skills

- Provide contextual information when possible (e.g., Canadian placename prompts)

- Don’t remove context (e.g., captcha)

- Allow users to specialize?

Act II: Quality Control

A-B-ARB data from original transcribers (Feb 2011)

A-R-RARB data includes original A data and newly collected R and RARB data from people new to this method (Jan-Feb of 2012)

Truth Set data from a company with an independent audit by FSI experts

Statistical Test: mixed-model logistic regression (accurate or not) with random effects, controlling for expertise
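A simplified illustration of this test: the file and column names are hypothetical, and a plain logistic regression stands in for the paper's mixed model, which additionally includes random effects (e.g., per transcriber).

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical field-level results: one row per transcribed field, with columns
#   accurate  (1 if the value matches the truth set, else 0)
#   method    ("A", "A-B-ARB", "A-R", or "A-R-RARB")
#   expertise (the transcriber's prior experience, e.g., batches completed)
df = pd.read_csv("field_level_results.csv")

# Fixed-effects stand-in for the mixed-model logistic regression: accuracy as a
# function of quality-control method, controlling for expertise. Random effects
# (e.g., per worker or per image) would require a mixed-effects GLM instead.
model = smf.logit("accurate ~ C(method) + expertise", data=df).fit()
print(model.summary())
```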

Limitations

• Experience levels of R and RARB were lower than expected, though we did statistically control for this
• Original B data used in A-B-ARB for certain fields was transcribed in a non-standard manner, requiring adjustment

No Need for RARB

• No gains in quality from extra arbitration of peer-reviewed data (A-R = A-R-RARB)
• RARB takes some time, so the process is better off without it

Quality Comparison

• Both methods were statistically better than A alone

• A-B-ARB had slightly lower error rates than A-R

• R “missed” more errors, but also introduced fewer errors

Time Comparison

Summary & Implications of Act II

Peer Review shows considerable efficiency gains with nearly as good quality as Arbitration

- Prime reviewers to find errors (e.g., prompt them with expected # of errors on a page)

- Highlight potential problems (e.g., let A flag tough fields)

- Route difficult pages to experts

- Consider an A-R1-R2 process when high quality is critical

Summary & Implications of Act II

Reviewing reviewers isn't always worth the time

- At least in some contexts, Find-Fix may not need Verify

Quality of different fields varies dramatically

- Use different quality control mechanisms for harder or easier fields

Integrate human and algorithmic transcription

- Use algorithms on easy fields & integrate them into the review process so machine learning can occur

Questions

• Derek Hansen (dlhansen@byu.edu)
• Patrick Schone (BoiseBound@aol.com)
• Douglas Corey (corey@mathed.byu.edu)
• Matthew Reid (matthewreid007@gmail.com)
• Jake Gehring (GehringJG@familysearch.org)