Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh)...

40
Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) [email protected] Sukdith Punjasthitkul (Dartmouth College) [email protected] Scott Renton (University of Edinburgh) [email protected] Gavin Willshaw (University of Edinburgh) [email protected] Geoff Kaufman (Dartmouth College) [email protected] Mary Flanagan (Dartmouth College) [email protected] 1 Open Repositories – 9 th June 2015 CC-BY

Transcript of Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh)...

Page 1: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Crowdsourcing of image metadata:

Say what you seeClaire Knowles (University of Edinburgh) [email protected]

Sukdith Punjasthitkul (Dartmouth College) [email protected] Renton (University of Edinburgh) [email protected]

Gavin Willshaw (University of Edinburgh) [email protected] Kaufman (Dartmouth College) [email protected] Flanagan (Dartmouth College) [email protected]

1

Open Repositories – 9th June 2015CC-BY

Page 2: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Who, What, Why, When, HowQ. Who are the crowd?Q. What are we collecting?Q. Why are we collecting tags?Q. When are we integrating the tags?Q. How will metadata tags help us?

2

Page 3: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Who are Tiltfactor?Tiltfactor, Dartmouth College:

– “Games for Social Change”– Critical Play– Metadata Games, BHL’s Purposeful Gaming

3

Page 4: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Who are University of Edinburgh, Library and University Collections?

4

Page 5: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What is Crowdsourcing?“--crowdsourcing not as extracting labor

from a crowd, but as a way for us to invite the participation of amateurs (in the non-

derogatory sense of the word)—those with every bit of the potential to be a Darwin or Mendel—in the creation,

development, and further refinement of public good.”

Owens, T. (2013) Digital Cultural Heritage and the Crowd. Curator: The Museum Journal, Vol. 56. Iss. 1.

5

Page 6: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Who are the crowd?- Crowd

- large, anonymous masses of people- Sourcing

- outsourcing work and labour

6Images from the Roslin Glass Slides Collection, University of Edinburgh

Page 7: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Who are the crowd?

7

- “Knowledge Communities”- volunteers, engaged citizens, peers

- Participation- production, development, refinement

Images from the Roslin Glass Slides Collection, University of Edinburgh

Page 8: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Why develop metadata games?- Where there is no or little metadata- Enrichment of current metadata- Engagement with communities- Re-use & discovery of digitised collections

8

Page 9: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What can metadata games offer?

9

Art Tags added by staff and students in Innovative Learning Week, Feb 2015

- Tiltfactor 170,000+ tags for over 16,700 images- 13,000+ tags for UoE images in Tiltfactor games- UoE Library Labs 9,000+ tags from 150 users

Page 10: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What content? 1. Archives Collections

10http://collections.ed.ac.uk/iconics/record/51418

Page 11: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What content? 2. Art Collection

11http://collections.ed.ac.uk/art/record/22522

Page 12: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What content? 3. Archival Photography

12

Boston Public Library

Claire Knowles
can you add to the notes for this slide please Sukie
Page 13: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What communities exist?Academic researchersHobbyistsStudentsPublicEducationalists Self forming groupsmany more . . .

13

Page 14: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Engagement and finding advocates is as important as

building the tools!

14

Page 15: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What sort of game?

15

Page 16: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What motivates participants?

- Altruism- Display your skills/knowledge- Entertainment- Compound motivations

16

Page 17: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

How to engage a local audience?

17Metadata Games at Edinburgh Central Library – May 2015

Page 18: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Why engage communities?- Increased users- Widening participation- Feedback for improvements- Frees staff to focus on other tasks

18

Page 19: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

How to build trustworthiness?Creating incentives through design, to submit accurate, high quality tags:- Voting- Determining validity- Duplication is good

19

Page 20: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Potential workflows for integration

20

Page 21: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What is your risk factor?- Integration of folksonomies with

controlled metadata- Mitigation of risk- Display of tags- Varying risk factors

21

What’s the worst that can happen?

Page 22: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Questions, Questions, Questions- Which tags to import?- How to test the validity of tags?- Where to store the tags?- Where to display the tags?

22

Page 23: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What tags to remove?1) Stop words 2) Naughty words 3) Long answers4) Numerical tags5) Curatorial choices

e.g. From the Art Curators:abstract art,abstract,academic,art,black and white, black & white,block,bronze,carved stone, carving, colorful,Colors,color,colour,colours,dark,ink,fine art,forrest,fragment,ghandharan,headless, historical,history,human form,ink on paper,lines, modern art,sculpture,oil painting,oil,old,painting, person,paper,portrait,protrait,religion,rotate, sculpture,service,solid,watercolour,gandharan, landscape,portriat,abstact 23

Page 24: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

How were tags integrated at UoE?- DSpace Curation Task

- Matching on image filename- API to LUNA coming soon(ish)- Added to collections.ed.ac.uk

- 1,337 total tags- 776 unique tags - For 287 items

24

Page 25: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What are the benefits of integrating tags?

- Enriched metadata - Improved user experience- Improved search for users - New interactions with colleagues/peers- Greater interaction with collections

25

Page 26: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

26

http://collections.ed.ac.uk/iconics/record/51418

Page 27: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

27

http://collections.ed.ac.uk/art/record/22522

Page 28: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Have the tags had an impact? - Hard to link traffic to tags- Highest referrals from search

engines (Google)- Majority of search terms are

metadata related- Most visited site (MIMEd) is

the best described- Too early to tell

http://collections.ed.ac.uk/art/record/2187528

Page 29: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Say what you see

29

Page 30: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

“By the people, For the people”

Manzo C., Kaufman G., Punjasthitkul, S., Flanagan, M., "By the People, For the People": Assessing the Value of Crowdsourced, User-Generated Metadata”, Digital Humanities Quarterly, Volume 9 Number 1, 2015. 30

Page 31: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

31

Manzo C., Kaufman G., Punjasthitkul, S., Flanagan, M., "By the People, For the People": Assessing the Value of Crowdsourced, User-Generated Metadata”, Digital Humanities Quarterly, Volume 9 Number 1, 2015.

Page 32: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Not all images are equalgallimaufryˌɡalɪˈmɔːfri/nounnoun: gallimaufry; plural noun: gallimaufries1. a confused jumble or medley of things.2. "a glorious gallimaufry of childhood perceptions"

32http://images.is.ed.ac.uk/luna/servlet/UoEgal~5~5

Page 33: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Problem solved then?

- Mmmmm not quite- Visibility makes mistakes apparent:Zhouyi zhuanyi daquan / Book of Changes

33

Page 34: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What have we learnt?A. Who = communitiesA. What = simple gamesA. Why = addition #tags, engagement with communitiesA. When = after voting and duplicationA. How = improved search and engagement and …

34

Page 35: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What’s next - games?

35http://smorballgame.org http://beanstalkgame.org

Page 36: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

What’s next?Tool / workflows

- automate where possible → scalability- shortening the feedback loop- voting as analytical tool

More Research:- Compare tags to

authoritative metadata- More analysis of the

usage after tags added

36http://collections.ed.ac.uk/calendars/record/23121

Page 37: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Tiltfactor and UoE Links- Metadata Games

http://www.metadatagames.org- University of Edinburgh Library Labs

http://librarylabs.ed.ac.uk- Tiltfactor’s Github repositories

https://github.com/tiltfactor

- University of Edinburgh Library’s Github for Library Labshttps://github.com/UoEMainLibrary/librarylabs

- University of Edinburgh’s Collections Online http://collections.ed.ac.uk/

37

Page 38: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Other Crowdsourcing Links

Crowd Consortium http://crowdconsortium.org

Projects- DIY History, NYPL Building Inspector, Trove, etc.

Tools- Pybossa, From the Page, Metadata Games, Hive,

Panoptes etc.

Notes- Past Crowd Consortium webinars, meeting/workshops- Selected readings

38

Page 39: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Acknowledgements

Thank you for listening and also to:- Sukie for helping write the presentation- Scott, Ianthe, Charlie and Gavin for design,

development, testing and taking Library Labs on tour

- Max and Geoff for help with the submission- Mary for enabling Tiltfactor and University of

Edinburgh, Library and Collections to work together

39

Page 40: Crowdsourcing of image metadata: Say what you see Claire Knowles (University of Edinburgh) claire.knowles@ed.ac.uk Sukdith Punjasthitkul (Dartmouth College)

Questions?

Contact us at:Claire Knowles (University of Edinburgh) [email protected] @cgknowles

Sukie Punjasthitkul (Dartmouth College) [email protected]@tiltfactor

40