Museum Data Exchange

20
RLG Programs Museum Data Exchange: Making hay with harvestable data Günter Waibel Program Officer OCLC Research 12 November 2009 MCN Portland

description

A presentation focusing on the data analysis OCLC Research performed on 900K museum records, plus next steps for the nine project museums who now have the capacity to share standards-based records.

Transcript of Museum Data Exchange

Page 1: Museum Data Exchange

RLG Programs

Museum Data Exchange: Making hay with harvestable data

Günter WaibelProgram OfficerOCLC Research

12 November 2009MCN Portland

Page 2: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

2

Museum Data Exchange

Create ToolsExtract CDWA Lite XMLPublish via OAI-PMH

Harvest DataTest toolsCreate Research Aggregation

Analyze DataStandards compliance?Interoperability?

What now?

Harvard University Art MuseumsMetropolitan Museum of ArtNational Gallery of ArtPrinceton University Art MuseumYale University Art GalleryCleveland Museum of ArtVictoria & Albert Museum (UK)National Gallery of Canada (CA)Minneapolis Institute of Artfunded by Andrew W. Mellon

Foundation

Page 3: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

3

Open Archives Initiative (OAI) Protocol for Metadata Harvesting

MuseumsOCLCResearch

TMSCOBOA

TOAICatMuseum

Research Aggregatio

n

Museums

Basic Interface

Page 4: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

5

Research Aggregation

9 Museums887,572

Records

Page 5: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

6

Analyzing the Research Aggregation

My data• Are CDWA Lite required fields present?• What is the state of CCO compliance of the data?• Who uses which controlled vocabularies, and does their use

demonstrably aid retrieval?• Suggest strategies to work around the inevitable

inconsistencies in institutional data

Aggregate data• Which CDWA Lite fields are used by all institutions?

• Do queries across the aggregate return meaningful results?• How is my cataloging different from other institution’s

cataloging?• Suggest strategies to work around the inevitable

inconsistencies in aggregate data

Page 6: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

7

Available at:http://hangingtogether.org/wp-content/uploads/2009/04/methodology_03-30-2009.pdf

Page 7: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

8

Use of CDWA Lite

131 CDWA Lite units of information

Mu

seu

ms

54% not used

8% used by all

Page 8: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

9

Conformance to CDWA Lite

Required/Highly recommended elements

Record

s

90% consistency

for 7 of 17

elements

Page 9: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

10

Impact of COBOAT mapping

based on museum mapping documents uses 32 units of information of the 131 CDWA Lite

defined all required/recommended elements and

attributes are in the default mapping, except for <subjectTerm>

4 out of 6 COBOAT contributors used, in various combinations, 11 additional elements and attributes

Page 10: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

11

Controlled Vocabularies

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

379 838 519 577 232 200 200

TGN ULAN TGN AAT AAT AAT TGN

locationName nameCreator nationalityCreatorobjectWorkType roleCreator subjectTerm subjectTerm

Matches

preferred non-preferred

match rates< 42%

Page 11: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

12

Data Values correlated to records

Top 100<objectWorkTy

pe>

All the Institution’s

100K+ Records

AAT Match27

No AAT Match

73 73%

99%

Page 12: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

13

Data Values correlated to records

Top 100<nameCreator>

All the Institution’s

200K+ records

ULAN Match49

No ULAN

Match51

25%

50%

Page 13: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

14

What now?

“We must create a single information system which embraces all museum holdings in the United States”

- MCN/IBM 1969

“Universal access to cultural heritage will likely soon become a reality, but museums may be

losing their role as key players.”

- Nicholas Crofts 2008

Page 14: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

15

publishing your

collections

Page 15: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

16

reaching educators

Page 16: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

17

reachinghigher

education

Page 17: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

18

sharing infrastructu

re

Page 18: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

19

projecting

authority

Page 19: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

20

nationalcollaborati

on

Page 20: Museum Data Exchange

Programs & Research Günter Waibel – Museum Data ExchangeMCN Portland – 12 November 2009

21

Thank you!

Thanks also to my colleagues and collaborators Bruce Washburn and Ralph LeVan.

Museum Data Exchangehttp://www.oclc.org/research/activities/museumdata/default.htm

- report forthcoming -

OAICatMuseum 1.0http://www.oclc.org/research/activities/oaicatmuseum/default.htm

COBOAThttp://www.oclc.org/research/activities/coboat/default.htm

Günter [email protected]: guWa

blog: hangingtogether.org