Pamwg 2012ahm

16
DataONE Preservation and Metadata Working Group September 2012 DataONE All Hands Meeting

description

Combined slides for PAMWG at DataONE All Hands meeting.

Transcript of Pamwg 2012ahm

Page 1: Pamwg 2012ahm

DataONE Preservation and Metadata Working Group

September 2012DataONE All Hands Meeting

Page 2: Pamwg 2012ahm

DataONE Preservation in a Nutshell*

1. Keep the bits safe• Replicate the data and metadata• Do local security and media refresh

2. Protect their form and meaning• Know what you have, and know your rights• Know when to migrate and emulate

3. Safeguard the guardians• Organizational and network sustainability

* DataONE Preservation Strategy, PWG workshop, Chicago, December 5-6, 2010

Page 3: Pamwg 2012ahm

DataONE Metadata WG Goals

1. Build an e-dictionary to look up metadata terms and to publish your own terms

2. Develop community focusing on data curation, citation, and discovery for DataONE

3. Develop a community to sustain it

Page 4: Pamwg 2012ahm

4

Agreeing on terms: a totally different take

• Traditional metadata standards are controlled• Change by committee is ugly, costly, and slow• Example: Dublin Core, 15 cross-domain terms

• 5 years to agree, highly divergent local use, change relegated to external ontologies

Page 5: Pamwg 2012ahm

The Metadata Universe

Jenn Riley, IU

Page 6: Pamwg 2012ahm

The Metadata Universe

Jenn Riley, IU

Page 7: Pamwg 2012ahm

The Metadata Universe

Jenn Riley, IU

Page 8: Pamwg 2012ahm

The Metadata Universe

Jenn Riley, IU

Page 9: Pamwg 2012ahm

The Metadata Universe

Jenn Riley, IU

Page 10: Pamwg 2012ahm

10

Metadata Vision

Instead, create one dictionary• Crowd sourced plus lightly supervised canon• Anyone can look up terms• Any part of “metadata speech”• Anyone can propose and refine their terms• Strong terms rise, weak terms decline

Greenberg, J., Murillo, A. and Kunze, J (in press). Ontological Empowerment: Sustainability via Ownership. In K. LeBarre and J. Tennis Advances in Classification Research, 23nd Annual ASIS SIG/CR Workshop, 26 October 2012, Baltimore, MD.

Page 11: Pamwg 2012ahm

DataONE Preservation and Metadata Working Group

September 2012DataONE All Hands Meeting

Page 12: Pamwg 2012ahm

12

Metadata Vision

One dictionary• Crowd sourced plus lightly supervised canon• Anyone can look up terms• Any part of “metadata speech”• Anyone can propose and refine their terms• Strong terms rise, weak terms decline

Greenberg, J., Murillo, A. and Kunze, J (in press). Ontological Empowerment: Sustainability via Ownership. In K. LeBarre and J. Tennis Advances in Classification Research, 23nd Annual ASIS SIG/CR Workshop, 26 October 2012, Baltimore, MD.

Page 13: Pamwg 2012ahm

13

What we did

• Met• Laughed, Talked, Cried, Hugged• Conquered

Page 14: Pamwg 2012ahm

14

Use cases

Six solid cases, eg,• Sally Scientist is about to enter column headers

for observational data on Pikas in the alpine for data to go into Dryad

• Doug Data wants to use Sally’s observations and needs to lookup the definition of one of her column headers

Page 15: Pamwg 2012ahm

Mockup

Page 16: Pamwg 2012ahm

Work packages in the next 2 years

Move from pre-proof-of-concept to Beta• Software development• Assessment (eg, students)• Moderation protocols – community elders• Establish community identity and rhythm

• Not completely flat, not completely crowd-sourced