DataONE Preservation and Metadata Working Group
September 2012
DataONE All Hands Meeting
DataONE Preservation in a Nutshell*
1. Keep the bits safe
   • Replicate the data and metadata
   • Do local security and media refresh
2. Protect their form and meaning
   • Know what you have, and know your rights
   • Know when to migrate and emulate
3. Safeguard the guardians
   • Organizational and network sustainability
* DataONE Preservation Strategy, PWG workshop, Chicago, December 5-6, 2010
DataONE Metadata WG Goals
1. Build an e-dictionary to look up metadata terms and to publish your own terms
2. Develop a community focused on data curation, citation, and discovery for DataONE
3. Develop a community to sustain it
Agreeing on terms: a totally different take
• Traditional metadata standards are controlled
• Change by committee is ugly, costly, and slow
• Example: Dublin Core, 15 cross-domain terms
• 5 years to agree, highly divergent local use, change relegated to external ontologies
The Metadata Universe
Jenn Riley, IU
Metadata Vision
Instead, create one dictionary
• Crowd-sourced plus lightly supervised canon
• Anyone can look up terms
• Any part of “metadata speech”
• Anyone can propose and refine their terms
• Strong terms rise, weak terms decline
Greenberg, J., Murillo, A. and Kunze, J. (in press). Ontological Empowerment: Sustainability via Ownership. In K. LeBarre and J. Tennis (Eds.), Advances in Classification Research, 23rd Annual ASIS SIG/CR Workshop, 26 October 2012, Baltimore, MD.
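The "one dictionary" idea can be illustrated with a minimal sketch. Nothing below is the working group's design: the class names, the vote counter, and the `canonical` flag are all assumptions, chosen only to show how open proposals, refinement, lookup, and a lightly supervised canon might fit together.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """One proposed metadata term (hypothetical structure)."""
    name: str
    definition: str
    proposer: str
    votes: int = 0            # net community support (assumed mechanism)
    canonical: bool = False   # set by a moderator, not by votes


@dataclass
class Dictionary:
    terms: dict = field(default_factory=dict)

    def propose(self, name, definition, proposer):
        """Anyone can add a term; proposing an existing name refines its definition."""
        key = name.lower()
        if key in self.terms:
            self.terms[key].definition = definition
        else:
            self.terms[key] = Term(name, definition, proposer)
        return self.terms[key]

    def vote(self, name, delta):
        """Raise or lower a term's standing."""
        self.terms[name.lower()].votes += delta

    def lookup(self, name):
        """Anyone can look up a term; returns None if it is unknown."""
        return self.terms.get(name.lower())

    def ranked(self):
        """Canonical terms first, then by votes, strongest first."""
        return sorted(self.terms.values(),
                      key=lambda t: (t.canonical, t.votes),
                      reverse=True)


# Invented example terms, for illustration only.
d = Dictionary()
d.propose("elevation", "Height above sea level, in meters", "sally")
d.propose("elev", "Elevation", "doug")
d.vote("elevation", +3)
d.vote("elev", -1)
ranking = [t.name for t in d.ranked()]
```

Sorting on `(canonical, votes)` is one simple way to let strong terms rise while still giving moderators the final say on the canon; a real system would need its own ranking and moderation rules.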
What we did
• Met
• Laughed, Talked, Cried, Hugged
• Conquered
Use cases
Six solid cases, e.g.,
• Sally Scientist is about to enter column headers for observational data on pikas in the alpine, for data that will go into Dryad
• Doug Data wants to use Sally’s observations and needs to look up the definition of one of her column headers
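The two use cases above can be sketched in a few lines. This is an illustration under stated assumptions: the dictionary contents, the header names, and the `check_headers` helper are all invented here and do not describe an existing DataONE or Dryad interface.

```python
# Hypothetical dictionary entries; terms and definitions are invented for illustration.
DICTIONARY = {
    "elevation_m": "Elevation of the observation site above sea level, in meters.",
    "obs_date": "Calendar date on which the observation was made (ISO 8601).",
}


def check_headers(headers, dictionary):
    """Split headers into those already defined and those still needing a proposed term."""
    defined = {h: dictionary[h] for h in headers if h in dictionary}
    undefined = [h for h in headers if h not in dictionary]
    return defined, undefined


# Sally checks her column headers before depositing her data.
sally_headers = ["obs_date", "elevation_m", "pika_count"]
defined, undefined = check_headers(sally_headers, DICTIONARY)

# Doug later looks up one of Sally's headers.
doug_definition = DICTIONARY.get("elevation_m")
```

Here `undefined` lists the headers Sally would still need to propose as new terms, and Doug's lookup returns the shared definition for a header she reused.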
Mockup
Work packages in the next 2 years
Move from pre-proof-of-concept to Beta
• Software development
• Assessment (e.g., students)
• Moderation protocols – community elders
• Establish community identity and rhythm
• Not completely flat, not completely crowd-sourced