Metadata for Digital Preservation: A Status Report on PREMIS

Priscilla Caplan, FCLA; Nancy Hoebelheinrich, Stanford University. CNI Fall Task Force Meeting, December 6-7, 2004.


Page 1: Metadata for Digital Preservation:  A Status Report on PREMIS

Metadata for Digital Preservation: A Status Report on PREMIS

Priscilla Caplan, FCLA
Nancy Hoebelheinrich, Stanford University

CNI Fall Task Force Meeting
December 6-7, 2004

Page 2: Metadata for Digital Preservation:  A Status Report on PREMIS

CNI Fall 2004 Metadata for Digital Preservation

Preservation Metadata: Implementation Strategies

OCLC/RLG Preservation Metadata Framework Working Group

OCLC/RLG Preservation Metadata Working Group
• Convened March 2000
• Looked at CEDARS, NLA, NEDLIB, OCLC

Preservation metadata framework (June 2002)
• Synthesized elements from existing sets
• Based on OAIS information model
• Set of “prototype” preservation metadata elements

Page 3: Metadata for Digital Preservation:  A Status Report on PREMIS

PREMIS

June 2003: OCLC/RLG sponsored new working group: PREMIS
• Preservation Metadata: Implementation Strategies

Objectives
• Define “core” set of preservation metadata elements, with supporting data dictionary, applicable to broad range of digital preservation activities
• Identify and evaluate alternative strategies for encoding, storing, managing, and exchanging preservation metadata

http://www.oclc.org/research/projects/pmwg/

Page 4: Metadata for Digital Preservation:  A Status Report on PREMIS

Membership

Priscilla Caplan, FCLA (Chair)
Rebecca Guenther, LC (Chair)
Michael Alexander, British Library
George Barnum, GPO
Charles Blair, U. of Chicago
Olaf Brandt, U. of Gottingen
Adam Farquhar, British Library
David Gewirtz, Yale
Kevin Glavash, MIT/Dspace
Cathy Hartman, U. of N. Texas
Helen Hodgart, British Library
Nancy Hoebelheinrich, Stanford
Roger Howard/Sally Hubbard, Getty Museum
Pam Kircher, OCLC
John Kunze, Calif. Digital Library
Brian Lavoie, OCLC liaison
Robin Dale, RLG liaison
Vicky McCarger, LA Times
Jerry McDonough, NYU/METS
Evan Owens, JSTOR
Erin Rhodes, NARA
Madi Solomon, Walt Disney Co.
Angela Spinazze, ATSPIN
Stefan Strathmann, U. of Gottingen
Gunter Waibel, RLG
Lisa Weber, NARA
Robin Wendler, Harvard
Hilde van Wijngaarden, KB
Andrew Wilson, NAA

Page 5: Metadata for Digital Preservation:  A Status Report on PREMIS

Advisory Committee

Howard Besser, UCLA
Liz Bishoff, OCLC (via Colorado Digitization Program)
Gerard Clifton, National Library of Australia
Gail Hodge, CENDI
Steve Knight, National Library of New Zealand
Maggie Jones, Digital Preservation Coalition
Nancy McGovern, Cornell
Cliff Morgan, Wiley UK
Richard Rinehart, U. of California, Berkeley

Page 6: Metadata for Digital Preservation:  A Status Report on PREMIS

Implementation Survey Report

State of the art in Winter 2003/2004
28 libraries, 7 archives, 3 museums, and 11 other
13 different countries; 45% from U.S.
38% in planning; 33% development; 46% production

Page 7: Metadata for Digital Preservation:  A Status Report on PREMIS

Core Elements

Mission: Define a core set of implementable preservation metadata elements.

Page 8: Metadata for Digital Preservation:  A Status Report on PREMIS

Core Elements

Mission: Define a core set of implementable preservation metadata elements.

• Information that supports and documents the digital preservation process;

• Information that supports the viability, renderability, understandability, identity and authenticity of digital objects over time.

Page 9: Metadata for Digital Preservation:  A Status Report on PREMIS

Core Elements

Mission: Define a core set of implementable preservation metadata elements.

• What most working preservation repositories are likely to need to know.

Page 10: Metadata for Digital Preservation:  A Status Report on PREMIS

Core Elements

Mission: Define a core set of implementable preservation metadata elements.

• As rigorous as possible
• As much explanation as possible
• Implementation neutral -- “This is what you have to know”
• Values can be automatically supplied and processed -- no lengthy textual descriptions

Page 11: Metadata for Digital Preservation:  A Status Report on PREMIS

Core Elements: Data Model

Page 12: Metadata for Digital Preservation:  A Status Report on PREMIS

Sample data dictionary entry

Semantic unit: size
Semantic components: None
Definition: The size of a file or bitstream in bytes.
Rationale: Size is useful for knowing whether you have retrieved the correct number of bytes from storage and whether an application has enough room to move or process files. It might also be used when billing for storage.
Data constraint: Integer
Examples: 2038927

Level:          Representation    File              Bitstream
Scope:          Not applicable    Applicable        Applicable
Repeatability:                    Not repeatable    Not repeatable
Obligation:                       Optional          Optional

Notes: May be repeated for embedded files.
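The rationale in this entry (checking that the correct number of bytes came back from storage) can be illustrated with a minimal Python sketch. The function names `record_size` and `retrieved_size_matches` are hypothetical and are not part of the PREMIS data dictionary; the sketch only assumes that size is stored as a plain integer count of bytes, as the entry states.

```python
import os
import tempfile


def record_size(path: str) -> int:
    """Return the size of the file at `path` in bytes (hypothetical helper)."""
    return os.path.getsize(path)


def retrieved_size_matches(recorded_size: int, retrieved: bytes) -> bool:
    """Check that the number of retrieved bytes equals the recorded size."""
    return len(retrieved) == recorded_size


# Demo: write a small file, record its size, then "retrieve" it again.
with tempfile.NamedTemporaryFile(delete=False) as handle:
    handle.write(b"example content")
    demo_path = handle.name

recorded = record_size(demo_path)
with open(demo_path, "rb") as handle:
    data = handle.read()
os.remove(demo_path)

print(retrieved_size_matches(recorded, data))       # complete retrieval
print(retrieved_size_matches(recorded, data[:-1]))  # one byte missing
```

A size check of this kind only detects truncated or padded retrievals; it says nothing about whether the bytes themselves have changed.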

Page 13: Metadata for Digital Preservation:  A Status Report on PREMIS

Semantic units pertaining to Objects

objectIdentifier
contentLocation
originalName
preservationLevel
objectCharacteristics
environment

Page 14: Metadata for Digital Preservation:  A Status Report on PREMIS

objectCharacteristics

compositionLevel
fixity
size
format
inhibitors
significantProperties
creatingApplication
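As a rough illustration of how a repository might hold these components in memory, the following Python sketch models them as a dataclass and fills in the three values that can be derived automatically from the bytes. All field types, the defaults, the choice of SHA-256 for fixity, and the helper names `characterize` and `fixity_ok` are assumptions made for the example; the slide lists only the component names.

```python
import hashlib
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ObjectCharacteristics:
    # Names mirror the semantic units on the slide; types are assumptions.
    composition_level: int = 0
    fixity_algorithm: str = "sha256"  # assumed algorithm, not from the slide
    fixity_digest: str = ""
    size: Optional[int] = None
    format: Optional[str] = None
    inhibitors: List[str] = field(default_factory=list)
    significant_properties: List[str] = field(default_factory=list)
    creating_application: Optional[str] = None


def characterize(content: bytes, format_name: str) -> ObjectCharacteristics:
    """Fill in the values that can be computed from the bytes alone."""
    return ObjectCharacteristics(
        fixity_digest=hashlib.sha256(content).hexdigest(),
        size=len(content),
        format=format_name,
    )


def fixity_ok(content: bytes, recorded: ObjectCharacteristics) -> bool:
    """Recompute the digest and compare it with the recorded one."""
    digest = hashlib.new(recorded.fixity_algorithm, content).hexdigest()
    return digest == recorded.fixity_digest


chars = characterize(b"abc", "text/plain")
print(chars.size, fixity_ok(b"abc", chars), fixity_ok(b"abd", chars))
```

Keeping the algorithm name next to the digest lets a later check recompute the value without guessing how it was produced.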

Page 15: Metadata for Digital Preservation:  A Status Report on PREMIS

Semantic units pertaining to Events

eventIdentifier
  • eventIdentifierScheme
  • eventIdentifierValue
eventType
eventOutcome
eventOutcomeDetail
eventDetail
eventDateTime
relatedPermission
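The Event semantic units above could be captured as a small record type. This is a minimal sketch, assuming string values throughout, a UUID-based identifier scheme, and ISO 8601 timestamps in UTC; none of these choices comes from the slides, and `new_event` is a hypothetical helper.

```python
import uuid
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass(frozen=True)
class EventIdentifier:
    scheme: str  # eventIdentifierScheme
    value: str   # eventIdentifierValue


@dataclass(frozen=True)
class Event:
    identifier: EventIdentifier
    event_type: str
    event_date_time: str
    event_outcome: str
    event_detail: Optional[str] = None
    event_outcome_detail: Optional[str] = None
    related_permission: Optional[str] = None


def new_event(event_type: str, outcome: str,
              detail: Optional[str] = None) -> Event:
    """Create an event with a generated identifier and a UTC timestamp."""
    return Event(
        identifier=EventIdentifier(scheme="UUID", value=str(uuid.uuid4())),
        event_type=event_type,
        event_date_time=datetime.now(timezone.utc).isoformat(),
        event_outcome=outcome,
        event_detail=detail,
    )


# "fixity check" and "success" are illustrative values, not a controlled
# vocabulary taken from the slides.
event = new_event("fixity check", "success", detail="scheduled audit")
print(event.identifier.scheme, event.event_type, event.event_outcome)
```

Because every value is generated or passed in as a short string, a record like this fits the slide's goal of values that can be supplied and processed automatically.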

Page 16: Metadata for Digital Preservation:  A Status Report on PREMIS

Semantic units pertaining to Agents

agentIdentifier
  • agentIdentifierScheme
  • agentIdentifierValue
agentName
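Since exchanging preservation metadata is one of the group's stated objectives, the Agent units can serve as a small example of a round trip through XML. The sketch below is illustrative only: it reuses the semantic unit names as element names and invents a wrapping `agent` element, and it does not represent any PREMIS schema, which had not been published at the time of this talk. The sample scheme, value, and name are made up.

```python
import xml.etree.ElementTree as ET


def agent_to_xml(scheme: str, value: str, name: str) -> str:
    """Serialize an agent; element names are borrowed from the slide."""
    agent = ET.Element("agent")  # hypothetical wrapper element
    identifier = ET.SubElement(agent, "agentIdentifier")
    ET.SubElement(identifier, "agentIdentifierScheme").text = scheme
    ET.SubElement(identifier, "agentIdentifierValue").text = value
    ET.SubElement(agent, "agentName").text = name
    return ET.tostring(agent, encoding="unicode")


def agent_from_xml(xml_text: str) -> dict:
    """Parse the XML produced by agent_to_xml back into a dictionary."""
    agent = ET.fromstring(xml_text)
    return {
        "scheme": agent.findtext("agentIdentifier/agentIdentifierScheme"),
        "value": agent.findtext("agentIdentifier/agentIdentifierValue"),
        "name": agent.findtext("agentName"),
    }


xml_text = agent_to_xml("local", "agent-001", "Example Repository Staff")
print(xml_text)
print(agent_from_xml(xml_text))
```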

Page 17: Metadata for Digital Preservation:  A Status Report on PREMIS

Semantic units pertaining to Rights

permissionStatement
relatedObject
grantingAgent
grantingAgreement
permission
act
restriction

Page 18: Metadata for Digital Preservation:  A Status Report on PREMIS

Next steps:

PREMIS ACTIVITIES
• Complete data dictionary (January 2005)
• Write narrative report
• Develop XML schemas for exchanging metadata

FOLLOW-UP ACTIVITIES
• Community outreach
• Establish feedback/maintenance mechanism
• Testbeds for implementation and exchange

Page 19: Metadata for Digital Preservation:  A Status Report on PREMIS

For More Information:

PREMIS Web Site
• www.oclc.org/research/projects/pmwg

“Implementing Metadata in Digital Preservation Systems: The PREMIS Activity”, D-Lib (April ‘04)
• www.dlib.org/dlib/april04/lavoie/04lavoie.html

RLG DigiNews, October 2004 and December 2004 issues
• www.rlg.org/en/page.php?Page_ID=12081

Priscilla Caplan: [email protected]

Rebecca Guenther: [email protected]