Establishing Metadata Practices

Establishing Metadata Practices Chris Burns Winona Salesky


Presentation at the New England Archivists Spring Meeting in Newport, Rhode Island - March 29, 2008.

Transcript of Establishing Metadata Practices

Page 1: Establishing Metadata Practices

Establishing Metadata Practices

Chris Burns

Winona Salesky

Page 2: Establishing Metadata Practices

Background

IMLS
Center for Digital Initiatives
Digital Initiatives Librarian

Page 3: Establishing Metadata Practices
Page 4: Establishing Metadata Practices

Infrastructure

People
Space
Digital Asset Management System
Interface

Page 5: Establishing Metadata Practices

Digital Asset Management Systems

The Contenders
ContentDM
eXist
Fedora
Greenstone
XTF

Page 6: Establishing Metadata Practices

Evaluation Matrix

| Software | ContentDM | Greenstone | Fedora | eXist | XTF |
| --- | --- | --- | --- | --- | --- |
| Data Types | | | | | |
| EAD Finding Aids | No | No | Yes | Yes | Yes |
| TEI (full text) | No TEI, poor full text handling | No TEI, full text stored as plain text | Yes | Yes | Yes |
| Descriptive Metadata | Yes | Yes | Yes | Yes | Yes |
| Preservation Metadata | No | No | Yes | Yes | Yes |
| Structural Metadata (METS) | Outputs METS | Outputs METS | Yes | Yes | Yes |
| Other formats (sound, video, PDF, etc.) | Limited | Unclear | Yes | Yes | Limited |
| Costs | | | | | |
| Purchase Price | Annual fee | Open source & free | Open source & free | Open source & free | Open source & free |
| Staff Time | Some time for customization, but is essentially an out-of-the-box system that runs as is | Some time for customization, but is essentially an out-of-the-box system that runs as is | Lots of customization needed. High learning curve. | Lots of customization, depending on system needs. | Unclear. |
| Software “Add-Ons” | Some versions come with JPEG2000 and OCR | None | Possible integration with a “METS navigator” and XForms | Integration with METS navigator and XForms | Integration with Fedora |

Page 7: Establishing Metadata Practices

Evaluation Matrix (cont.)

| Software | ContentDM | Greenstone | Fedora | eXist | XTF |
| --- | --- | --- | --- | --- | --- |
| Searching | | | | | |
| Simple | Yes | Yes | Yes | Yes | Yes |
| Advanced | Yes (fielded searching and full text) | Yes (customizable field searching and full text) | Yes (may be customizable) | Yes, customizable | Yes, customizable |
| Implementation | Comes ready to go, can be customized | Comes ready to go, can be customized | Unclear | Must be written | Comes ready to go, can be customized |
| Dynamic Browsing | Yes, can browse on indexed terms | ? | Unclear | Yes | Unclear |
| User Interface | | | | | |
| Customizable | Somewhat | Somewhat | Yes | Fully customizable | Fully customizable |
| Browse Options | Somewhat customizable | Titles, subjects. Others | Unclear | Fully customizable | Unclear |
| Preservation | | | | | |
| Speed of deployment | 2-3 months | 2-3 months | 12-14 months | 6-8 months | Unclear |
| Proprietary | Yes | No | No | No | No |
| Ability to extract data for future migrations | A variety of export methods for descriptive metadata only. | Yes. METS record with Greenstone metadata format for technical, relative links to images | Yes, METS record | Yes, METS record | Yes, METS record |

Page 8: Establishing Metadata Practices

eXist XML Native Database

Open source XML native database
Stores data as XML; retains data integrity
Development time is reasonable
Easy to integrate web services
Easy to export data to future digital asset management systems if necessary
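As one illustration of the web-services point, eXist exposes stored documents over a REST interface that accepts an XQuery in the `_query` parameter and a result limit in `_howmany`. The sketch below only builds such a request URL. The host, port, and collection path are placeholder assumptions, not the project's actual configuration.

```python
from urllib.parse import urlencode


def exist_query_url(base_url: str, collection: str, xquery: str, how_many: int = 10) -> str:
    """Build a GET URL for eXist's REST interface (no request is sent)."""
    params = urlencode({"_query": xquery, "_howmany": how_many})
    return f"{base_url.rstrip('/')}/{collection.strip('/')}?{params}"


# Hypothetical server and collection; substitute real values before use.
url = exist_query_url(
    "http://localhost:8080/exist/rest",
    "/db/example",
    "//*:title",  # any element named "title", regardless of namespace
)
print(url)
```

Because the response is plain XML, the same request can be issued from a script, a browser, or another application without an eXist-specific client.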

Page 9: Establishing Metadata Practices

Faceted Browsing - Solr

Increased avenues for discovery
Allows users to easily “build” complex searches
Prevents empty result sets
Integrates keyword searching with browse-ability
Always a visible “path” so users never feel lost
Allows users to expand and narrow result sets
Easier to explore the true extent of the collection
Recognition over recall
Easy to add new facets, categories, or items
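The core idea behind these points can be sketched without Solr: facet counts are computed over the current result set only, so every value offered to the user leads to at least one item. The following is a minimal in-memory illustration, not Solr's API. The records and field names are invented for the example.

```python
from collections import Counter

# Hypothetical item records; field names are illustrative only.
RECORDS = [
    {"title": "Letter to Senator", "type": "Letter", "decade": "1860s", "subject": "Politics"},
    {"title": "Farm photograph", "type": "Photograph", "decade": "1900s", "subject": "Agriculture"},
    {"title": "Campaign letter", "type": "Letter", "decade": "1900s", "subject": "Politics"},
    {"title": "Town meeting minutes", "type": "Manuscript", "decade": "1860s", "subject": "Politics"},
]


def narrow(records, selections):
    """Keep only records matching every selected facet value (the visible 'path')."""
    return [r for r in records if all(r.get(f) == v for f, v in selections.items())]


def facet_counts(records, fields):
    """Count values per facet field within the current result set only."""
    return {f: Counter(r[f] for r in records if f in r) for f in fields}


# The user picks a subject; only facet values that still have hits are offered.
path = {"subject": "Politics"}
results = narrow(RECORDS, path)
counts = facet_counts(results, ["type", "decade"])
for field, values in counts.items():
    print(field, dict(values))

# Narrowing further extends the path; removing a key would widen it again.
path["type"] = "Letter"
results = narrow(RECORDS, path)
print([r["title"] for r in results])
```

Since "Photograph" has no items under the selected subject, it never appears as a choice, which is how faceting avoids empty result sets.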

Page 10: Establishing Metadata Practices
Page 11: Establishing Metadata Practices
Page 12: Establishing Metadata Practices

Some limitations of facets

Use of facets will make inconsistencies in metadata obvious to users

Some facets become unmanageable with large result sets

Facets work better on some fields than others
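The first limitation is easy to reproduce: variant spellings of the same value show up as separate facet entries. The sketch below uses invented place names and a deliberately crude normalization rule to show the effect. It is a stand-in for illustration, not a substitute for authority control.

```python
from collections import Counter
import re

# Hypothetical place-name values as they might appear across records.
places = ["Burlington, VT", "Burlington, Vt.", "burlington, vt", "Montpelier, VT"]


def normalize(value: str) -> str:
    """Crude cleanup: trim, collapse whitespace, drop trailing periods, case-fold."""
    value = re.sub(r"\s+", " ", value.strip())
    return value.rstrip(".").casefold()


raw = Counter(places)
cleaned = Counter(normalize(p) for p in places)

print(len(raw), "facet values before normalization")
print(len(cleaned), "facet values after normalization")
```

Running a count like this over a field before exposing it as a facet is one way to judge whether the field is consistent enough to browse on.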

Page 13: Establishing Metadata Practices

Metadata Selection

METS (Metadata Encoding & Transmission Standard): structural metadata

Dublin Core / MODS: descriptive metadata

TEI
EAD
Preservation Metadata
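To make the division of labor concrete, here is a minimal sketch of a METS document that wraps a MODS record for description and uses a file section plus structural map for page order. The function name, IDs, and file names are assumptions for illustration. A production record would carry far more detail and should be validated against the METS and MODS schemas.

```python
import xml.etree.ElementTree as ET

METS = "http://www.loc.gov/METS/"
MODS = "http://www.loc.gov/mods/v3"
XLINK = "http://www.w3.org/1999/xlink"
for prefix, uri in (("mets", METS), ("mods", MODS), ("xlink", XLINK)):
    ET.register_namespace(prefix, uri)


def build_mets(title: str, page_files: list[str]) -> ET.Element:
    """Build a skeletal METS record for one item (hypothetical helper)."""
    mets = ET.Element(f"{{{METS}}}mets")

    # Descriptive metadata: a MODS record wrapped inside a dmdSec.
    dmd = ET.SubElement(mets, f"{{{METS}}}dmdSec", ID="DMD1")
    wrap = ET.SubElement(dmd, f"{{{METS}}}mdWrap", MDTYPE="MODS")
    xml_data = ET.SubElement(wrap, f"{{{METS}}}xmlData")
    mods = ET.SubElement(xml_data, f"{{{MODS}}}mods")
    title_info = ET.SubElement(mods, f"{{{MODS}}}titleInfo")
    ET.SubElement(title_info, f"{{{MODS}}}title").text = title

    # File inventory and structural map: one file and one page div per scan.
    file_sec = ET.SubElement(mets, f"{{{METS}}}fileSec")
    group = ET.SubElement(file_sec, f"{{{METS}}}fileGrp", USE="master")
    struct = ET.SubElement(mets, f"{{{METS}}}structMap", TYPE="physical")
    item = ET.SubElement(struct, f"{{{METS}}}div", TYPE="item", DMDID="DMD1")
    for order, href in enumerate(page_files, start=1):
        file_id = f"FILE{order}"
        file_el = ET.SubElement(group, f"{{{METS}}}file", ID=file_id)
        ET.SubElement(
            file_el,
            f"{{{METS}}}FLocat",
            {"LOCTYPE": "URL", f"{{{XLINK}}}href": href},
        )
        page = ET.SubElement(item, f"{{{METS}}}div", TYPE="page", ORDER=str(order))
        ET.SubElement(page, f"{{{METS}}}fptr", FILEID=file_id)
    return mets


doc = build_mets("Sample letter", ["page1.tif", "page2.tif"])
print(ET.tostring(doc, encoding="unicode"))
```

Keeping the descriptive record in its own wrapped section means the MODS could be swapped for Dublin Core without touching the structural map.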

Page 14: Establishing Metadata Practices

Levels of Description

Collection level
Items
Items with Transcriptions or OCR
Items with pre-existing descriptive metadata
Folder Level Description
Finding Aids with Links to Digital Objects

Page 15: Establishing Metadata Practices
Page 16: Establishing Metadata Practices
Page 17: Establishing Metadata Practices
Page 18: Establishing Metadata Practices
Page 19: Establishing Metadata Practices
Page 20: Establishing Metadata Practices
Page 21: Establishing Metadata Practices

Metadata Workflow

Captured at the time of scanning
OCR/Transcription
Descriptions
Subject Headings
Authority Control

Page 22: Establishing Metadata Practices

Structural Metadata Creation

Page 23: Establishing Metadata Practices

Descriptive Metadata Creation

XForms
Platform and device independent
Separates data and logic from presentation
XML in, XML out
XML Schema validation
Reduces or eliminates the need for scripting
Does not require expensive round-tripping when the data is modified

Page 24: Establishing Metadata Practices
Page 25: Establishing Metadata Practices
Page 26: Establishing Metadata Practices
Page 27: Establishing Metadata Practices

Lessons

Staffing is critical
Images are faster than text to describe
Minimal descriptive metadata
Software choice
    Staffing needs
    Flexible, easy to migrate out of, interoperable with other products
    Record of eXist has been mixed
XForms editor
    Has made XML data entry easier
    Firefox extension

Page 28: Establishing Metadata Practices

More Info

Code: http://code.google.com/p/xforms4lib/

Examples:
http://cdi.uvm.edu/exist/xforms/mods
http://cdi.uvm.edu/exist/xforms/modsSimple

Blog: http://thedil.wordpress.com/