Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National...

Post on 26-Dec-2015

212 views 0 download

Transcript of Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National...

Adventures in Digital Asset Management: Fedora at the National Library of Wales

Glen Robson

National Library of Wales

gmr@llgc.org.uk

Contents

The National Library of Wales Why the NLW choose Fedora The pilot Theoretical look into preservation Data Models

The National Library of Wales

Nature of NLW Collecting

– Variety of data types and formats

Preserving– Obsolescence– Lack of context information– Persistent identifiers– Integration

Access– Open collections

Why we choose Fedora

Comparison with D-Space Fundamental issues

– Suitability for wide range of data types– Suitability for distribution of data types– Support for collection structures– Scalability– `Future-proof’ architecture

The Pilot

Understand the Fedora System Experiment with different data types Allow access to Digital Assets Investigate workflows for moving digital

material into the repository

Examples

Ingested Digitised Images E-Thesis Ingested web pages Born Digital Object Basic authentication and rights

management

E-Thesis

Abstract Word Document Thesis

– Original, PDF, Text, HTML and Tiff page images

Video Composition– Original, DivX and Web Viewer

Web Pages

Complex Digital Objects– Arrive in a Compressed File

Dissemination 1 Uncompress tar and serve– Simple– Difficult to migrate formats

Dissemination 2 Extracted and ingested into Fedora– More complicated– Can do format migration without breaking links– HTML converted to XHTML– Meta data can be assigned to each page, image or movie.

Digitised Images

Problems:– Obsolete Formats– Loss of context information– Persistent identifiers and URLs– Integration – Access

Digital Images – Obsolete Formats

What if we move from jpeg to jpeg2000– Website would have to be updated– All links would break:

• http://cairsweb.llgc.org.uk/images/mst/mst00001.jpg– Special Viewer?

Fedora’s Solution– Find all images– Add a disseminator to convert jpg files to jpeg2000– Links not file specific:

• http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189

– Record conversion in meta data– History automatically saved

Digital Images – Context information

Fedora’s Solution– Mets Document in object as Data Stream– Version history so changes saved– Can store any type of Meta data:

• Mets Rights• PREMIS Preservation Meta data

– Could even store the intro page located on the Digital Mirror

Digital Images – Persistent Identifiers

Fedora’s Solution– Data Type independent URLs

• http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189

– Fedora PID constant even through upgrades– Can add any Identifiers using Fedora relationships – URLs link to Servlets for redirection

GetMedium Servlet1 Find pid llgctest1:189 Get Mid sized image Data Stream Convert to JPEG2000 Return Image

GetMedium Servlet2 Find pid llgctest1:189 Resize large image Data

Stream Convert to JPEG2000 Return Image

Digital Images - Integration

Existing Digital Content from the Digital Mirrorhttp://teilo:8080/cocoon/METS/MST00001/frames?

div=1&subdiv=0&locale=en&mode=thumbnail

– Ingest existing Mets documents into Fedora• No change to existing workflow

– Ingest images into Fedora• Better preservation

– Allow original look and feel to website• One line change to configuration file

– Enhanced Version (PDF of Book)

Digital Images - Access

3 Types– Through Catalogue

• Difficult with Geac• New System OAI Harvesting?

– By Browsing• Current Digital Mirror • Relationships

– View all digitised collection:http://teilo:8080/cocoon/ViewCollection/llgctest1:DigitisedCollection

– By searching repository• Ambfish• Indexes Mets Documents

Data Models

Object: Fish BookPID: llgctest1:108

DS1.0 Page 1 LargeDS2.0 Page 1 MidDS3.0 Page 2 LargeDS4.0 Page 2 MidDS5.0 Page 2 LargeDS6.0 Page 2 MidDS7.0 MIX Meta for Page 1DS8.0 MIX Meta for Page 2DS9.0 MIX Meta for Page 3DS10.0 METS Document

New Model

Object: Page1 Fish BookPID: llgctest1:109

DS1.0 Image LargeDS2.0 Image MidDS3.0 MIX Meta about DS1.0

Object: Page2 Fish BookPID: llgctest1:110

DS1.0 Image LargeDS2.0 Image MidDS3.0 MIX Meta about DS1.0

Object: Fish BookPID: llgctest1:108

DS1.0 Mets Doc

Is Part Of Is Part Of

Summary

Fedora as DAMs Fedora Community Moving towards OAIS and Trusted

Digital Repository status

Questions and Answers

Glen Robson gmr@llgc.org.uk