Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National...

18
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales [email protected]

Transcript of Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National...

Page 1: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Adventures in Digital Asset Management: Fedora at the National Library of Wales

Glen Robson

National Library of Wales

[email protected]

Page 2: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Contents

The National Library of Wales Why the NLW choose Fedora The pilot Theoretical look into preservation Data Models

Page 3: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

The National Library of Wales

Nature of NLW Collecting

– Variety of data types and formats

Preserving– Obsolescence– Lack of context information– Persistent identifiers– Integration

Access– Open collections

Page 4: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Why we choose Fedora

Comparison with D-Space Fundamental issues

– Suitability for wide range of data types– Suitability for distribution of data types– Support for collection structures– Scalability– `Future-proof’ architecture

Page 5: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

The Pilot

Understand the Fedora System Experiment with different data types Allow access to Digital Assets Investigate workflows for moving digital

material into the repository

Page 6: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Examples

Ingested Digitised Images E-Thesis Ingested web pages Born Digital Object Basic authentication and rights

management

Page 7: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

E-Thesis

Abstract Word Document Thesis

– Original, PDF, Text, HTML and Tiff page images

Video Composition– Original, DivX and Web Viewer

Page 8: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Web Pages

Complex Digital Objects– Arrive in a Compressed File

Dissemination 1 Uncompress tar and serve– Simple– Difficult to migrate formats

Dissemination 2 Extracted and ingested into Fedora– More complicated– Can do format migration without breaking links– HTML converted to XHTML– Meta data can be assigned to each page, image or movie.

Page 9: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digitised Images

Problems:– Obsolete Formats– Loss of context information– Persistent identifiers and URLs– Integration – Access

Page 10: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digital Images – Obsolete Formats

What if we move from jpeg to jpeg2000– Website would have to be updated– All links would break:

• http://cairsweb.llgc.org.uk/images/mst/mst00001.jpg– Special Viewer?

Fedora’s Solution– Find all images– Add a disseminator to convert jpg files to jpeg2000– Links not file specific:

• http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189

– Record conversion in meta data– History automatically saved

Page 11: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digital Images – Context information

Fedora’s Solution– Mets Document in object as Data Stream– Version history so changes saved– Can store any type of Meta data:

• Mets Rights• PREMIS Preservation Meta data

– Could even store the intro page located on the Digital Mirror

Page 12: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digital Images – Persistent Identifiers

Fedora’s Solution– Data Type independent URLs

• http://teilo:9080/llgc/getImage/getMedium?PID=llgctest1:189

– Fedora PID constant even through upgrades– Can add any Identifiers using Fedora relationships – URLs link to Servlets for redirection

GetMedium Servlet1 Find pid llgctest1:189 Get Mid sized image Data Stream Convert to JPEG2000 Return Image

GetMedium Servlet2 Find pid llgctest1:189 Resize large image Data

Stream Convert to JPEG2000 Return Image

Page 13: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digital Images - Integration

Existing Digital Content from the Digital Mirrorhttp://teilo:8080/cocoon/METS/MST00001/frames?

div=1&subdiv=0&locale=en&mode=thumbnail

– Ingest existing Mets documents into Fedora• No change to existing workflow

– Ingest images into Fedora• Better preservation

– Allow original look and feel to website• One line change to configuration file

– Enhanced Version (PDF of Book)

Page 14: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Digital Images - Access

3 Types– Through Catalogue

• Difficult with Geac• New System OAI Harvesting?

– By Browsing• Current Digital Mirror • Relationships

– View all digitised collection:http://teilo:8080/cocoon/ViewCollection/llgctest1:DigitisedCollection

– By searching repository• Ambfish• Indexes Mets Documents

Page 15: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Data Models

Object: Fish BookPID: llgctest1:108

DS1.0 Page 1 LargeDS2.0 Page 1 MidDS3.0 Page 2 LargeDS4.0 Page 2 MidDS5.0 Page 2 LargeDS6.0 Page 2 MidDS7.0 MIX Meta for Page 1DS8.0 MIX Meta for Page 2DS9.0 MIX Meta for Page 3DS10.0 METS Document

Page 16: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

New Model

Object: Page1 Fish BookPID: llgctest1:109

DS1.0 Image LargeDS2.0 Image MidDS3.0 MIX Meta about DS1.0

Object: Page2 Fish BookPID: llgctest1:110

DS1.0 Image LargeDS2.0 Image MidDS3.0 MIX Meta about DS1.0

Object: Fish BookPID: llgctest1:108

DS1.0 Mets Doc

Is Part Of Is Part Of

Page 17: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Summary

Fedora as DAMs Fedora Community Moving towards OAIS and Trusted

Digital Repository status

Page 18: Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales gmr@llgc.org.uk.

Questions and Answers

Glen Robson [email protected]