3TU Datacentrum Tech Overview

14

Click here to load reader

description

Presented at 3TU Datacentrum project group in Utrecht April 2009

Transcript of 3TU Datacentrum Tech Overview

Page 1: 3TU Datacentrum Tech Overview

3TU Datacentrum Back-end

An overview of current status as to the technical infrastructure.

Etienne Posthumus TU Delft Library

14-04-2009

Page 2: 3TU Datacentrum Tech Overview

Overview

• Fedora Repository Software

• Python middleware

• Fedora Front-end

• Dataset types

• Current Datasets

Page 3: 3TU Datacentrum Tech Overview

Fedora Repository Software

Investigation of use in past year

Institutional repository implemented

Not perfect, but most flexible of comparable systems.

The DIY downside is paradoxically also a key benefit.

Page 4: 3TU Datacentrum Tech Overview

Python middleware

• Django Application framework

• Agile software development

• Fedora coupling via REST HTTP API

• Also use SOLR for indexing

• Fall-through to Fedora provided services always possible

Page 5: 3TU Datacentrum Tech Overview

Fedora Front-end

• XSLT based

• Dynamic queries based on Resource Index

• Multiple output formats possible

• Used CMA for behaviours

Page 6: 3TU Datacentrum Tech Overview

Ellips: Fedora objectPijl: relatie (rdf)Kleine rechthoek: tekstuele metadataGrote rechthoek: datastream (anders dan DC of RELS-EXT)

Diagram by E Gramsbergen

Page 7: 3TU Datacentrum Tech Overview

Fedora Front-end

• XSLT based

• Dynamic queries based on Resource Index

• Multiple output formats possible

• Used CMA for behaviours

demo link en link

Page 8: 3TU Datacentrum Tech Overview

Dataset types

We identify two different types

• archival ingested submissions

• enriched objects

Page 9: 3TU Datacentrum Tech Overview

Dataset types

Archival Submissions

• Recorded, checksum as-is

• For reference purposes

Page 10: 3TU Datacentrum Tech Overview

Dataset types

Archival Submissions

• Recorded, checksum as-is

• For reference purposes

• Considering Bagit(from Library of Congress)

• Or accepting the Fedora FOXML format

Page 11: 3TU Datacentrum Tech Overview

Dataset types

Enriched objects

Possible conversions to other formats

For example CSV to XML

Manageable chunks

Page 12: 3TU Datacentrum Tech Overview

Dataset types

Enriched objects

Possible conversions to other formats

For example CSV to XML

Manageable chunks

Selected metadata as RDF

Stored in Resource Index

Page 13: 3TU Datacentrum Tech Overview

Current Datasets

• DARELUX

• WindZon

• Flame

• Asfalt

• Water

Page 14: 3TU Datacentrum Tech Overview

Discussion/Questions