3TU Datacentrum Tech Overview
Click here to load reader
-
Upload
eposthumus -
Category
Education
-
view
426 -
download
2
description
Transcript of 3TU Datacentrum Tech Overview
3TU Datacentrum Back-end
An overview of current status as to the technical infrastructure.
Etienne Posthumus TU Delft Library
14-04-2009
Overview
• Fedora Repository Software
• Python middleware
• Fedora Front-end
• Dataset types
• Current Datasets
Fedora Repository Software
Investigation of use in past year
Institutional repository implemented
Not perfect, but most flexible of comparable systems.
The DIY downside is paradoxically also a key benefit.
Python middleware
• Django Application framework
• Agile software development
• Fedora coupling via REST HTTP API
• Also use SOLR for indexing
• Fall-through to Fedora provided services always possible
Fedora Front-end
• XSLT based
• Dynamic queries based on Resource Index
• Multiple output formats possible
• Used CMA for behaviours
Ellips: Fedora objectPijl: relatie (rdf)Kleine rechthoek: tekstuele metadataGrote rechthoek: datastream (anders dan DC of RELS-EXT)
Diagram by E Gramsbergen
Fedora Front-end
• XSLT based
• Dynamic queries based on Resource Index
• Multiple output formats possible
• Used CMA for behaviours
demo link en link
Dataset types
We identify two different types
• archival ingested submissions
• enriched objects
Dataset types
Archival Submissions
• Recorded, checksum as-is
• For reference purposes
Dataset types
Archival Submissions
• Recorded, checksum as-is
• For reference purposes
• Considering Bagit(from Library of Congress)
• Or accepting the Fedora FOXML format
Dataset types
Enriched objects
Possible conversions to other formats
For example CSV to XML
Manageable chunks
Dataset types
Enriched objects
Possible conversions to other formats
For example CSV to XML
Manageable chunks
Selected metadata as RDF
Stored in Resource Index
Current Datasets
• DARELUX
• WindZon
• Flame
• Asfalt
• Water
Discussion/Questions