-1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User...
-
date post
19-Dec-2015 -
Category
Documents
-
view
215 -
download
2
Transcript of -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User...
![Page 1: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/1.jpg)
-1-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
FEDORA @ AWIFedora User Meeting
Copenhagen, Denmark28 September, 2005
Pho
to: L
. Tad
day
Ana Macario, Computer CenterAlfred Wegener Institute for Polar and Marine Research
Germany
![Page 2: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/2.jpg)
-2-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Overview
AWI and its research scope SOA at AWI Rationale for choosing FEDORA Long-term issues
![Page 3: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/3.jpg)
-3-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
About AWI
1980 Establishment of the institute
in Bremerhaven as a foundation under public law; AWI is one out 15
centers belonging to Helmholtz Society
To date
- Budget: 103 Mill. Euro- 800 Employees
Funding- 90% Federal Ministry of Education and Research (BMBF) - 8% Bremen state- 1% Brandenburg and Schleswig-Holstein states- external funds
![Page 4: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/4.jpg)
-4-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Our mission
Wadden Sea Station Sylt
Biologische Anstalt Helgoland
Alfred-Wegener-Institut
für Polar- und Meeresforschung
Bremerhaven
Research Unit Potsdam
To contribute to polar and marine research in order to advance insights into the changeability of the global environment and the earth system
![Page 5: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/5.jpg)
-5-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Research platforms
Primary data:• observations acquired in diverse
research platforms, long-time series monitoring (observatories)
• numerical models• lab. experiments• photographs, maps/charts
PublicationsEventsIntelectual property rights –
Technology transfer
![Page 6: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/6.jpg)
-6-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Backups
Backups
Relational DatabasesPANGAEA/WDC-Mare
Meteorology,Oceanography
Diatom collectionsGIS, Polarstern expeditions
DirectoryPeople, Organizational
PublicationsEvents
Technology transferExpeditions
Examples: Directory services
MapServer
Middleware Services
Examples:Web-based interfaces for searching primary datasets, publications, expeditions, etc
Backups
File and Storage systems
Publications full-textModel runs
Large datasets
ISO 19115DublinCore
Internet2/eduPersoneduOrg
DublinCoreAuthN&AuthZ
Simplified Overview (2004)
![Page 7: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/7.jpg)
-7-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
“Staging”
Versionning and trace-ability relevant to scientists (data
calibration, validation, processing, etc)
Distributed data storage
“Role” tailored access policy to assure data
rights
Spatial, temporal and thematic
search/visualization(GIS mapping services)
“Publication”
Long-term archival of quality-controlled digital objects in IR
IR exposed via OAI-PMH and SOAP
Export functionality to international agencies (GCMD,
NGDC, NOAA, GBIF, etc)
PI turns in post-print
PI removes data access restrictions
In practice…
Fedora as “active workspace”
![Page 8: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/8.jpg)
-8-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Why AWI chose to test FEDORA?
Flexible, extensible digital object model Open source; good documentation and tutorials Allows for metadata description other than Dublin Core record;
relevant for geo-referenced objects (ISO 19115), bio-diversity objects (Darwin Core), objects of type people (Internet2/eduPerson), organizational units (Internet2/eduOrg),etc
Able to distribute load and object storage among several IR instances („Virtual Repository“ concept)
Standards compliant: XML storage, OAI-PMH and web services
![Page 9: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/9.jpg)
-9-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Why AWI chose to test FEDORA? – cont.
Promising scalability; Fedora@AWI currently archives 15,000 objects
Object preservation through content versionning; includes audit trail record for preserving event history
XML ingest/export assures interoperability with existing in house information systems
![Page 10: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/10.jpg)
-10-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Backups
Directory&
File systemsPublications
EventsTechnology transfer
PeopleOrganizational Units
15,000 objects
SybaseBLOBs
PANGAEA/WDC-MARE
Managesoap
Accesssoap
Searchsoap
OAI Provider
http
Searchsoap
OAI Provider
http
Fedora RepositorySystem
OAI Harvester
(PKP)
Backups
SybaseRelational
PANGAEA/WDC-MARE
245,000 objects
FOXML ingest
FrontendBackend
Simplified Overview (2005)
WDC-specific XML
![Page 11: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/11.jpg)
-11-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
SOAP client
![Page 12: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/12.jpg)
-12-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
SOAP client – cont.
![Page 13: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/13.jpg)
-13-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
SOAP client – cont.
![Page 14: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/14.jpg)
-14-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
A few technical remarks on Fedora 2.0...
Web services APIs are great; suggested improvements: - findObjects: browsing list backwards is not possible yet, totalNumberOfResults is missing - addDatastream: file uploads: could it be done with SOAP-attachments?
Timestamp resolution in miliseconds has raised problems in „conformance tests“ under www.openarchives.org
„DeletedRecords“ set to „Transient“ in order to allow for incremental harvesting by „modified date“
![Page 15: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/15.jpg)
-15-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Next steps ...
Set up new services: naming, full-text indexing & search, large-scale content ingestion (bulk load) together with metadata
Metadata transformation services as „disseminator“ – relevant for data supply to external service providers (e.g., NGDC, GCMD, NOAA, GBIF)
Set up collections (and respective granularity policies) - relevant for object-to-object relationship metadata
![Page 16: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/16.jpg)
-16-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
DC-hardwired relation
Resource
Item
Dublin Core Pangaea-specific
OAI-PMHrecords
OAI-PMH identifier – “DOI”
ISO 19115
Descriptive+
Administrative metadata
Descriptive+
Administrative metadata
Descriptivemetadata
DC metadata
<dc.source> locator for content
<dc.relation> locator for
publication(s)
Dataset-to-Publication relationship metadata
should be expressed in RDF/XML and placed in the
“Relations datastream”
![Page 17: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/17.jpg)
-17-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Backups
Directory&
File systemsPeople
Organizational UnitsPublications
EventsTechnology Transfer
15,000 records
We need the XACML-based
module in order toadd „live“ data!
SybaseBLOBs
PANGAEA/WDC-MARE
Managehttp/soap
Accesshttp/soap
Searchhttp/soap
OAI Provider
http
Searchhttp/soap
OAI Provider
http
Fedora RepositorySystem
OAI Harvester
(PKP)
Backups
SybaseRelational
PANGAEA/WDC-MARE
245,000 records
FOXML ingest
FrontendBackend
Testing triple store query performance
Pangaea-XML
2006: FOXML ingest
![Page 18: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/18.jpg)
-18-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Long-term issues for AWI
Benchmarking for large number of files; we fear scalability breakpoint related to the size of the filesystem-based LLStorage area
Out-of-box web-based client relevant for „acceptance“ by other Helmholtz centers
Fine-grained access control policies and Shibboleth based AuthN – relevant in DataGRID context
Support for sets
![Page 19: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/19.jpg)
-19-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Long-term issues for AWI – cont.
Federation model Collaboration and support infra-structure
- disseminators for specific visualizations services (e.g. NetCDF data and LiveAcessServer, GIS data and OpenMapServer); relevant for DataGRID
- ECLIPSE project to facilitate plug-in development?
- Google strategy
- Seminars, tutorials for „advanced“ FEDORA users
![Page 20: -1- Ana Macario, Computer Center Alfred Wegener Institute, Bremerhaven, Germany European Fedora User Meeting, Copenhagen, Denmark, 2005-09-28 Mastertitelformat.](https://reader030.fdocuments.in/reader030/viewer/2022032800/56649d2a5503460f949ff2cf/html5/thumbnails/20.jpg)
-20-Ana Macario, Computer Center
Alfred Wegener Institute, Bremerhaven, GermanyEuropean Fedora User Meeting, Copenhagen, Denmark, 2005-09-28
Mastertitelformat bearbeiten
Thanks for your attention!
Pho
to: L
. Tad
day
Ana Macario, Computer CenterAlfred Wegener Institute for Polar and Marine Research
Germany
[email protected] http://www.awi-bremerhaven.dehttp://web.awi-bremerhaven.de/fedora/oai