BHL-E Meta Data Harmonisation Wolfgang Koller & Heimo Rainer NHM Vienna.

18
BHL-E Meta Data Harmonisation Wolfgang Koller & Heimo Rainer NHM Vienna

Transcript of BHL-E Meta Data Harmonisation Wolfgang Koller & Heimo Rainer NHM Vienna.

BHL-E Meta Data Harmonisation

Wolfgang Koller & Heimo RainerNHM Vienna

BHL-E Meta Data

• Corner Stones

EU cofunded eContentplus ( ECP 518001 ) – lead @ Museum f. Naturkunde, Berlin, GE

2009-05-01 / 2012-04-30

Consortium of 28 Partners (AT, BE, CZ, FI, GB, GE, IT, NL, PL, SP, US – SIL & MO )9 Technology Providers, incl. ATOS / AIT

21 Content Providers

• Major Goals

digitized literature content from european institutions for BHL-family

WebSite incl. Search Portal – www.bhl-europe.eu

Multilinguality

Contribution to European Cultural Portal – www.europeana.eu

BHL-E Meta Datawww.bhl-europe.eu

BHL-E Meta Datawww.europeana.eu

BHL-E Meta Datawww.europeana.eu

BHL-E Meta DataOpen Literature Exchange Formatwww.bhl-europe.eu/bhl-schema/v0.3/OLEF_v0.3.xsdhttp://www.bhl-europe.eu/bhl-schema/v0.3/

OLEF

OLEF Specification

XML-Schema for exchange of literature data list of required metadata information https://docs.google.com/spreadsheet/ccc?key=0Ak_9CQQdVjCidERlRmRhOHZDUGJONC1FMkw1VFByVUE&hl=en_US#gid=0

•Imports

– bibliographic data – MODS Metadata Object Description Standard – http://www.loc.gov/standards/mods/ – policy expressions IPR – ODRL Open Digital Rights Language - http://odrl.net/ – still image data – MIX Metadata for Images in XML – http://www.loc.gov/standards/mix/ – scientific names – DwC Taxon Terms - http://code.google.com/p/darwincore/wiki/Taxon

•RDF-S representation for Linked Open Data (in progress)

OLEF

OLEF Structure (simplified)

BHL-E Meta Data

Institution

Metadatastandard[according to Preingest test data]

To be uploaded[volumes/pages]

Content in BHL-E Portal

Content in Europeana

IngestSpring 2011

Comment on FTP/ detailed information

Comment on content

Comment on workflow

NHM[Natural History Museum]

over BHL-US

NMP[Narodni muzeum]

MARC21 Update from Richard on 22.04.2011:April 2011: ~3000 pagesApril 2012: ~5000 pages

Herbarz :...(882 pages)

asked for upload and estimation of pages/items -03.03.11- problems with ftp client 04.03.11metadata files missing in folders &jpeg files in main directory- asked to check upload -14.07.2011

LANDOE[Land Oberösterreich]

? 3400 volumes~600.000 pages

2568 2568 planned additional content : 800 volumes in spring 2011 readyadditional scanning of 150.000 pages during this year - 11.01.2011

provide metadata over oai-pmh using OLEF

OK from AIT - 19.05.2011OK from NHMW - 24.05.2011green light for LANDOE

HNHM[Hungarian Natural History Museum]

? ~ 35 volumes~ 3000 pages

will send detailed informationuntil 17.12.2010

BHL-E Meta Data

• Schema Mapping Tool – slim / easy to use / cross platform / standalone application (JAVA)https://github.com/bhle/bhle/tree/master/pre-ingest/schema-mapping-toolhttp://bhl.nhm-wien.ac.at/smt/launch.html

• built in schemas

ESE 3.2 & 3.3MARC21MODS 3.4OLEF 0.3

• JDBC connection

• built in conversions

MARC21 – MARCXMLMARC21 – MODS

MARC21 – OLEFMARCXML – MODS

MARCXML – MOLEFMODS – OLEF

RefNum – OLEF

Schema Mapping Tool

Mapping to OLEF

BHL-E Meta DataBHL-Europe Global Architecture Diagram

BHL-E Meta Data

• Integration of Components into Ingest System

BHL-E Meta Data

Current work

Person Names – VIAF www.viaf.org Taxonomic repositories – Catalogue of Life www.catalogueoflife.org /

PESI www.eu-nomen.eu/pesi/