Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g....

20
Data Publication at GFZ Kirsten Elger, Damian Ulbricht, Roland Bertelmann

Transcript of Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g....

Page 1: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Data Publication at GFZ

Kirsten Elger, Damian Ulbricht, Roland Bertelmann

Page 2: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Deutsches GeoForschungsZentrum GFZ

• Helmholtz Zentrum für die Erforschung der festen Erde („vom All bis zum Erdkern”)

• ~1200 Angestellte

• FB Erde und Umwelt, Energie• Methodische Kernkompetenzen:

Satellitentechnologien, geodätisch-geophysikalische Messnetze, Tomographie der festen Erde; Forschungsbohrungen, Labor- und Experimentiertechnik; Modellierung von Geoprozessen, usw.

• Die Entwicklung von Datensystemen zur Archivierung, Verbreitung und Publikation von Forschungsdaten ist ein wichtiges Standbein und Service für Wissenschaft und Gesellschaft.

Page 3: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

…to long tail data

From global networks…DATA SERVICES

Page 4: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

GFZ Data Services

• Beratung rund um Daten• Data Repository• Open Access Verlag (internal Review von STR Data)• DOI Service für andere Netzwerke und Datenzentren• Produkte

– Datenpublikation als supplementary material zu wiss. Artikeln– Datenpublikation im Rahmen von Data Papers– Datenpublikation mit begleitendem Report

Page 5: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Unser Repository Service

• Seit 2004 Registrierung von Daten DOI (>450 registrierte Datensätze und Data Collections

• ~30 Data Reports seit 2011 (15 in 2015)• DOI Registrierung für andere GFZ Netzwerke (z.B. GEOFON: >5500

DOI für seismische Events, neu 2015: DOI für seismische Netzwerke, 15)

• 2016ff: FID Geo (Fachinformationsdienst Geowissenschaften der festen Erde, DFG Projekt)– Bundesweites Angebot der Publikation von Datensupplementen für

die Geowissenschaften an Hochschulen und in Forschungsinstituten (soweit keine institutionelle Lösung existiert).

Page 6: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

PanMetaDocs/ eSciDoc/ DOIDB –a modular approach

File System

PubMan other systems

DOIDB metadata store

DataCite metadata storeData Portals

eSciDoc

PanMetaDocs

Dataset files

Basis, intern

Anwendungs-ebene

Extern

Page 7: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

PMD Metadata Editor

mandatory fieldoptional field

(recommended)

A new line will automatically appear when filling the first line Click here todelete an entry

ClearentriesLoad a previousversionSafe youractualversion

Page 8: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Information on field definition

drop-down menusappear when clicking at the arrow

Stopping the mousepointer over (or clickinginto) a field or a drop-down parameter showsexplanatory informationor definitions

Page 9: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Spatial Domain – visual control via map

Opens:

movedraw

bounding box

draw point

Enter coordinates manually (decimaldegree with at least 4 decimal digits, DD.dddd)

or Select from map

• Manual changes of coordina-tes will be immediately dis-played in the bounding box and vice versa

• You may define the acquisi-tion time of each spatialelement in the same line

Page 10: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Metadata Standards

• Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow databaseinteroperability and metadata exchange between portals.

• However, having standardised metadata does not necessarilyinvolve displaying each variable on landing pages (e.g. a global dataset does not require a map; for seismic networks a stationmap is essential).

• Due to the large variety of geoscientific disciplines, thecommon bracket would not be more than Dublin Core Metadata, which is often not sufficient to identify a suitabledataset.

Page 11: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Some examples

Page 12: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

No map required global dataset

structural metadata

International Centre for Global Earth Models: c.150 global gravitationalmodels since the1960s

Page 13: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Seismic Networks - GEOFONGE Net (permanent global network)

station map

citation

citation: GEOFON Data Centre(1993) GEOFON Seismic Network, GFZ Data Services. DOI: 10.14470/…

DOIDatacite metadataStructural metadata

network station list

Page 14: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

GIPP Example: Data Report + Data

Data Report (= data description) Datasets

Page 15: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Example STR – Data: GIPP (MINAS) – template

• Abstract (+ coordinates, keywords, related sources)• Introduction• Data Acquisition

– Experiment design and schedule– Geometry/Location and Instrumentation– Acquisition parameters

• Data Processing• Data Description

– File Format– Data content and structure

• Data Quality/Accuracy• Data Availability/Access (“data is restricted until May 2017”)• Acknowledgements, References• Figures on data completeness, logger GPS quality, probabilistic Power

Spectral Densities for subarrays

Page 16: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

EnMAP (hyperspectral satellite mission)

Project-specificdesign

Data requestvia form (large

datasets)

Page 17: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Datenpublikation Tereno NO

automatisch-generierte Metadaten

Page 18: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Next step

•International Geo Sample Number IGSN – uniqueidentifier forphysical objects

Page 19: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata
Page 20: Data publication at GFZ - Helmholtz...•Metadata should follow international standards, (e.g. Dublin Core, ISO, Datacite for discovery) to allow database interoperability and metadata

Data publication with assigned DOI

citableDOI have emerged as the leading system for text and data publication (COPDESS 2015).

persistentlong-term data access guaranteed (by the publisher) despite servers being changed or switched off or people change affiliations and email addresses.

with metadata and data descriptionessential for data re-use and discovery, a comprehensive data description should be made a condition for assigning a DOI to a dataset.