Open Access to Data: an Introduction - bib.telegrafenberg.debib.telegrafenberg.de › fileadmin ›...

Post on 08-Jun-2020

3 views 0 download

Transcript of Open Access to Data: an Introduction - bib.telegrafenberg.debib.telegrafenberg.de › fileadmin ›...

Open Access to Data:an Introduction

Open Access Week 20092009­10­19

Dr. Jens Klump (CeGIT)

Agenda

• What is Open Access and why do we want it?

• Why Open Access to data?

• OA Data @ Helmholtz

• OA Data @ GFZ

• Data publication as a prerequisite of Open Access to data.

What is Open Access?

• Open access to scientific and scholarly literature means "its free availability on the public internet, permitting any users to read, download, copy [...] without financial, legal, or technical barriers [...]. The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited.“ (Budapest Open Access Initiative)

Why Open Access?

“Maximize research usage and impact by maximizing user access to it.“

Steven Harnard: Opimizing OA-Self-Archiving Mandates. <http://eprints.soton.ac.uk/13098/01/arch.html>

Open Access to Scholarly Literature

• Green Road: secondary publication of papers through institutional or disciplinary servers.

• Golden Road: primary publication of papers through journals without access barriers.

Open Access at Helmholtz

• The Helmholtz Association is among the first signatories of the „Berlin Declaration on Open Access to Knowledge“.

• The principles of the „Berlin Declaration“ have been adopted by the Helmholtz Centres.

• The Helmholtz Open Access Project supports Helmholtz centres in the implementation of Open Access.

Policy at GFZ

• As a member of the Helmholtz Assiociation GFZ is committed to further the aims of the „Berlin Declaration“.

• Open Access is part of the GFZ publication guidelines, as are the DFG „Rules for Good Scientific Practice“.

PS: This is part of your employment contract with GFZ.

Why Open Access to Data?

The other side ...

Policy

• Policies for research data already exist in some areas:– Legal (RöntgenV, privacy, contractual, ...)– Research organisatios (DFG, Helmholtz, Max-

Planck, ...)– Institutional rules– Journal editorial policies

• Implementation of these policies still has many gaps.

Open Access to Data

• The Helmholtz Open Access Project has Open Access to data as one of its goals.

• Provision of and access to research data is of high priority for Helmholtz centres, not at least because many centres run internationally well reputed large research infrastructures and devices.

• By 2010 all three ICSU WDS data centres in Germany are at Helmhotz Centres.

Two Goals

• Good Scientific Practice

– Quality– Transparency– Ethics

• Open Access to Knowledge

– Visibility– Innovation– Impact

Data are part of the record of science

Data have become an important part of our research and of the record of science.

Most data are not managed systematically and are at risk of being lost.

Risks: Duplication Fraud

Scientific data today

Manuscript

Publication

Library

Data Metadata

Private Files

after Helly et al. (2003)

Access to data today

Access to data today

Source of data is not referenced.

No Metadata. Often the source of

the data is not even mentioned in the acknowledgements.

Data and Copyright

• Copyright law sees data as unoriginal. Collections of facts cannot be copyrighted.

• „Sweat of the brow“ does not protect a data collection.

• The EU directive on databases protects databases to a certain extent.

• Workaround: End-user licence agreements.

• The legal framework for research data leaves much to be desired.

Access to data?

What next?

• „The only constraint on reproduction and distribution, and the only role for copyright in this domain, should be to give authors control over the integrity of their work and the right to be properly acknowledged and cited“.

• Is it possible to cite data? - Yes!

DFG Project STD-DOI

• Projects „Publication and citation of scientific primary data“ (STD-DOI):

– Make data citeable– Persistent Identifiers for Datensets (Digital

Object Identifier, doi)– Methods for quality management in data

publication.

• Data publication should (again) become part of common scientific practice.

Use cases

The project STD-DOI defines three use cases:

• Data publication as supplement to a journal publication

• Data publication as an independent work

• Data publication from data streams and by automated systems (e.g. environmental monitoring, earth observation, observatories).

Data publication as supplement to Literature

• Heim et al., J. Global and Planetary Change, 46(1-4), 9-27.

• doi:10.1594/GFZ.SDDB.1043 at ICDP Scientific Drilling Database.

Procedure

• Peer-review as part of the editorial process of the journal, including review of the underlying data.

• Formal checks (syntactic)

Data publication as independent work

Procedure

• Peer-review by data jornal (e.g. Earth System Science Data, G-Cubed Data Briefs)

• Formal checks (syntactic)

Data from automated systems

Procedure

• Peer-review of the observatories and processing chains

• Quality management of the processing chains

Literature, Data, Samples

Search: ...

doi:10...doi:10.1594/...

IGSN hdl: ...

Sref: ...

doi:10...

doi:10.1594/...

doi:10.1594/...

Thank You!

The GFZ Library and the GFZ Centre for GeoInformation Technology will be glad to assist you in questions regarding Open Access and the publication of data.

Contact: jens.klump@gfz-potsdam.de,Phone: -1702http://www.scientificdrilling.org