Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive...

21
Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010

Transcript of Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive...

Page 1: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Bridging the gap between data centres and publishers

J. BraseICSTI Workshop “Interactive Publications and the Record of Science

February 8th, 2010

Page 2: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

A Gap

A widening gap in the scientific record between published research and the data that underlies it

• Published work held by libraries• Datasets held by data centres• No effective way to link

between datasets and articles• No widely used method to

identify datasets• No widely used method to cite

datasetsAs a result, datasets are

• Difficult to discover• Difficult to access• Second-class citizens in the

scientific record

Page 3: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Datasets – first class citizens?

Datasets• Data is difficult to manage after

project funding ceases

• Informal networks provide the primary means of sharing

• Only 21% use a national or international facility

• Datasets are not included in impact analysis

• Good luck finding it or getting permission to use it (your discipline may vary)[Source: UKRDS Study]

Published articles• Libraries ensure long-term

storage and management

• Established funded services provide the primary means of access

• Nearly all published articles are held in multiple national libraries

• Articles and citations form the backbone of impact analysis

• Catalogues and full-text search support discovery

Page 4: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Dataset citation using the DOI system

The DOI system offers an easy way to connect the article with the underlying data:

The dataset:

G.Yancheva, N. R. Nowaczyk et al (2007)

Rock magnetism and X-ray flourescence spectrometry analyses on sediment cores of the Lake Huguang Maar, Southeast China, PANGAEA

doi:10.1594/PANGAEA.587840

Is supplement to the article:

G. Ycheva, N. R. Nowaczyk et al (2007)

Influence of the intertropical convergence zone on the East Asian monsoon

Nature 445, 74-77

doi:10.1038/nature05431

Page 5: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

• DataCite

Page 6: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

DataCite

• Global consortium carried by local institutions• focused on improving the scholarly infrastructure around

datasets and other non-textual information• focused on working with data centres and organisations that

hold data• Providing standards, workflows and best-practice• Initially, but not exclusivly based on the DOI system• Founded December 1st 2009 in London

Page 7: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Members

• Technische Informationsbibliothek (TIB), Germany• Canada Institute for Scientific and Technical Information (CISTI), • California Digital Library, USA• Purdue University, USA• Library of TU Delft,

The Netherlands• Technical Information

Center of Denmark• The British Library• ZB Med, Germany• Gesis, Germany• Library or the ETH Zürich• L’Institut de l’Information Scientifique

et Technique (INIST), France• Australian National Data Service (ANDS)

Page 8: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

DataCite

The DataCite registration agency• Maintains the resolution infrastructure

• Maintains a searchable database of metadata

• Manages the identifiers over the long term

• Establishes and shares best practice

Publishing agents (data centres, research institutes, data publishers) are responsible for

• Quality assurance

• Content storage and access

• Creating the identifier

• Creating and updating metadata

Page 9: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

DataCite Structure

Carries

International DOI Foundation

DataCite

MemberInstitution

Data CentreData CentreData Centre

MemberInstitution

Data CentreData CentreData Centre

… Works with

Managing Agent(TIB)

Member

AssociateStakeholder

Page 10: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Another way to see it…

Publishers Data centres

Page 11: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Another way to see it…

Publishers Data centres

Page 12: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Another way to see it…

Publishers Data centres

Page 13: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

• Examples

Page 14: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 15: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 16: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 17: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 18: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 19: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 20: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.
Page 21: Bridging the gap between data centres and publishers J. Brase ICSTI Workshop “Interactive Publications and the Record of Science February 8th, 2010.

Bridging the gap

• DataCite supports researchers by enabling them to locate, identify, and cite research datasets with confidence

• DataCite supports data centres by providing workflows and standards for data publication

• DataCite supports publisher by enabling linking from articles to the underlying data

http://www.datacite.org