Datasets Programme - John Kaye
Datasets Programme
June 2010
John Kaye – Lead Content Specialist Datasets
The British Library
• Exists for everyone who wants to do research – for academic, personal, and commercial purposes.
• Covers all subject areas – sciences, technology, medicine, arts, humanities, social sciences…
• Receives a copy of every item published in the UK.
• Holds over 150 million items, with 3 million items added each year.
• Used by over 16,000 people each day (on site and online).
Data and the Digital Landscape
• Seismic measurements taken by a geologist.
• Genetic data collected by a medical researcher.
• A survey of public opinions collected by a sociologist.
The Foundation for Research
• Data is a crucial component of the scholarly record.
• Re-acquisition may be impossible.
• Datasets are essential to the British Library’s mission to advance the world’s knowledge.
Currently…
There is:
• No effective way to link between datasets and articles;
• No widely used method to identify datasets;
• No widely used method to cite datasets.
As a result, datasets are:
• Difficult to discover;
• Difficult to access;
• In danger of being lost.
Datasets Strategy - Vision
Researchers can discover, access, adapt, reuse and reference datasets in the course of their research
Researchers will be able to track the impact that their datasets have and receive appropriate credit
The British Library will be an essential component of an interconnected network of service providers
Datasets from all disciplines remain intact, discoverable, useable and vital for future generations
The Datasets Programme
We envision a future where researchers can:
• Discover, access, reuse, and reference datasets.
• Track the impact of the data that they generate and receive appropriate credit.
Our approach is to:
• Provide a focus for the community to establish needs, requirements and agreement.
• Explore novel technology and creative solutions.
Projects – DataCite
DataCite is an international consortium which aims to:
• Establish easier access to scientific research data on the Internet
• Increase acceptance of research data as legitimate, citable contributions to the scientific record
• Support data archiving that will permit results to be verified and re-purposed for future study
Projects – DataCite
Founded on 1 December 2009
Now 12 members from 9 different countries:
• German National Library of Science and Technology (TIB)
• British Library (BL), UK
• ETH Zurich Library, Switzerland
• Institute for Scientific and Technical Information (INIST-CNRS), France
• National Technical Information Center (DTIC), Denmark
• TU Delft Library, Netherlands
• Canada Institute for Scientific and Technical Information (CISTI)
• Australian National Data Service (ANDS)
• California Digital Library (CDL), USA
• Purdue University Libraries (PUL), USA
• German National Library of Medicine (ZB MED)
• GESIS - Leibniz Institute of Social Sciences, Germany
Projects – DataCite
DataCite:
• Supports researchers by enabling them to locate, identify, and cite research datasets with confidence
• Supports data centres by providing workflows and standards for data publication
• Supports publishers by enabling research articles to be linked to the underlying data
A Key Component for Many Goals
[Diagram. Goals: Make Visible, Find, Access, Track Impact, Verify, Reuse, Cite. Key component: Persistent Identification.]
Digital Object Identifiers (DOIs) offer a solution
• Most widely used identifier for scientific articles
• Researchers, authors, and publishers know how to use them
• Put datasets on the same playing field as articles
Connecting an Article with the Underlying Data
Dataset: Yancheva et al. (2007). Analyses on sediment of Lake Maar. PANGAEA. doi:10.1594/PANGAEA.587840
URLs are not persistent
• (e.g. Wren JD. URL decay in MEDLINE: a 4-year follow-up study. Bioinformatics. 2008 Jun 1;24(11):1381-5).
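As a rough illustration of how a DOI connects a citation to a stable link, the sketch below normalises the dataset DOI quoted on this slide, builds a resolver URL, and formats a citation in the same "Creator (Year). Title. Publisher. Identifier" shape as the Yancheva example. The `DatasetRecord` class and the function names are hypothetical, introduced only for illustration; the use of `https://doi.org/` as the resolver is an assumption about the reader's setup and is not stated in the slides.

```python
from dataclasses import dataclass
from urllib.parse import quote

# Assumption: the public DOI proxy is used as the resolver.
DOI_RESOLVER = "https://doi.org/"


@dataclass
class DatasetRecord:
    """Hypothetical minimal record holding the fields shown in the slide's citation."""
    creators: str
    year: int
    title: str
    publisher: str
    doi: str


def normalise_doi(raw: str) -> str:
    """Strip common prefixes such as 'doi:' and check the basic '10.xxxx/suffix' shape."""
    doi = raw.strip()
    for prefix in ("doi:", "https://doi.org/", "http://dx.doi.org/"):
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a recognisable DOI: {raw!r}")
    return doi


def doi_url(raw: str) -> str:
    """Build a link that stays the same even if the dataset's landing page moves."""
    return DOI_RESOLVER + quote(normalise_doi(raw), safe="/")


def cite(record: DatasetRecord) -> str:
    """Format a dataset citation in the style used on the slide."""
    return (
        f"{record.creators} ({record.year}). {record.title}. "
        f"{record.publisher}. doi:{normalise_doi(record.doi)}"
    )


if __name__ == "__main__":
    record = DatasetRecord(
        creators="Yancheva et al.",
        year=2007,
        title="Analyses on sediment of Lake Maar",
        publisher="PANGAEA",
        doi="doi:10.1594/PANGAEA.587840",
    )
    print(cite(record))
    print(doi_url(record.doi))
```

The point of the design is that the citation carries only the identifier, never the hosting URL, so the data centre can move the dataset without breaking the reference in the article.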
The Cost of Visibility
• Harvesting and production: €5,000 – €5,000,000
• Storage, quality assurance, and metadata: €50 – €500 (approx. 1% of data creation cost)
• DOI registration and search results: €0.01 – €1
Projects – Search Our Catalogue
Social Science Collections and Research Datasets Strategy
• Content
  - Continue to build existing content (print and electronic): OECD, World Bank, UN etc.
  - Enhance links to Economic and Social Data Service: International, Government, Longitudinal, Qualidata
• Partnerships
  - Key partners: UK Data Archive, ONS, The National Archives
  - Involved in UK Data Forum; signatory to National Data Strategy for Economic and Social Data
• Resource discovery
  - Resource/user guides – add value to SSCR projects
  - Dataset cataloguing
  - Census 2011 exhibition
• Capacity building
  - Datasets Content Lead recruited
  - Training for Reference Team (Social Science, Science)
Challenges to Explore
• Long-term preservation of data
• Standards for data citation and metadata
• Methods for assuring quality and integrity of data
• Attribution and credit for data producers
• Effective discovery and accessibility
John Kaye
Lead Content Specialist – Datasets
Social Science Collections and Research
The British Library
96 Euston Road
London NW1 2DB
Telephone: 020 7412 7450
Email: [email protected]