Identifiers and Data Management

18
Identifiers and Data Management Joan Starr California Digital Library

description

How can persistent identifiers like digital object identifiers (DOIs) and archival resource keys (ARKs) help with data management? When should you use a DOI vs and ARK? These "talking point" slides give you sample text you can use in your data management plans--and more!

Transcript of Identifiers and Data Management

Page 1: Identifiers and Data Management

Identifiers and Data Management

Joan StarrCalifornia Digital Library

Page 2: Identifiers and Data Management

What is an identifier?

Page 3: Identifiers and Data Management

What is an identifier?

What you see: alphanumeric string (never changes)Associated with: location of object (such as a URL)

Optional: who, what, when, etc (i.e. metadata)

By Joelk75: http://www.flickr.com/photos/75001512@N00/2728233597/

Page 4: Identifiers and Data Management

Identifier example

string: doi:10.9999/FK40K2GTVhtml version: http://dx.doi.org/10.9999/FK40K2GTV

location: http://www.bologna.edu/biology/xfg/123.xls

metadatacreator: Dr. Felix Kottortitle: Data for chromosomal study of catfish (Ictalurus punctatus)publisher: University of Bolognadate: 8/31/2012

Page 5: Identifiers and Data Management

Identifier example

string: doi:10.9999/FK40K2GTVhtml version: http://dx.doi.org/10.9999/FK40K2GTV

location: http://www.state.edu/ecology/783sdr/123.xls

metadatacreator: Dr. Felix Kottortitle: Data for chromosomal study of catfish (Ictalurus punctatus)

publisher: Dryad Data Repository date: 10/01/2013

Page 6: Identifiers and Data Management

Why Identifiers are Important

Page 7: Identifiers and Data Management

Allow readers to find data products

Get credit for data and publications

Promote reproducibilityBetter measure of research

impact

Example:Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20

Why Identifiers are Important

Page 8: Identifiers and Data Management

A&I Indexing and #altmetrics

Page 9: Identifiers and Data Management

• DataCite DOIs for data, linking to scholarly research

• Credit to data producers and data publishers

• Exposure and research metrics for datasets(Web of Knowledge, Google)

Primary Functions1. Create identifiers

2. Manage identifiers (and metadata) over time

3. Resolve identifiers

EZIDLong term identifiers made easy

@ezidCDL

http://n2t.net/ezid

Page 10: Identifiers and Data Management
Page 11: Identifiers and Data Management

DataCite Services1. DOIs for data! 2. Local service & support3. Usage stats4. Citation formatter5. Content negotiation6. Metadata search7. OAI provider8. DataCite-to-ORCID hookup*9. Your ideas here…

Page 12: Identifiers and Data Management

ARKs

DOIsIDF

EZID CLIENTS

DOIs

DOIs

EZID and DataCite together

Page 13: Identifiers and Data Management

EZID: DOIs & ARKs

DOIs ARKs

Strict metadata requirements Flexible metadata guidelines

From the scholarly communication community

From the archives and museums community

Established “brand name” Option-rich, open source

Use case: Data Citation Use case: Data Documentation

Page 14: Identifiers and Data Management

DOIs and ARKs indata management plans

Page 15: Identifiers and Data Management

DOIs in data management plans

What it looks like: Sample plan language:

Aguilée R, Lambert A, Claessen D (2011) Data from: Ecological speciation in dynamic landscapes. Journal of Evolutionary Biology doi:10.5061/dryad.74024

Publication of data shall occur during the project, if appropriate, or at the end of the project, consistent with normal scientific practices.[Team] follows a standardized data product citation including DOI, that indicates the version and how to obtain a copy of that product.Why it’s important:

OSTP mandate to: identify and provide “appropriate attribution to scientific data sets”

Researcher benefits: Credit, increased citations, increased productivity

Data Citation

Page 16: Identifiers and Data Management

ARKs in data management plans

What it looks like:At top-level directory/folder:

Project TitleUnique IdentifierDate (yyyy or yyyy.mm.dd)

At sub-directories:optional identifiers at granular levels

Sample plan language:[Team] follows the recommended best practice for good data management by assigning unique identifiers (ARKs) to the data as part of the data documentation.

Why it’s important:

Researcher benefits: Data documentation helps you keep track of (and remember) aspects of your data throughout the research project.

Data Documentation

Page 17: Identifiers and Data Management

Identifiers for data management

Identifiers + data=• Easy to access, • Easy to re-use • Easy to verify

Page 18: Identifiers and Data Management

EZIDDMP Tool

EmailTwitter

http://ezid.cdlib.org https://dmptool.org/

[email protected]@ezidCDL