Data publication and Citation for CLIR postdoc seminar
-
Upload
carly-strasser -
Category
Technology
-
view
103 -
download
0
description
Transcript of Data publication and Citation for CLIR postdoc seminar
CLIR/DLF Postdoc Seminar 4 August 2014
Data Publication & Citation Workshop
Carly Strasser California Digital Library
@carlystrasser
Roadmap
3. Data citation
1. Intro & background
2. Data publication
4. Altmetrics
From
Flic
kr v
ia lib
raria
ninst
a.tu
mbl
r.com
I am not a librarian.
Enable data sharing Encourage
new incentives
Think about code sharing
Work with libraries, publishers and
researchers
Explore new tools to help
change system
Build tools
John Kratz PhD in Biology from Columbia University
CLIR/DLF Postdoctoral Fellow, started 12 months ago
Data publication and its importance for data sharing, reuse, and preservation
From Wikimedia Commons
Back in the day…
From ahswhg.wikispaces.com
Back in the day…
Da Vinci
Curie Newton
classicalschool.blogspot.com
Darwin
Research has changed
Better
From wikimedia
Such Internet!
So many tools!
From Flickr by John Jobby
So much data!
Research has changed Worse
Digital data Fr
om F
lickr
by
Flick
mor
From
Flic
kr b
y US
Arm
y En
viron
men
tal C
omm
and
From
Flic
kr b
y D
W08
25
C. Strasser
Cour
tese
y of
WHO
I
From
Flic
kr b
y d
eltaM
ike
Digital data +
Complex workflows
“Reproducibility Crisis”
“Digital Dark Age”
“Erosion of Trust”
All of the research Early & often
Transparently & openly Fr
om F
lickr
by
gsag
ostin
ho
the way we communicate our
v Can we fix research?
notebook science/research source content access data government repository knowledge Fr
om F
lickr
by
cdse
ssum
s
Open Science
Making data research dissemination
available to all
notebook science source content access data government repository knowledge Fr
om F
lickr
by
cdse
ssum
s
Open
certain data should be freely available to everyone to use & republish as they wish, without restrictions from copyright, patents or other mechanisms of control
Data
From Flickr by Ninja M.
From Flickr by Iqubal Osman
Culture Shift Required
“I own my data and you can’t have it.”
“Let me do my work.”
“I’m already too busy.”
“This takes away from research time.”
h/t Ted Hart, NEON
You can be the
Guardian Steward Caretaker
Data can’t be owned.
Roadmap
3. Data citation
1. Intro & background
2. Data publication
4. Altmetrics
What does “data publication” mean? 1. Available 2. Citable 3. Trustworthy
Data are
Available | Citable | Trustworthy
• Publish means to “make public”. • You should not have to email the author. • The data doesn’t have to be open access.
“Email me!” CC-0 on web
Best practice: data in a trusted community repository with a machine-readable license/waiver
Repositories for data
General content
Non-institutional
Publishers/for-profits
Other
Institutional
Discipline-specific
Repository choices…
Institutional
Discipline-specific
• All data associated with a paper
• Tells a story • Clearinghouse for
researcher’s works
• Some of data for a given paper
• Discoverable • Integrated systems • Collection policies
? Both
Which should a researcher use?
Which is more important?
Depends
Repository choices…
Five-element citation: author, year, title, publisher, identifier
Available | Citable | Trustworthy
Boettiger C, Dushoff J, Weitz JS (2009). Data from: Fluctuation domains in adaptive evolution. Theoretical Population Biology. Published in Dryad. doi:10.5061/dryad.j8n0p7vc More later
Available | Citable | Trustworthy
From Flickr by Percival Lowell
For articles: peer review For data: ? peer review? validation?
Technical VS. Scientific
Available | Citable | Trustworthy
Technical review: completeness, formats
Available | Citable | Trustworthy
Peer review of data
Scientific review: importance, methods evaluation
vs
Available | Citable | Trustworthy
Peer review of data • Experts • Users • Community • Use = validation
Who?
1. Data as supplemental material
Data published alongside a traditional journal article. Available + citable. Review varies. Potential issues with long-term availability.
What does a data publication look like?
From Flickr by subsetsum
2. Data paper: Data + descriptive “data paper”
Standalone journals: Nature Scientific Data, Geoscience Data Journal, Ecological Archives OR Journals that publish data papers: GigaScience, F1000 Research, Internet Archaeology
What does a data publication look like?
From Flickr by subsetsum
3. Standalone data
Data published without a related journal article. Rich metadata (structured or unstructured) • Institutional repository • Open Context • NASA PDS Peer Review Data • figshare (but no validation)
What does a data publication look like?
From Flickr by subsetsum
…“publication” insinuates that we are beholden to the current broken system of journal publication. The word itself has too much baggage. …bureaucrats, funders, and institutions have a familiarity with the word and it will ensure the success of the data publication goals, regardless of whether we break the mold in the process.
þ
ý
http://datapub.cdlib.org/2012/03/06/data-publication-an-introduction/
“Publish”
“Paper”
“Peer review” “Sharing”
“Available”
“Article” “Publication”
From Flickr by Sandia Labs
C. Strasser
C. Strasser
World Bank Photo Collection From Flickr
What do researchers think of data publication?
Survey of researchers
N=274
John Kratz, forthcoming
Roadmap
3. Data citation
1. Intro & background
2. Data publication
4. Altmetrics
Identifiers & Data Citation
Allows readers to find data products Get credit for data and publications
Promotes reproducibility
Example: Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20
An article about data, but no data
from Joan Starr
And then the hunt for the data…
from Joan Starr
FTP site
And then the hunt for the data…
from Joan Starr
The citation difference: data linked…
from Joan Starr
…to the scholarly publication
from Joan Starr
Identifiers • String of characters • Unique • Linked to a digital
object
DOI: Digital object identifier
From Flickr by Plbmak
DOIs ARKs Strict metadata requirements Flexible metadata guidelines
From the scholarly communication community
From the archives and museums community
Established “brand name” Option-‐rich, open source
$$$ $
Comparing two…
DOI: 10.1890/1540-9295-10.2.59 ARK: ark:/12025/654xz321/s3/f8.05v
Res
olve
r
Website with
“object”
Identifiers How it works
dx.doi.org
From Flickr by Sandia Labs
C. Strasser
C. Strasser
World Bank Photo Collection From Flickr
Identifier for people
Res
olve
r
Person’s products
Identifiers for people
Researcher Identification
1. Register 2. Connect with ORCID partners 3. Claim your work
orcid.org
Roadmap
3. Data citation
1. Intro & background
2. Data publication
4. Altmetrics
Altmetrics?
Impact Factors
+ Citation Counts
Credit in academia…
Altmetrics Article-level metrics
Altmetrics Article-level metrics
Altmetrics for alt-products
Data Code Slides Blogs
Downloads Tweets
Mentions Views
Altmetrics Article-level metrics
Altmetrics for alt-products
Data Citation Credit via Altmetrics
From Flickr by chriscook04
What does this have to do with data?
impactstory.org
Website Email Tweet Slides
carlystrasser.net [email protected] @carlystrasser slideshare.net/carlystrasser
Big thanks to John Kratz, CLIR Postdoc