New Metaphors: Data Papers and Data Citations
Uploaded by john-kunze
New Metaphors: Data Papers and Data Citations
27 February 2012
UC Curation Center
California Digital Library
Metaphors we live by
“... metaphor is pervasive in everyday life, not just in language but in thought and action. Our ordinary conceptual system, in terms of which we both think and act, is fundamentally metaphorical in nature.”
From Lakoff and Johnson, Metaphors We Live By, 1980
(thanks to Parsons & Fox, Is Data Publication the Right Metaphor?, 2011)
Digital = Metaphorical
Everything is a story on top of sequences of bits
• Fonts, files, folders, formatting, phone calls
• Programs, protocols, data, tweets, even bits
Old metaphors can impede technical change
Disruptive technical change is inevitable
Roadmap for today’s talk
• Who we are
• What’s changed
• Forced incrementalism
• Data citation
• Traditional articles
• Data papers
• Closing metaphor
California Digital Library (CDL)
California Digital Library – born 1997
CDL supports the research lifecycle
• Collections
• Digital Special Collections
• Discovery & Delivery
• Publishing Group
• UC Curation Center (UC3)
University of California stakeholders
• 10 campuses
• 226K students, 134K faculty & staff
• 100’s of museums, art galleries, observatories, marine centers, botanical gardens
• 5 medical centers
• 5 law schools
• 3 Dept. of Energy national labs
Our environment circa 2002-2008
Focus on preservation
For memory organizations
Infrastructure: static
Services: hosted
Content: museum & library
Sustainability: ?
Our environment since 2008
Focus on preservation → curation (lifecycle)
For memory organizations, and now data producers
Infrastructure: static + cloud, vm, bitbucket
Services: hosted + partnered, self-serve
Content: museum & library + data, web crawls
Sustainability: cost recovery, pay once
Journal expenditures are outpacing library budgets
The Library Reality
• Journal expenditures rising
• Increase in research publication
• Increase in researchers
• Declining budgets
(Mabe, 2003) The growth of active, peer reviewed learned journals since 1665
(Mabe 2004, based on data from ISI and NSF)
Trends create a structural problem; calls on libraries to do more with less
Trends create a structural problem; climb the mountain step by step ...
Or look for a radical solution?
Practical incrementalism for the complex problem of data curation
• Baby steps – data paper/citation metaphors
• Chipping away – making the problem smaller
• DataONE global data network [NSF]
• Merritt data repository
• EZID for creating DOIs, ARKs, and URNs
• Data management plans (DMPTool)
• Web archiving service (WAS) [Library of Congress]
• Open-source Excel add-in [MS Research & GBMF]
The scientific record is at risk
Data dissemination is rare, risky, expensive, labor-intensive, domain-specific, and receives little credit as research output
Global Change Galactic Change
What data citation offers
• Credit
• Discovery
• Impact tracking
– Helping data authors verify use of their data and
– Helping identify how others have used the data
• With archiving: re-use and reproducibility
Traditional articles vs data papers
Need to save data + processing
Parallel pyramids
The collective data product
Need to save data + processing
Algorithms + Data Structures = Programs
Vision for a “data paper”
• Wrap the unfamiliar in a familiar façade
• A “data paper” is minimally a cover sheet and a set of links to archived artifacts
• Cover sheet contains familiar elements: title, date, authors, abstract, and persistent identifier (DOI, ARK, etc.)
• Just enough to permit basic exposure and discovery
– Building a basic data citation
– Indexing by services such as Web of Science, Google Scholar
– Instilling confidence in the identifier’s stability
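The cover-sheet idea above can be sketched as a small data structure. This is only an illustration under stated assumptions, not the CDL's actual format: the class name DataPaper, its field names, the citation layout "Authors (Year): Title. Publisher. Identifier", and every value in the example (including the identifier and URLs) are hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DataPaper:
    """Hypothetical cover sheet for a data paper (field names are assumptions)."""
    title: str
    year: int
    authors: List[str]
    abstract: str
    identifier: str                  # persistent identifier, e.g. a DOI or ARK
    publisher: str = ""              # optional; omitted from the citation if empty
    artifact_links: List[str] = field(default_factory=list)  # archived data + processing


def format_citation(paper: DataPaper) -> str:
    """Build a basic citation string from the cover-sheet elements.

    The layout used here is an assumed convention, chosen for illustration.
    """
    authors = "; ".join(paper.authors)
    parts = [f"{authors} ({paper.year}): {paper.title}."]
    if paper.publisher:
        parts.append(f"{paper.publisher}.")
    parts.append(paper.identifier)
    return " ".join(parts)


# All values below are made-up placeholders, not real records.
example = DataPaper(
    title="Example Tide Pool Survey Data",
    year=2012,
    authors=["Doe, J.", "Roe, R."],
    abstract="Placeholder abstract for illustration only.",
    identifier="doi:10.9999/EXAMPLE",
    publisher="Example Repository",
    artifact_links=[
        "https://example.org/data.csv",
        "https://example.org/process.py",
    ],
)

print(format_citation(example))
```

The point of the sketch is that the cover sheet carries just enough metadata to produce a citation and be indexed, while the data and processing themselves live behind the links.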
Data Papers at the CDL
UC Curation Center
• Merritt Curation repository
• EZID: Persistent id management and resolution (ARKs, DOIs, et al.)
Publishing Services Program
• Online journals, with peer review
• Scholarly communication: grey literature to post-prints
• Search and display tools (XTF)
Provide incremental benefit for incremental effort
... plus nano-publications and executable papers.
Data paper: envisioned outcomes
• Familiar look and feel eases adoption and indexing
• Attribution motivates deposit
• Stable storage and ids lead to citation and impact
• Data products enter the record instead of being lost
• Data journals spring up around disciplines
Metaphors we close with
“Our ordinary conceptual system, in terms of which we both think and act, is fundamentally metaphorical in nature.”
OTOH, “the more things change the more they remain the same”
Questions?
California Digital Library http://www.cdlib.org/
“Data Paper” Paper: http://escholarship.org/uc/item/9jw4964t