OPEN DATA: LOCATING & SHARING RESEARCH DATA TO PROMOTE GLOBAL SCHOLARLY COMMUNICATION Stephanie...
-
Upload
suzan-hopkins -
Category
Documents
-
view
215 -
download
0
Transcript of OPEN DATA: LOCATING & SHARING RESEARCH DATA TO PROMOTE GLOBAL SCHOLARLY COMMUNICATION Stephanie...
OPEN DATA:LOCATING & SHARING RESEARCH DATA TO PROMOTE GLOBAL SCHOLARLY COMMUNICATION
Stephanie Swanberg, MSI, AHIPAssistant Professor, Information Literacy & eLearning LibrarianOakland University William Beaumont School of Medicine, [email protected]
RECAP: OPEN ACCESS
Open Access: Defined
“Free availability on the public internet, permitting any users to read, download, copy, distribute, print, search, or link to the full-texts of these articles, crawl them for indexing, pass them as data to software, or use them for any other lawful purpose, without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself.”
Peter Suber from the Budapest Open Access Initiative
JR(Suber 2012)
Traditional Publishing vs Open Access
Pay to read Restricted access
Library subscriptions Personal subscriptions
Limited peer review Often not free to
use/reuse Delayed publishing cycle
Long turnaround from research – submission – publication
Some publishing fees
Free to read Widely available Unlimited peer
review/sharing/collaborating
Often free to use/reuse
Shorter publishing cycle
Some publishing fees
TRADITIONAL OA
Benefits of OA
Researchers/Authors Increased visibility ~ larger potential audience Increased impact ~ more citations (See Swan 2010)
Shorter publishing times ~ quicker dissemination of research
Readers/Teachers Barrier-free access to important scholarly literature OA literature ~ authors/copyright holders have
given permission in advance for classroom/teaching uses
JR(SPARC; Hitchcock 2011; Swan 2010)
Open Data
Open Data: Defined
“Open data is data that can be freely used, re-used and redistributed by anyone - subject only, at most, to the requirement to attribute and share alike.”
Open Data Handbook
(Open Knowledge 2015)
Video from the Open Data Institute: https://vimeo.com/125783029
3 Components of “Open” Data
•available as a whole•available in a convenient and modifiable form•preferably by downloading over the internet
Availability and Access
•permit re-use and redistribution of data including the intermixing with other datasets
Re-use and Redistribution
•everyone must be able to use, re-use and redistribute•including no restrictions on commercial use or for education purposes only
Universal Participation
(Open Knowledge 2015)
What Types of Data?
(Open Knowledge 2015)
SCIENTIFIC RESEARCH
CULTURAL WORKS/ARTIF
ACTS
FINANCE
STATISTICS
WEATHERHISTORY
CLIMATE
HEALTH
TRANSPORTATION
EDUCATION
COMMUNICATION
Why Open Data?
ENCOURAGES: Reuse of datasets and discourages duplication
of effort Proper management of data Transparency and reproducibility of research
IMPROVES: Discoverability/visibility of datasets Attribution to original data producers and
curators
(University of California 2014)
Locating Existing Datasets
Best Bet Data Resources
Data.gov Managed by the US General Services
Administration Primarily data from federal agencies, but
state, local, and tribal government data is also represented
Currently 186,570 datasets available in:
Best Bet Data Resources
Dryad – datadryad.org Major repository for data from all disciplines Includes data from research, education,
industry, and more
Healthdata.gov (BETA) Managed by the US Department of Health &
Human Services Currently 1,904 datasets available in health
care topics
Search Google: -- datasets site:.gov
Publishing your Datasets
Making your Data Openly Available
1) Publish Just the Data Institutional Repositories
OUR@Oakland - Oakland University’s Institutional Repository Sample Dataset:Raw
Data for the 2004 Freshwater Mussel Survey of the Clinton River
Dryad: datadryad.org Just as you can find existing open datasets on
Dryad, you can also publish your data here!
2) Publish a Summary in a Data Journal Data journals are dedicated to publishing
summaries or descriptions of datasets Some traditional journals are also now including
“data descriptor,” “databases,” or similar as a publication type
Most data journals do not archive the actual data, but have authors submit data elsewhere (such as to institutional repositories or Dryad)
(University of California 2014)
Examples of Data Journals
Scientific Data (Nature) “Open access, peer-reviewed publication
for descriptions of scientifically valuable datasets”
Primary Article Type: Data Descriptor
Check out UM Library’s May 2014 blog post for a list of data journals in many disciplines: https://mlibrarydata.wordpress.com/2014/05/09/data-journals/
Activity: Dissecting a Data Descriptor
Take ~5 minutes and read through the data paper you received
With a partner, compare your data papers and discuss the following: What journal is the paper from? What are the major components of the article? How is this different from a traditional journal
article? Think about a research project you’ve been
involved in. If you were to publish your data in this journal, how would you need to clean your data in order for it to be ‘readable’ to your audience (organization, labeling, etc)?
3) Publish your research in a traditional journal AND make your data openly available!
Check journal guidelines regarding publishing data associated with your article
Dryad Journal Lookup Tool – http://datadryad.org/pages/journalLookup Look up the journal(s) you’re planning to submit a
manuscript to Dryad will tell you:
1. How they work with that journal to publish your data2. When you should submit your data to Dryad
Apply an Open Data License
Open Data Commons: opendatacommons.org Provides legal tools and licenses for open
data
Creative Commons equivalent for data
Mechanism for telling the world exactly how they can use your data/dataset
(Open Data Commons 2015)
2 Basic Open Data Licenses
Public Domain - your data is in the public domain Include the following statement as part of your dataset:
This {DATA(BASE)-NAME} is made available under the Public Domain Dedication and License version v1.0 whose full text can be found at http://opendatacommons.org/licenses/pddl/ - See more at: http://opendatacommons.org/guide/#sthash.2ZOKxdT1.dpuf
Share-Alike Plus Attribution - users of your data must also share their data and attribute you as the source Include the following statement as part of your dataset:
This {DATA(BASE)-NAME} is made available under Open Database License whose full text can be found at http://opendatacommons.org/licenses/odbl/. Any rights in individual contents of the database are licensed under the Database Contents License whose text can be found http://opendatacommons.org/licenses/dbcl/ - See more at: http://opendatacommons.org/guide/#sthash.2ZOKxdT1.dpuf
(Open Data Commons 2015)
Summary
Open data is just one component of the larger open access movement that can promote global collaboration while increasing the impact of your own scholarly endeavors
A variety of online resources exist that collect and preserve this data including data.gov, Dryad, and more!
You can be both a user of open data and a contributor by publishing your own data
“Rather than relying on journal articles alone for scholarly communication, data sets can be published as first-class scholarly products, either alongside journal articles that use the dataset or as a standalone object with inherent value.”
Office of Scholarly Communication, University of California
(University of California 2014)
Bibliography
Hitchcock, S. (2011). The effect of open access and downloads ('hits') on citation impact: a bibliography of studies. The Open Citation Project - Reference Linking and Citation Analysis for Open Archives. Retrieved from http://opcit.eprints.org/oacitation-biblio.html
Open Data Commons. (2015). 2-minute Guide to Making Your Data Open. Retrieved from: http://opendatacommons.org/guide/
Open Knowledge. (2015). Open Data Handbook. Retrieved from: http://opendatahandbook.org/guide/en/what-is-open-data/
Open Knowledge. (2015). What is Open? Retrieved from: https://okfn.org/opendata/
SPARC (n.d.). Digital Repositories Offer Many Practical Benefits. Retrieved from http://www.arl.org/sparc/greaterreach/practical_benefits/index.shtml
Suber, P. (2012). Open Access Overview: Focusing on open access to peer-reviewed research articles and their preprints. Retrieved from http://dash.harvard.edu/bitstream/handle/1/4729737/suber_oaoverview.htm
Swan, A. (2010). The Open Access citation advantage: Studies and results to date. Technical Report , School of Electronics & Computer Science, University of Southampton. Retrieved from http://eprints.ecs.soton.ac.uk/18516/2/Citation_advantage_paper.pdf
University of California Office of Scholarly Communication. (2014). Data Publication. Retrieved from: http://osc.universityofcalifornia.edu/scholarly-publishing/data-publication/
Photo Courtesy: Question Question Mark Request Matter Requests by geralt, Pixabay, http://pixabay.com/en/question-question-mark-request-63916/ (public domain)