The Importance of Permanent IDs PIDs from a Publishers Perspective DOIs, ORCIDs, Fund Ref and the...

16
The Importance of Permanent IDs PIDs from a Publishers’ Perspective DOI’s, ORCIDs, Fund Ref and the Data Era Eefke Smit STM Director Standards and Technology Geneva, 17 October 2013

Transcript of The Importance of Permanent IDs PIDs from a Publishers Perspective DOIs, ORCIDs, Fund Ref and the...

  • Slide 1

Slide 2 The Importance of Permanent IDs PIDs from a Publishers Perspective DOIs, ORCIDs, Fund Ref and the Data Era Eefke Smit STM Director Standards and Technology Geneva, 17 October 2013 Slide 3 You say PID, I say DOI. http://doi.org/10.7484/INSPIREHEP.DATA.A78C.HK44 http://doi.org/10.7484/INSPIREHEP.DATA.RF5P.6M3K http://doi.org/10.7484/INSPIREHEP.DATA.26B4.TY5F Slide 4 And here is another DOI.. now from the related article in PhysLetters B, out of approx 80 mln DOIs.. DOI: 10.1016/j.physletb.2013.08.01010.1016/j.physletb.2013.08.010 Slide 5 The article is here, where are the data? Some high level data are embedded in the article, full background data to appear under a link Slide 6 Hell is a place where nothing connects to nothing T.S. Eliott in his introduction to Dantes Inferno. He added: We are in the business of finding connections--within texts, between texts and contexts, between texts and ourselves, between our readings and etc And this was well before the invention of the WWW Slide 7 Publishers realized very early the importance of an infrastructure for PIDs Already in 1996, Publishers started working on DOIs And Crossref was formed in 1998: http://crossref.org/CrossRefAnimation.html At present, after 15 years, Crossref registered some 84 mln DOIs for academic and STM publications It is an infrastructure to ensure links that last. And this was just the beginning.. In 2009 DataCite is established, publishers were quick to support it The ORCID initiative followed in 2010/ 2011 and was established as an organisation in 2012 Assembling Publishers, Funders, Research Communities, Academic Institutions and most of all: Identifying individual researchers and contributors In 2013 followed by a Funders Tracking service: FundRef, a registry for international funders. To identify Funders of Research, connected to articles, to identify related funders and the funded projects. Slide 8 A Dog on the Web; who knows? It is a matter of creating trust on the web This cartoon is from 1993 in The New Yorker, in the very early days of the web. 20 years later, with a much busier web, it is even more important to know whom you are dealing with on the web. The above cartoon by Peter Steiner has been reproduced from page 61 of July 5, 1993 issue of The New Yorker, (Vol.69 (LXIX) no. 20)only for academic discussion, evaluation, research and complies with the copyright law of the United States as defined and stipulated under Title 17 U. S. Code.The New Yorker, Slide 9 Why are PIDs so important Because the aim of Publishing is Validation Certification Accreditation Longevity In order to establish a sound and solid foundation and infrastructure for the Records of Science. Slide 10 Many stakeholders in the academic world apply the same thinking Crossref system for DOIs is used by more than 4000 organisations, not just publishers Some examples: - CERN assigned DOIs on the Higgs Boson data - Harvards Dataverse network uses DOIs - DataCite is based on DOI registration DataCite content negotiation (in cooperation with CrossRef) http://data.datacite.org http://crosscite.org/cn DOI Citation Formatter http://www.crosscite.org/citeproc/ Interoperability initiatives (syntactical, semantical and community interoperability), see also APARSEN Do not reinvent the wheel........... Slide 11 (1) Data contained and explained within the article (2) Further data explanations in any kind of supplementary files to articles (3) Data referenced from the article and held in data centers and repositories (4) Data publications, describing available datasets (5) Data in drawers and on disks at the institute The Data Publication Pyramid Slide 12 Data-linking based on tagged entities Example EMBO journal linking to Genbank Slide 13 12 Elsevier offers data-viewers from within the article, to data stored elsewhere: Slide 14 Application-based linking with Protein DB Explore protein structures relevant to the article zoom, rotate, change display settings, etc. 3D structure data integrated from Protein Data Bank Unique protein codes provided by the authors 60+ journals Biology, biochemistry, neuroscience, food research, etc. In collaboration with Protein Data Bank http://dx.doi.org/10.1016/j.jmb.2010.05.030 Slide 15 Datarepositories and Journals Slide 16 Data Journals Slide 17 Questions ? Eefke Smit [email protected]@stm-assoc.org