Advanced Climate Research Infrastructure for Data (ACRID)
Dr. Andrew Woolf (1), Dr. Tim Osborn (2), Dr. Arif Shaon (1), Dr. Colin Harpham (2)
(1) STFC e-Science Centre, RAL
(2) Climatic Research Unit, UEA
JISC 14/09
• Citing: “Agreed conventions for data citation and for data description are important for research data discovery. Persistent identification is required...”
• Linking: “A recent position paper written for JISC ... makes a case for the benefits of linking research data using semantic or linked data technology ... data on which a journal article is based are bi-directionally linked to other data, resources, articles and people.”
• Integrating: “Integrating heterogeneous data across distributed sources can enable effective and innovative reuse”
ACRID
• Advanced Climate Research Infrastructure for Data
• Collaboration between:
– Climatic Research Unit, University of East Anglia
– STFC e-Science Centre, Rutherford Appleton Laboratory
– Met Office (unfunded partner)
• Various inquiries following 2009 email hacking recommended greater access to data and workings
• Project aims:
– Information architecture, tools, infrastructure for managing climate data and processing workflows
– ‘linked-data’ approach for climate data publishing and citation
– Prototype using four high-profile climate datasets
Citing
• Convergence around DOI for linking publication to data in Earth science
– DataCite, Parsons and Duerr (2010), Wilson et al. (2010), UNESCO (2010), ESSD, etc.
• But “(w)hat is the citeable unit within a DOI?”
– file? set of files? OAIS AIP?
• Answer: linked-data graph
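One possible reading of "linked-data graph" as the citeable unit is that the DOI names a root resource and the cited object is everything reachable from that root by following links. The sketch below illustrates that reading only; it is not the ACRID implementation. The triples, URIs, predicate names and the `citeable_unit` helper are all hypothetical.

```python
from collections import deque

# Hypothetical (subject, predicate, object) triples; every identifier is made up.
TRIPLES = [
    ("doi:10.9999/example", "hasPart", "http://example.org/dataset/1/file/a"),
    ("doi:10.9999/example", "hasPart", "http://example.org/dataset/1/file/b"),
    ("http://example.org/dataset/1/file/a", "derivedFrom", "http://example.org/raw/stations"),
    ("http://example.org/other", "hasPart", "http://example.org/unrelated"),
]


def citeable_unit(root, triples):
    """Return the set of triples reachable from `root` by following links outward."""
    # Index triples by subject so each node's outgoing links are a dictionary lookup.
    by_subject = {}
    for s, p, o in triples:
        by_subject.setdefault(s, []).append((s, p, o))

    seen = {root}
    queue = deque([root])
    unit = set()
    while queue:  # breadth-first traversal from the DOI-identified root
        node = queue.popleft()
        for triple in by_subject.get(node, []):
            unit.add(triple)
            target = triple[2]
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return unit


cited = citeable_unit("doi:10.9999/example", TRIPLES)
```

Under this assumption the citation covers the two files and the raw source one of them was derived from, while triples about unrelated resources stay outside the unit.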
Linking
Integrating
• An example information model for Observations and Measurements (ISO/DIS 19156)
– An observation is an event that estimates an observed property of a feature of interest, using a procedure, and generating a result
[Figure 5 - Observation (UML class diagram, author Simon Cox): OM_Observation (attributes: phenomenonTime, resultTime, validTime [0..1], resultQuality [0..*], parameter [0..*]) is associated with GFI_PropertyType (+observedProperty, 1), GFI_Feature (+featureOfInterest, 1; reverse role +propertyValueProvider, 0..*), OM_Process (+procedure, 1; reverse role +generatedObservation, 0..*) and Any {root} (+result, association Range)]
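The observation model in the diagram can be sketched as a small data structure. This is a simplified illustration rather than an ISO 19156 or CSML binding: the Python field names, the use of plain strings for the feature, property and procedure, the use of `datetime` for the time attributes, and the sample values are all assumptions.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import Any, Optional


@dataclass
class Observation:
    """Loosely mirrors OM_Observation; names and types are illustrative only."""
    phenomenon_time: datetime   # when the observed property applied to the feature
    result_time: datetime       # when the result became available
    feature_of_interest: str    # stands in for GFI_Feature (multiplicity 1)
    observed_property: str      # stands in for GFI_PropertyType (multiplicity 1)
    procedure: str              # stands in for OM_Process (multiplicity 1)
    result: Any                 # the diagram types the result as Any
    valid_time: Optional[datetime] = None               # [0..1]
    result_quality: list = field(default_factory=list)  # [0..*]
    parameter: dict = field(default_factory=dict)       # [0..*]


# Hypothetical example: one air temperature reading at a made-up station.
obs = Observation(
    phenomenon_time=datetime(2010, 1, 1, 12, 0),
    result_time=datetime(2010, 1, 1, 12, 5),
    feature_of_interest="http://example.org/station/42",
    observed_property="air_temperature",
    procedure="http://example.org/procedure/thermometer",
    result=14.2,
)
```

The three mandatory associations become required fields, and the optional attributes default to empty, which matches the multiplicities shown in the diagram.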
Climate Science Modelling Language (CSML)
References
ACRID
• http://www.cru.uea.ac.uk/cru/projects/acrid
• http://www.jisc.ac.uk/whatwedo/programmes/mrd.aspx
Linked data
• http://linkeddata.org
• Tim Berners-Lee: Linked Data – Design Issues http://www.w3.org/DesignIssues/LinkedData.html
• Bizer, C., T. Heath and T. Berners-Lee (2009): Linked Data – The Story So Far http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf
URI structure
• W3C: Cool URIs don’t change http://www.w3.org/Provider/Style/URI
• Cabinet Office (2009): Designing URI Sets for the Public Sector http://www.cabinetoffice.gov.uk/media/308995/public_sector_uri.pdf
• Cabinet Office (2010): Designing URI Sets for Location
CSML
• http://ndg.nerc.ac.uk/csml
OAI-ORE
• http://www.openarchives.org/ore
Questions?