RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Jian Qin, Syracuse University; Alex Ball, UKOLN; Jane Greenberg, University of North Carolina at Chapel Hill: “Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data”. Panel: Linked data and metadata (co-sponsored by the ASIS&T Digital Libraries SIG). Research Data Access & Preservation Summit 2013, Baltimore, MD, April 4, 2013. #rdap13

Transcript of RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Page 1: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

RDAP2013, http://www.asis.org/rdap/

Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data

Jian Qin, School of Information Studies, Syracuse University, USA

Alex Ball, Digital Curation Centre, UKOLN, University of Bath, UK

Jane Greenberg, School of Information and Library Science, University of North Carolina at Chapel Hill, USA

Research Data Access & Preservation Summit, Baltimore, MD, 2013

Page 2: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Metadata standards for scientific data

CSDGM

Ecological Metadata Language (EML)

Access to Biological Collections Data – ABCD

Darwin Core

Page 3: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Many tools have been developed…

Page 4: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Many tools have been developed…

Page 5: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Many tools have been developed…

Page 6: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Motivation for adoption

• Standardize the format and terminology of metadata

• Enable fast and effective discovery of datasets across different data repositories

• Enable data sharing, reuse, and preservation

• Provide information for obtaining datasets from the data owners

Page 7: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Hindering factors

Metadata standards for scientific data have:

• large numbers of elements

• many layers in structure

– “Unwieldy to apply”

• been created for manual data entry, rather than for automatic generation

Effects on metadata generation:

• steep learning curve

• difficult to automate metadata generation

• unnecessary duplicate data entry

• high costs in time, resource, and personnel expertise

Page 8: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Same entity data repeated in the same record…

Seamless Daily Precipitation for the Conterminous United States

Metadata:

• Identification_Information

• Data_Quality_Information

• Spatial_Data_Organization_Information

• Spatial_Reference_Information

• Entity_and_Attribute_Information

• Distribution_Information

• Metadata_Reference_Information

Page 9: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

…and they are already in…

Publication record associated with the data

Page 10: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Research questions

• What functions do metadata standards for scientific data serve?

• How should metadata standards for scientific data be modeled to support these functions by meeting the associated requirements?

Page 11: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Functions expected

• Resource discovery and use,

• Data interoperability,

• Automatic and semi-automatic metadata generation,

• Linking of publications and underlying datasets,

• Data/metadata quality control, and

• Data security.

Page 12: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Metadata requirements for scientific data

Page 13: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Functional view

Architectural view

Page 14: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Architectural view

Page 15: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Identity metadata

Globally, uniquely identify entities

• Person: ResearcherID, URI, FOAF, ORCID

• Institution: ORCID, URI

• Data object: DOI, Handle, URI

• Associated publication: DOI

Build a metadata infrastructure service

• Name repositories

• Linked data architecture

• Customizable research group/community member name lists
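Identifiers like those listed on this slide can be checked mechanically before they are written into a record. As a minimal sketch (not part of the original slides), the following validates the final check character of an ORCID iD, which is computed with the ISO 7064 MOD 11-2 algorithm. The function names are illustrative only.

```python
def orcid_check_char(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Return True if a hyphenated ORCID iD carries a correct check character."""
    compact = orcid.replace("-", "").upper()
    if len(compact) != 16:
        return False
    if not all(ch in "0123456789" for ch in compact[:15]):
        return False
    return orcid_check_char(compact[:15]) == compact[15]


if __name__ == "__main__":
    # 0000-0002-1825-0097 is the sample iD used in ORCID's documentation.
    print(is_valid_orcid("0000-0002-1825-0097"))
    # Same iD with the last character altered, so the check should fail.
    print(is_valid_orcid("0000-0002-1825-0098"))
```

A check like this only confirms that the identifier is well formed; it does not confirm that the iD is registered or belongs to the named person.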

Page 16: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Semantic metadata

• Large semantic resources available in linked data format, but usually not suitable for representing scientific data because they are designed for publications, especially books and journals (containers)

• Format is contemporary but the content is far from it

Smaller, specialized semantic resources are necessary for automatic semantic metadata generation

Page 17: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Contextual metadata

• Provenance

Provenance data model (W3C, http://www.w3.org/TR/prov-primer/)

Provenance data represent the origins of digital objects and describe the entities and activities involved in producing and delivering or otherwise influencing a given object.
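As a rough illustration of the model (not taken from the slides), the sketch below records provenance as plain Python tuples using relation names from the W3C PROV vocabulary (`used`, `wasGeneratedBy`, `wasDerivedFrom`, `wasAttributedTo`). All `ex:` identifiers and the `lineage` helper are made up for the example.

```python
from collections import defaultdict

# Each statement is (subject, PROV relation, object).
# Every "ex:" identifier below is hypothetical.
statements = [
    ("ex:gridding-run", "used", "ex:raw-gauge-data"),
    ("ex:daily-precip-grid", "wasGeneratedBy", "ex:gridding-run"),
    ("ex:daily-precip-grid", "wasDerivedFrom", "ex:raw-gauge-data"),
    ("ex:daily-precip-grid", "wasAttributedTo", "ex:research-group"),
    ("ex:monthly-summary", "wasDerivedFrom", "ex:daily-precip-grid"),
]


def lineage(entity, statements):
    """Return every entity that `entity` was derived from, directly or indirectly."""
    parents = defaultdict(list)
    for subject, relation, obj in statements:
        if relation == "wasDerivedFrom":
            parents[subject].append(obj)

    found, stack = [], [entity]
    while stack:
        for parent in parents[stack.pop()]:
            if parent not in found:
                found.append(parent)
                stack.append(parent)
    return found


if __name__ == "__main__":
    # Walk back from the summary product to its source data.
    print(lineage("ex:monthly-summary", statements))
```

A real deployment would more likely serialize these statements in one of the PROV formats rather than as tuples; the tuple form is used here only to keep the example self-contained.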

Page 18: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Geospatial metadata

[Diagram: geospatial standards and domain standards, connected by links labeled “Georeferencing elements”]

Geospatial standards:

• FGDC CSDGM

• ISO 19115:2003 Geographic information - Metadata

CSDGM Profiles:

• Biological Data Profile

• Shoreline Metadata Profile

Biological sciences:

• Darwin Core (DwC)

• Ecological Metadata Language (EML)

Climate:

• NetCDF Climate and Forecast (CF) Metadata Conventions

Astronomy:

• Astronomy Visualization Metadata Standard

Page 19: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Temporal metadata

• Mean solar time

• Civil time

• GPS time

• Terrestrial time

• Atomic time

• …

• Geologic time

• Different measurement systems result in different units and format

• Conversion between systems
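Some conversions between time systems reduce to fixed offsets. As a minimal sketch (not from the slides), the functions below convert a GPS clock reading to International Atomic Time (TAI) and Terrestrial Time (TT) using the defined constants TAI = GPS + 19 s and TT = TAI + 32.184 s. The function names are illustrative, and conversion to civil time (UTC) is deliberately left out because it requires a leap-second table.

```python
from datetime import datetime, timedelta

# Fixed, defined offsets between the three time scales.
GPS_TO_TAI = timedelta(seconds=19)
TAI_TO_TT = timedelta(seconds=32, milliseconds=184)


def gps_to_tai(gps_reading: datetime) -> datetime:
    """Convert a clock reading on the GPS scale to the TAI scale."""
    return gps_reading + GPS_TO_TAI


def tai_to_tt(tai_reading: datetime) -> datetime:
    """Convert a clock reading on the TAI scale to the TT scale."""
    return tai_reading + TAI_TO_TT


def gps_to_tt(gps_reading: datetime) -> datetime:
    """Convert a clock reading on the GPS scale to the TT scale."""
    return tai_to_tt(gps_to_tai(gps_reading))


if __name__ == "__main__":
    # Naive datetimes are used as clock readings; the scale is tracked by the caller.
    reading = datetime(2013, 4, 4, 12, 0, 0)
    print(gps_to_tt(reading).isoformat())
```

Systems such as mean solar time or geologic time do not convert by a constant offset, which is one reason a metadata record needs to state which time system its values use.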

Page 20: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Three Principles

• The least effort principle

• The infrastructure service principle

• The portable principle

Page 21: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Page 22: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Development potentials

• An infrastructure of metadata services

– Entities as linked data

– Tools for “slicing” members by research group, community, or institution to customize the entity set

– Tools for grabbing entity data from existing resources through interoperability protocols

Page 23: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Application scenarios

• Cross-domain discovery and verification

• Automatically populating entity information from customized slices of entities into metadata records

• And more…
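The second scenario can be pictured roughly as follows. This is only a sketch under assumed names: the `Person` class, the sample registry, and the `slice_by_group` and `populate_creators` helpers are hypothetical and do not correspond to any existing service described in the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Person:
    name: str
    identifier: str  # placeholder URIs; a real registry might hold ORCID iDs
    group: str


# Hypothetical shared entity registry.
REGISTRY = [
    Person("A. Researcher", "https://example.org/id/p1", "hydrology-lab"),
    Person("B. Analyst", "https://example.org/id/p2", "hydrology-lab"),
    Person("C. Curator", "https://example.org/id/p3", "data-library"),
]


def slice_by_group(registry, group):
    """Return the customized entity set for one research group."""
    return [person for person in registry if person.group == group]


def populate_creators(record, people, names):
    """Copy identity data for the named creators from a slice into a record."""
    by_name = {person.name: person for person in people}
    creators = [
        {"name": name, "id": by_name[name].identifier}
        for name in names
        if name in by_name
    ]
    return {**record, "creators": creators}


if __name__ == "__main__":
    lab = slice_by_group(REGISTRY, "hydrology-lab")
    record = populate_creators({"title": "Daily precipitation"}, lab, ["B. Analyst"])
    print(record)
```

The point of the sketch is that identity data is entered once in the shared registry and then reused, rather than retyped in every metadata record.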

Page 24: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Conclusion

• Scientific data are inherently complex and diverse

• Functional metadata requirements should be translated into an effective and efficient architecture

– Three principles for modeling metadata for scientific data

• Metadata for scientific data (or other domains at large) should adopt an infrastructure service approach

• Much to be explored, experimented, and evaluated

Page 25: RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata

Thank you!

Questions?