Preserving Research Data in Canada: an update DLI/ACCOLEDS 2009 Chuck Humphrey University of Alberta...
-
Upload
sean-crowley -
Category
Documents
-
view
216 -
download
0
Transcript of Preserving Research Data in Canada: an update DLI/ACCOLEDS 2009 Chuck Humphrey University of Alberta...
Preserving Research Data in Canada: an update
DLI/ACCOLEDS 2009
Chuck Humphrey
University of Alberta
1
Environmental scan of this decade
2000
2002
2004
Research Data Centre Network, 2000
2008
2006
2001
2003
2005
2007
2009
DLI becomes a permanent program, 2001
National Data Archive Consultation, 2001-2002
OECD Access to Publicly Funded Research Data, 2004
Canadian Digital Information Strategy, 2006-2007
Consultation on Access to Scientific Research Data, 2005
International Data Forum, 2007
Research Data Strategy Working Group, 2008
CARL Data Management Working Group
UNESCO Charter on Preservation of Digital Heritage, 2003
2
3
Tipping the balance toward action
Research Data Strategy Working Group
Initiated by Pam Bjornson, CISTI Executive Director Cross-sector working group consisting of members from
government departments & agencies and research libraries Stewardship of Research Data in Canada: A Gap
Analysis (January 2009). Uses a lifecycle model to identify data problems in Canada.
RDSWG reorganized in anticipation of the release of the Gap Analysis in fall 2008. Task Group 1: Engagement strategy Task Group 2: Policies, funding and reward systems Task Group 3: Infrastructure and services Task Group 4: Capacity
4
Gap analysis summary
Source: The Stewardship of Research Data in Canada: a gap analysis, Table 2, page 17. 5
CARL Data Management Working Group
Members Marnie Swanson, Chair (U of Victoria) Pam Bjornson (CISTI) Lynn Copeland (SFU) Michelle Edwards (U of Guelph)
Observers Bernie Gloyn (Statistics Canada) Margaret Haines (Carleton U) Janine Schmidt (McGill U) Kathleen Shearer (CARL consultant)
Produced the Data Management Awareness Toolkit
6
Research Data Management Seminar
7
http://www.dcc.ac.uk/lifecycle-model/8
http://www.dcc.ac.uk/lifecycle-model/9
10
This table lists changes to the stages in the DCC model, re-aggregating activities in the lifecycle to create a data library viewpoint.
DCC Data Lib
create or receive data production
appraisal and select
dissemination
ingest, store, access and use
data repository
discovery
transform repurpose11
Data stewardship lifecycle
Data Repurposing
Data ProductionData Repository
Data Dissemination
Data Discovery
12
Where are we headed?
Trusted Research Data Repositories (TRDR’s) are emerging as a new institutional model to support data preservation. Based on work advanced for trusted digital repositories, TRDR’s are specialized services dedicated to research data.
Internationally, the U.S. and Europe are investing in the development of infrastructure to support TRDR’s. NSF DataNet Europe’s Digital Repository Infrastructure Vision
for European Research (DRIVER) DRIVER II: Federated data repositories
13
VirtualVirtualCommunityCommunity
Network
Grid
Scientific Data
VirtualVirtualCommunityCommunity
Network
Grid
Scientific Data
VirtualLaboratories
Workspace
Meetings, experiments, etc.
VirtualVirtualCommunityCommunity
Network
Grid
Scientific Data
VirtualLaboratories
Workspace
Meetings, experiments, etc.
VirtualLaboratories
Workspace
Meetings, experiments, etc.
Network
Grid
Scientific Data
Econ
om
ies
Econ
om
ies
of
Scale
of
Scale
Effi
cie
ncy
Effi
cie
ncy
Gain
sG
ain
sGlobal virtual research community
Source: Ulf Dahlsten, “Building a global virtual research community,” at the International Data Forum, Beijing, June 7, 2007
14
e-I
nfr
ast
ructu
re
of
re
posi
tori
es
e-I
nfr
ast
ructu
re
for
re
posi
tori
es
Management TransparentResponsiveInformedGrids, Virtual Organisations, etc
Repositories TrustedOpenWell managedRepository management, curation, physical security,
etc
Repositories services Ease of useAvailabilityReliabilityDeposit, annotation, delivery, visualisation, search,
help, etc
Information AuthenticityQualityLongevity
Collections: data, work-flows, publications, learning materials, etc.
AvailableScaleableReliableNetworks, computing, HPC, physical storage, etc
Physical infrastructure
Access StandardisedStableFlexible
Authentication, authorisation, logical security, federation, portals, etc
15Source: Mário CampolargoOpen Grid Forum Barcelona, 3 June 2008 source: eSciDR study (adapted)
e-Infrastructure for repositories
Data stewardship framework
Infrastructure layerInfrastructure layer
Data layerData layer
Services layerServices layer
Metadata layerMetadata layer
PPrroodduuccttiioonn
PPrroodduuccttiioonn
AAcccceessss
AAcccceessss
PPrreesseerrvvaattiioonn
PPrreesseerrvvaattiioonn
Arc
hite
ctur
e
Lifecycle activities
16
Data stewardship framework
Production Access Preservation
Services
Metadata
Data
Infrastructure
Local
National
International
Local
National
International
Local
National
International
17
18
What’s next?
The agenda for research data over the next decade, while substantial, can be managed through collaboration. We can all play a role in furthering developments in sound data management and in advancing data stewardship.