Update on the Canadian RDM Landscape

Post on 02-Jun-2022

4 views 0 download

Transcript of Update on the Canadian RDM Landscape

Update on theCanadian RDM Landscape

Jeff Moon, Director, PortageAtlantic Canada RDM Day | 21 October 2020

Funding in support of the Portage Network’s stewardship of research data within Canada is administered through the New Digital Research Infrastructure Organization (NDRIO).Le financement accordé pour l’intendance des données de recherche au Canada du réseau Portage est administré au travers de la Nouvelle organisation de l’information de recherche numérique.

DMP Exemplars

DMP Templates

Image courtesy: Daimon Tayler-McLeod

Agenda

1. What is Research Data Management?

2. Introduction to Portage

3. Making data FAIR1. Tri-agency RDM Policy

2. Supports for institutions & researchers

3. Other initiatives

4. Looking forward → DM under NDRIO

Photo by Thomas Renaud on Unsplash

"Research data management concerns the

organisation of data, from its entry to the research

cycle through to the dissemination and archiving

of valuable results. It aims to ensure reliable

verification of results, and permits new and

innovative research built on existing information."

Whyte, A., Tedds, J. (2011). ‘Making the Case for Research Data Management’. DCC Briefing Papers. Edinburgh: Digital Curation Centre. Available online

Photo by Shahadat Rahman on Unsplash

Research

Life Cycle

RDM Drivers

Making effective use

of public funds

Improving discoverability &

accessibility

Extending research

Facilitating interoperability

Supporting replicability

Growing demand for data

Avoiding duplication

Verification of research results

Meeting funder & journal

requirements

Growing public awareness of

data

Enabling good public policy

making

Aligning with international best

practices & standards

steer by zidney from the Noun Project RDM Drivers

What keeps me up at night . . .

Data

Commercial interests

Preventing data loss

. . . it can happen to you!

infrastructure by Nithinan Tatah from the Noun Projectcomputing by Adrien Coquet from the Noun Projecttools by tanu doank from the Noun Projecttraining by Adrien Coquet from the Noun Project

Network of Experts

Training

Services

Tools

Infrastructure Platforms

International & Domain Focus

Facilitation & Convening

Planning

Policies

Design Research Data

Management Plan

Reuse

Finding data

Data citation

Collect & Analyze

Capture and organize data

Active Storage and backup

Documentation & metadata

File naming & formats

Collaborate

Deposit &

Preserve

Reformatting

Standards

Archival Storage

Publish

Data sharing

Copyright & Ownership

Ethics

Research

Data

Management

Life Cycle

DMP

Expert Group

Discovery &

Metadata

Expert Group

Preservation

Expert Group

Sensitive Data

Expert Group

Curation

Expert Group

National

Training

Expert Group

Research

Intelligence

Expert Group

Data Repositories

Expert Group

Ne

two

rk o

f E

xp

ert

s

130+ Experts 60+ Organizations

DMP

Coordinator

Discovery &

Metadata

Coordinator

Preservation

Coordinator

Policy, Privacy, &

Sensitive Data

Coordinator

Curation

Coordinator

& Officers

National

Training

Coordinator

Research

Intelligence

& Assessment

Coordinator

Communications

& Project Officers

Na

tio

na

l S

up

po

rt

FAIR Principles

Findable Accessible Interoperable Reusable

A set of principles to ensure that data are shared in a way that

enables and enhances reuse by humans and machines

Funder

Policies

Data

Management

Plans

Institutional

StrategyDeposit

DRAFT Tri-Agency Research Data Management Policy For Consultation

Broad uptake

Revised in 2020

Data Management PlansInstitutional

Strategy Deposit

RDMStrategyTemplate

National, Multi-disciplinary RepositoryOptions

DMP Assistant

National, online, bilingual, Data Management Planning Tool

New version imminent

New discipline-specific

Exemplars & Templates

Dataverse

&

Federated Research

Data Repository

https://publons.com/benefits/institutions

How do I make

my data FAIR?

Institutional

Buy-in

STRATEGY COMPONENTS

Raise awareness

Assess institutional readiness

Formalize RDM practices

Define a Roadmap

Institutional

RDM Strategy

What is a Data Management Plan (DMP)?

➔ Describes what data you expect to

acquire or generate during the course of

a research project and why

➔ Explains how you will manage, describe,

analyze, and store your data and who will

be responsible

➔ Details when and where you will deposit

your data and how it will be shared

https://communitylivingstmarys.ca/services/community-development-and-planning-services/

➔ Helps you think ahead and map out

how you will manage, describe,

analyze, store, and share your data

➔ Helps identify areas for improvement &

questions that need to be answered

➔ Provides you & others with a record of

what you intend(ed) to do

➔ They are (or will be) required

Why DMPs?

https://media.defense.gov/2017/Nov/13/2001842185/-1/-1/0/171026-F-RN211-001.JPG

Infrastructure and Support

DMP Assistant

➔ National, online and bilingual

➔ Step-by-step, easy-to-use

➔ Framed around key sections, questions, & guidance

➔ Update anytime, share with collaborators

➔ Output in funder-ready formats

Visit: https://assistant.portagenetwork.ca/

Research Data Storage ContinuumA

cti

ve S

tora

ge

Controlled Access

Working Copy

Short-term

Duration of

project

Used to

complete

research

AC

TIV

E S

TO

RA

GE

From the Noun Project: storage by Nithinan Tatah | Share by Prasad | Time by Alice Design | working by Ranah Pixel Studio ect | Access by Adrien Coquet ject |Unlock by Zulfa Mahendra | Research by sandra |click by Delwar Hossain

Research Data Storage ContinuumA

cti

ve S

tora

ge

Re

po

sit

ory

Sto

rage

Controlled Access

Working Copy

Short-term

Duration of

project

Used to

complete

research

Open

(as appropriate)

Medium-term

Beyond duration

of project

Discovery &

Access

From the Noun Project: storage by Nithinan Tatah | Share by Prasad | Time by Alice Design | working by Ranah Pixel Studio ect | Access by Adrien Coquet ject |Unlock by Zulfa Mahendra | Research by sandra |click by Delwar Hossain

Dissemination

Copy

RE

PO

SIT

OR

Y S

TO

RA

GE

AC

TIV

E S

TO

RA

GE

Research Data Storage ContinuumA

cti

ve S

tora

ge

Re

po

sit

ory

Sto

rage

Pre

se

rva

tio

n P

roce

ssin

g

Arc

hiv

al S

tora

ge

Controlled Access

Working Copy

Short-term

Duration of

project

Used to

complete

research

Open

(as appropriate)

Dissemination

Copy

Medium-term

Beyond duration

of project

Discovery &

Access

Open

(as appropriate)

Preservation Copy

Long-term

Disaster recovery/

Copy of last resort

AR

CH

IVA

L S

TO

RA

GE

AC

TIV

E S

TO

RA

GE

RE

PO

SIT

OR

Y S

TO

RA

GE

From the Noun Project: storage by Nithinan Tatah | Share by Prasad | Time by Alice Design | working by Ranah Pixel Studio ect | Access by Adrien Coquet ject |Unlock by Zulfa Mahendra | Research by sandra |click by Delwar Hossain

ARCHIVAL STORAGE ACTIVE STORAGE

REPOSITORY STORAGE

ResearchLife Cycle

Benefits of Data Repositories

● Ensure data are discoverable & accessible beyond the original study

-- The Availability of Research Data Declines Rapidly with Article Age (Vines et al., 2014)

● Support publishing datasets for discovery and re-use

● Assign a Digital Object Identifier (DOI) for unambiguous citation

● Set licensing terms specifying how datasets may be used

● Monitor research impact by tracking use of published datasets

Repository Options in Canada: A Portage Guide

Storage, Discovery & Access

Infrastructure and Support: Repositories

A scalable, federated platform for digital research data

management and the discovery of Canadian research data

Big data by Arafat Uddin from the Noun Project

Server by Graphic Tigers from the Noun Project

Subfolder by shashank singh from the Noun Project

Big data capable

& scalable

Retains file

hierarchies

Geographically

distributed

Federated Research Data RepositoryTotal Datasets in FRDR Repository: 135

Total number of FRDR accounts: 275

Total Published: 14.9 TB Oct 2020

Institutions

51

Dataverses

450+

Datasets

2,307

Files

29,862

Downloads

178,642

Nov 12, 2019

University of British Columbia licensed

Simon Fraser University

University of Northern British Columbia

UAL

Dataverse Dataverse

Scholars Portal

Dataverse

https://www.technologynetworks.com/informatics/news/deep-learning-algorithm-could-remove-materials-discovery-bottleneck-339063

How do I make my data FAIR?

Ensure they are findable

& accessible

FRDR.ca

Metadata harvested to FRDR

Domain-specific

Repositories

General Repositories

National Discovery Layer

Improve discovery of Canadian research

(meta)data

Break down repository siloes

Drive traffic to existing repository

sites

Create interoperability

between Canadian and international

platforms

Government Repositories

Harvested Canadian repositories: 79

https://techbeacon.com/app-dev-testing/seven-key-enablers-continuous-testing

How do I make my FAIR

Other initiatives…

Fair EnablersPersistent Identifiers

➔ Persistent identifier (PID): a

long-lasting reference to a

digital resource

➔ Provides the information

required to reliably identify,

verify and locate

➔ Example: Digital Object

Identifier (DOI)

Fair EnablersPersistent Identifiers

DataCite Canada

Consortium

➔ Support Canadian institutions in managing and providing DOIs

➔ Allows Canadian researchers to obtain DOIs for their research

outputs easily and without direct costs

Fair EnablersPersistent Identifiers

ORCID-CA: The ORCID

Consortium in Canada

➔ Obtain an ORCID iD for free from https://orcid.org/

➔ Publish information about your research interests and collate

all your research outputs in one location

➔ Solve name ambiguity and researcher identification problems

➔ Major publishers, funders and research institutions have been

adopting

Other Potential PIDs

Source: https://www.slideshare.net/OpenAIRE_eu/new-pid-developments

Fair EnablersRepository Certification

Internationally endorsed set of core characteristics of trustworthy data repositories

https://www.coretrustseal.org/why-certification/certified-repositories/

Fair EnablersRepository Certification

Align with emerging TRUST Principles:• Transparency• Responsibility• User Focus• Sustainability• Technology

https://www.nature.com/articles/s41597-020-0486-7

Fair Enablers Metadata, Controlled Vocabularies

& Discovery

FASTSubject Headings

(Faceted

Application of

Subject

Terminology)

Improving &

expanding

metadata

harvesting

from Canadian

Repositories

Improving

Geospatial

discovery

through

GEODISY

Project

GeospatialDiscovery(beta)

https://geo.frdr-

dfdr.ca/

Fair Enablers Sensitive Data

Chandra Kavanagh, Memorial University

✓Glossary of sensitive data terms

✓Defining risks related to roles of individuals

✓Deposit-friendly text for ethics, informed consent

✓Data access agreements

✓Research Data Risk Matrix

Risks to Research

Participants

Risks to Groups,

Communities and Third

Parties

Risks to Researchers

Risks to Institutions

Risks to Data

Risk Management

Fair Enablers Sensitive Data

Chandra Kavanagh, Memorial University

✓Glossary of sensitive data terms

✓Defining risks related to roles of individuals

✓Deposit-friendly text for ethics, informed consent

✓Data access agreements

✓Research Data Risk Matrix

Risks to Research

Participants

Risks to Groups,

Communities and Third

Parties

Risks to Researchers

Risks to Institutions

Risks to Data

Risk Management

Fair Enablers Sensitive Data

✓ FRDR-aligned project to support sensitive

data

✓ Exploring viability of zero-knowledge

encryption in an RDM context

✓ Developing tools to encrypt packages at time

of deposit

✓ Supporting metadata-only review and

discovery of sensitive datasets.

Fair Enablers Training!

Strengthening Research Data Management in Canada:

A National Training Strategy

[Draft] October 2020

Looking Forward

DRI before…

DRI after…

New CEO: Nizar Ladak

NDRIO Board

https://engagedri.ca/wp-content/uploads/2020/09/Fact-Sheet.pdf

https://engagedri.ca/wp-content/uploads/2020/09/Fact-Sheet.pdf

https://engagedri.ca/wp-content/uploads/2020/09/Fact-Sheet.pdf

https://thebulletin.org/doomsday-clock/current-time/

Key priorities for DM

RDM Platform Support &

Development

Develop and operationalize

national, collaborative RDM

platforms & services

Transition to NDRIOMerger of Portage & RDC

Committees----------------------------------------------------------------------------

Integration with ARC & RS----------------------------------------------------------------------------

Needs Assessment &

Input from Researcher Council

National Data

Stewardship

Support

Providing coordinated support

at the national level, across

the data life cycle

Oct 1

2019Sept 30

2020

April 1

2020

Mar 31

2021

N D R I O D M F u n d i n g

U n d e r S t r a t e g i c P l a n

Mar 31

20242022 – 2023

C o o r d i n a t i o n & A l i g n m e n t w i t h A R C , R S , & N e t w o r k

S t r a t e g i c P l a n

B r i e f i n g N o t e s

Dec

2020

Jan 31

2020

C o r p

P l a n

D e v

C A N A R I E / I S E D –

D i r e c t e d F u n d i n g

N D R I O D i r e c t e d F u n d i n g

U n d e r C o r p o r a t e P l a n

DM Timeline for transition to NDRIO

C A N A R I E

• Improving discovery

• Developing & deploying a national RDM training strategy

• Growing a national network of data curation support

• Supporting the evolution of national preservation services

• Addressing issues related to sensitive data

• Working with domains to address discipline-specific issues

• Promoting data management planning

• Advancing repository certification efforts in Canada

• Successfully merging DM into NDRIO

Looking Forward

http://gojmff.org/program-areas/other-initiatives.cfm

Thanks again to our partnersCARL Portage is supported by directed funding from Innovation, Science &

Economic Development Canada (ISED), flowing through NDRIO

Network of Experts

Questions?

Jeff MoonDirector, Portage NetworkEmail: portagedir@carl-abrc.caportagenetwork.ca