Open Science: Research Data Management

16
Open Science: Research Data Management April 2016 Wouter Haak VP Research Data Management Solutions

Transcript of Open Science: Research Data Management

Page 1: Open Science: Research Data Management

Open Science: Research Data Management

April 2016

Wouter Haak VP Research Data Management Solutions

Page 2: Open Science: Research Data Management

ICSU/IAP/TWAS/ISSC: It is widely recognized that ‘data re-use’ is not just a technology challenge, or something that the funding bodies can just change by themselves

“Open Data in a Big Data world” – January 2016 Four major organisations representing global science, the International Council for Science (ICSU), the InterAcademy Partnership (IAP), The World Academy of Sciences (TWAS) and the International Social Science Council (ISSC)

Backup

Page 3: Open Science: Research Data Management

When you leave your institution…what happens with your data?

„Forschende und ihre Daten. Ergebnisse einer österreichweiten Befragung (eBook)“ E-infrastructures Austria Bauer, B. (Bruno) et all Oct 2015 https://phaidra.univie.ac.at/detail_object/o:407736

Stays at institution

Take it with me

Don’t know

Data is lost

Other

Page 4: Open Science: Research Data Management

Is your research data useful for others?

Frequently

Yes

No

„Forschende und ihre Daten. Ergebnisse einer österreichweiten Befragung (eBook)“ E-infrastructures Austria Bauer, B. (Bruno) et all Oct 2015 https://phaidra.univie.ac.at/detail_object/o:407736

Page 5: Open Science: Research Data Management

The 10 components for effective research data 10

. Int

egra

te u

pstre

am a

nd d

owns

tream

mak

e m

etad

ata

to s

erve

use

.

Save

Share

Use

9. Re-usable (allow tools to run on it)

8. Reproducible

7. Trusted (e.g. reviewed)

6. Comprehensible (description / method is available)

5. Citable

4. Discoverable (data is indexed or data is linked from article)

3. Accessible

2. Preserved (long-term & format-independent)

1. Stored (existing in some form)

5

Page 6: Open Science: Research Data Management

10. I

nteg

rate

ups

tream

and

dow

nstre

am

– m

ake

met

adat

a to

ser

ve u

se.

Funder mandates

9. Re-usable (allow tools to run on it)

8. Reproducible

7. Trusted (e.g. reviewed)

6. Comprehensible (description / method is available)

5. Citable

4. Discoverable (data is indexed or data is linked from article)

3. Accessible

2. Preserved (long-term & format-independent)

1. Stored (existing in some form)

6

Mandates are changing behavior – but only focused on the bottom of the data pyramid

Page 7: Open Science: Research Data Management

7

1. Research Data linking – linking articles to external datasets

http://www.sciencedirect.com/science/article/pii/S022352341500272X 7

Page 8: Open Science: Research Data Management

9

2. Gold OA reviewed data journals, software journals, method journals (collectively called “Research Elements”)

http://www.journals.elsevier.com/data-in-brief/

Direct submission

50%

Just publish and get credit for your data

Co-submission

50%

On top of the main article:

attract attention for your data

Increasing from 100 journals to 250+ journals in November 2015

MethodsX

Data in Brief

SoftwareX

e.g.:

Page 9: Open Science: Research Data Management

2. Gold OA journal data / research elements articles growth

• Usage is in top 25% of open access articles on ScienceDirect

0

20

40

60

80

100

120

140

160

180

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

2014

2015

Submitted

0102030405060708090

100

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

20142015

Accepted

10

0

10

20

30

40

50

60

70

80

90

100

Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

2014

2015

Non-Elsevie

competition*, predominantly: F1000Research, Nature Scientific Data, Codata, BMC Research Notes, GigaScience

Page 10: Open Science: Research Data Management

3. Development Partnership (France) – Lab Data Tool: structure in the lab www.hivebench.com

11

Page 11: Open Science: Research Data Management

https://data.mendeley.com/datasets/xz6gv65m6d/6

Linked to published papers – or not

Versioning and provenance

4. Manage, Store: Mendeley Data launched Dec 2015

Data citation (DataCite)

Researcher in control (embargo / visible)

Page 13: Open Science: Research Data Management

6. Datasets in Pure („Research Data Sets“) – Compliance!

•Register datasets and their related metadata in Pure

•Upload binary dataset files directly into Pure

•Link datasets to related projects, publications, awards and more for enhanced reporting capabilities

Provide transparency and comply with funders' requirements through the new 'Datasets' content type

Page 14: Open Science: Research Data Management

Regular Elsevier journals Mendeley data repo (researcher)

Data in the lab / ELN

Post

Other repositories Discoverability: Data/article linking program

Index

Index

15

The open RDM ecosystem: where do we stand now

Review & curate data: data journals

Elsevier

RDM Solutions

Non-Elsevier / open

Integrated

Discoverability: Data search

M&A process

Prototype

Link

Link

Page 15: Open Science: Research Data Management

Powered by ROS Social (profiles, awareness, dashboards)

Compliance (e.g. Pure)

Local institutional data repository

altmetrics, SciVal, Scopus

Scopus citations

Regular Elsevier journals Journal data repository

Mendeley data repo (researcher)

Publish

Data in the lab / ELN

Post Publish

Other publishers/journals

Other repositories

Discoverability: Data search

Discoverability: Data/article linking program

Index

Index Post data

Repository data search

Institutional data search

Domain data search

Embed

Embed

Measure outcomes (data use) 16

The open RDM ecosystem: where do we go => Focus and complexity is in the integration & coverage (and less in the individual pieces)

Review & curate data: data journals

Other lab management tools

Elsevier

RDM Solutions

Non-Elsevier / open

Institutional data repository

Integrated

Link

Link

Post data & supp material

Page 16: Open Science: Research Data Management

Questions? 10

. Int

egra

te u

pstre

am a

nd d

owns

tream

mak

e m

etad

ata

to s

erve

use

.

Save

Share

Use

9. Re-usable (allow tools to run on it)

8. Reproducible

7. Trusted (e.g. reviewed)

6. Comprehensible (description / method is available)

5. Citable

4. Discoverable (data is indexed or data is linked from article)

3. Accessible

2. Preserved (long-term & format-independent)

1. Stored (existing in some form)

17