Velterop 2 a ssp arlington may 2015

84
Big Journal Literature Big Usage Jan Velterop – SSP – Arlington, May 28, 2015

Transcript of Velterop 2 a ssp arlington may 2015

Page 1: Velterop 2 a ssp arlington may 2015

Big Journal LiteratureBig Usage

Jan Velterop – SSP – Arlington, May 28, 2015

Page 2: Velterop 2 a ssp arlington may 2015

11,135,542

More than 2 addedevery minute of 2014

Number of abstracts in PubMed

Page 3: Velterop 2 a ssp arlington may 2015

Information overload!

Page 4: Velterop 2 a ssp arlington may 2015

that Overload?

Or rapidly increasing knowledge…

…making a world of difference that can change the course of scientific thought?

Page 5: Velterop 2 a ssp arlington may 2015

The purpose of scientific communication:

Dissemination of knowledge

Page 6: Velterop 2 a ssp arlington may 2015

• maximal usefulness of scientific research results efficient, fast, and effective new knowledge creation & discovery

Optimal dissemination for

Page 7: Velterop 2 a ssp arlington may 2015

Efficient?

There’s too much and it’s impossible to read

everything, even if you have access!

Page 8: Velterop 2 a ssp arlington may 2015

Lamp post research

Page 9: Velterop 2 a ssp arlington may 2015

Looking merely at the literature that one can read – which is not necessarily all the literature that is potentially important to one’s

research

Lamp post research:

Page 10: Velterop 2 a ssp arlington may 2015
Page 11: Velterop 2 a ssp arlington may 2015
Page 12: Velterop 2 a ssp arlington may 2015
Page 13: Velterop 2 a ssp arlington may 2015
Page 14: Velterop 2 a ssp arlington may 2015
Page 15: Velterop 2 a ssp arlington may 2015
Page 16: Velterop 2 a ssp arlington may 2015
Page 17: Velterop 2 a ssp arlington may 2015
Page 18: Velterop 2 a ssp arlington may 2015
Page 19: Velterop 2 a ssp arlington may 2015
Page 20: Velterop 2 a ssp arlington may 2015
Page 21: Velterop 2 a ssp arlington may 2015
Page 22: Velterop 2 a ssp arlington may 2015

Big Usage

But not in the way we’re used to

Page 23: Velterop 2 a ssp arlington may 2015

So, what to do?

Page 24: Velterop 2 a ssp arlington may 2015

problemproblemEveryEvery has itshas its solutionsolution

Page 25: Velterop 2 a ssp arlington may 2015

Possible strategies:

1.Publish a smaller number of papers

2.Accept that an ever smaller proportion of the available papers is actually being read

3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge

Page 26: Velterop 2 a ssp arlington may 2015

Possible strategies:

1.Publish a smaller number of papersMaybe, but if it means less information, it’s

ludicrous

2.Accept that an ever smaller proportion of the available papers is actually being read

3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge

Page 27: Velterop 2 a ssp arlington may 2015

Possible strategies:

1.Publish a smaller number of papers

2.Accept that an ever smaller proportion of the available papers is actually being read

How to choose, though?

3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge

Page 28: Velterop 2 a ssp arlington may 2015
Page 29: Velterop 2 a ssp arlington may 2015
Page 30: Velterop 2 a ssp arlington may 2015

In any event:

l’embarras du choixIn any event:

l’embarras du choix

Page 31: Velterop 2 a ssp arlington may 2015

Possible strategies:

1.Publish a smaller number of papers

2.Accept that an ever smaller proportion of the available papers is actually being read

3.Capture the knowledge contained in all papers and map it in such a way that you can navigate that knowledge

Yes! Helps to see trends and what to choose!

Page 32: Velterop 2 a ssp arlington may 2015

First

create an overview…

Page 33: Velterop 2 a ssp arlington may 2015

…only then

start digging

Page 34: Velterop 2 a ssp arlington may 2015

How might we create overviews?

Page 35: Velterop 2 a ssp arlington may 2015

“As the rate of publishing accelerates, the need for computational support to work out which articles to read, and how to interpret, reproduce and validate the claims they contain is growing.”

Quote from ‘Lazarus’: http://www.bbsrc.ac.uk/pa/grants/AwardDetails.aspx?FundingReference=BB/L005298/1

Page 36: Velterop 2 a ssp arlington may 2015

Extract Key Insights

Extract Key Insights

Extract Key Insights

Extract Key Insights

Page 37: Velterop 2 a ssp arlington may 2015

Imagine you had a paper that concluded:

“On hot days, it turns out that aspirin decreases the chances of blot clots, but increases the chances of heart attack in humans; the effect wasn't observed in rats at all; simulations of dogs seem to suggest that the effect is present but independent of temperature unless the dog is accompanied by a human”

Page 38: Velterop 2 a ssp arlington may 2015

Imagine you had a paper that concluded:

“On hot dayshot days, it turns out that aspirinaspirin decreasesdecreases the chances of blot clotsblot clots, but increasesincreases the chances of heart attackheart attack in humanshumans; the effect wasn't observed in ratsrats at all; simulations of dogsdogs seem to suggest that the effect is present but independent of temperaturetemperature unless the dogdog is accompanied by a humanhuman”

Page 39: Velterop 2 a ssp arlington may 2015

Significant concepts:

[CHEMBL25] (aspirin)[EFO_0001702] ('temperature' from the experimental factors ontology)[Canis lupus familiaris][Homo sapiens][Mus musculus]

Headline Interactions (in the form of Triples):

[ASPIRIN] [DECREASES] [THROMBOSIS][ASPIRIN] [INCREASES] [MYOCARDIAL INFARCTION]

Significant concepts:

[CHEMBL25] (aspirin)[EFO_0001702] ('temperature' from the experimental factors ontology)[Canis lupus familiaris][Homo sapiens][Mus musculus]

Headline Interactions (in the form of Triples):

[ASPIRIN] [DECREASES] [THROMBOSIS][ASPIRIN] [INCREASES] [MYOCARDIAL INFARCTION]

Add this to the article’s abstract (after it’s been validated by the author):

Page 40: Velterop 2 a ssp arlington may 2015

Most efficient:If publishers were to do this (doesn’t cost much, and makes articles far more useful)

In case publishers don’t, alternative ways are being developed outside publishers’ control

Page 41: Velterop 2 a ssp arlington may 2015

publishing data in articles

Currently:

equals burying data R.I.P.R.I.P.

Page 42: Velterop 2 a ssp arlington may 2015

ocumentsVia Utopia Documents, LAZARUS ‘resurrects’

knowledge from being buried in articles:• entities (‘concepts’, incl. synonyms, e.g.

proteins)• phrases, statements, assertions (e.g. triples)• molecules (incl. Markush structure groups)• graphs• tables

http://utopiadocs.com

Page 43: Velterop 2 a ssp arlington may 2015

• entities (‘concepts’, incl. synonyms, e.g. proteins)• phrases, statements, assertions (e.g. triples)• molecules (incl. Markush structure groups)• graphs• tables

These are captured – with their provenance, e.g. DOI – in a ‘Knowledge Graph’ of their relationshipsWhen assertions are captured, they are compared to the Knowledge Graph and labelled as ‘new’ (to the Graph) or ‘already found earlier’

should be should be interesting interesting for the peer for the peer

reviewer of a reviewer of a newly newly

submitted submitted articlearticle

should be should be interesting interesting for the peer for the peer

reviewer of a reviewer of a newly newly

submitted submitted articlearticle

Page 44: Velterop 2 a ssp arlington may 2015

“Lazarus to harness the crowd reading life-science articles to resurrect the swathes of legacy data buried in charts, tables, diagrams and free-text, to liberate processable data into a shared resource that benefits the community.”

Page 45: Velterop 2 a ssp arlington may 2015

“Lazarus to harness the crowd reading life-science articles to resurrect the swathes of legacy data buried in charts, tables, diagrams and free-text, to liberate processable data into a shared resource that benefits the community.”

“…activities currently carried out anyway by individuals for their own purposes (annotating, cross-referencing articles with databases, organising collections of articles).”

Page 46: Velterop 2 a ssp arlington may 2015

“Lazarus to harness the crowd reading life-science articles to resurrect the swathes of legacy data buried in charts, tables, diagrams and free-text, to liberate processable data into a shared resource that benefits the community.”

Works on any pdf, from

Works on any pdf, from

paywalled and open sources

paywalled and open sources

alikealikeWorks on any pdf, from

Works on any pdf, from

paywalled and open sources

paywalled and open sources

alikealike

“…activities currently carried out anyway by individuals for their own purposes (annotating, cross-referencing articles with databases, organising collections of articles).”

Page 47: Velterop 2 a ssp arlington may 2015
Page 48: Velterop 2 a ssp arlington may 2015

VHL protein binds to HIF-α which is ubiquitinated and tagged for degradation in the proteasome.

Page 49: Velterop 2 a ssp arlington may 2015
Page 50: Velterop 2 a ssp arlington may 2015
Page 51: Velterop 2 a ssp arlington may 2015

‘Assertions’ and ‘significant concepts’ extracted from articles (either by the publisher or by others, like Utopia’s LAZARUS), are added to a growing ‘knowledge graph’ which can be analysed for trends, clusters, areas of intensive activity, etc.

Page 52: Velterop 2 a ssp arlington may 2015

Getting the picture from a large number of data

Page 53: Velterop 2 a ssp arlington may 2015

What we need is information extracted from as many

articles as possible

The more we have, the ‘sharper’ the knowledge

picture

Page 54: Velterop 2 a ssp arlington may 2015

Getting a better picture from even more assertions

Page 55: Velterop 2 a ssp arlington may 2015
Page 56: Velterop 2 a ssp arlington may 2015

Homing in

i.e. making the

choice what to

read in detaili.e. making the

choice what to

read in detail

Page 57: Velterop 2 a ssp arlington may 2015

It’s not just about finding information

It’s also – and possibly more –about the value & power of

‘recombinant knowledge’

Page 58: Velterop 2 a ssp arlington may 2015

BRAIN — Bio Relations And Intelligence Network

Page 59: Velterop 2 a ssp arlington may 2015
Page 60: Velterop 2 a ssp arlington may 2015
Page 61: Velterop 2 a ssp arlington may 2015
Page 62: Velterop 2 a ssp arlington may 2015

“Recombinant Knowledge”

Page 63: Velterop 2 a ssp arlington may 2015
Page 64: Velterop 2 a ssp arlington may 2015

>>>>

Page 65: Velterop 2 a ssp arlington may 2015
Page 66: Velterop 2 a ssp arlington may 2015

Once researchers have identified the articles they really need to read,

it should be made very easy to do so

Page 67: Velterop 2 a ssp arlington may 2015

Ergo, what publishers should do, too, is to make all articles

available in all formats: HTML, XML, PDF and ePub – even print, on demand.

Page 68: Velterop 2 a ssp arlington may 2015

Also on mobile devices

Page 69: Velterop 2 a ssp arlington may 2015

For instance:

Easier than you might think

Page 70: Velterop 2 a ssp arlington may 2015

(www.researchpad.co)

Page 71: Velterop 2 a ssp arlington may 2015
Page 72: Velterop 2 a ssp arlington may 2015
Page 73: Velterop 2 a ssp arlington may 2015

Build collection of favourites

Page 74: Velterop 2 a ssp arlington may 2015

Read full text

Page 75: Velterop 2 a ssp arlington may 2015

Inspect metrics

Page 76: Velterop 2 a ssp arlington may 2015

share with others

Page 77: Velterop 2 a ssp arlington may 2015
Page 78: Velterop 2 a ssp arlington may 2015
Page 80: Velterop 2 a ssp arlington may 2015

ResearchPad Launch Process

ProjectDefinition

Branding

Publishing

Go LiveTurnaround

Time - 8 weeks

Slide borrowed from:

Page 81: Velterop 2 a ssp arlington may 2015

What ResearchPad can do for publishers who want it, at no extra cost*, is to integrate a publisher’s content with anything from elsewhere that’s freely available with open access, so that this open access material can be accessed from

within the publisher’s platform

* personal communication

Page 83: Velterop 2 a ssp arlington may 2015

The End

Page 84: Velterop 2 a ssp arlington may 2015

Thank you

Jan Velterop – 28 May 2015

[email protected]