Literature mining: what is it, and should I care?
lars-juhl-jensen
Literature mining
Explosion
exponential increase
some things never change
“graph calculus”
=
~50 seconds per paper
Information retrieval
find the relevant papers
ad hoc retrieval
user-specified query
“yeast AND cell cycle”
stemming
yeast / yeasts
dynamic query expansion
yeast / S. cerevisiae
ranking
Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1 hyperphosphorylation and degradation
no tool will find it
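The retrieval steps above (user-specified query, stemming, query expansion, ranking) can be sketched in a few lines. This is a minimal illustration, not how any particular search engine works: the expansion table, the crude plural-stripping stemmer, the hit-count ranking and the two short documents are all assumptions made up for the example. Only the third document is the sentence quoted on the slide.

```python
import re

# Hypothetical expansion table (toy data): query term -> variants that count as a hit.
EXPANSIONS = {
    "yeast": ["yeast", "S. cerevisiae", "Saccharomyces cerevisiae"],
    "cell cycle": ["cell cycle"],
}

def stem(token):
    """Crude stemmer: strip a plural 's' (illustration only, not Porter stemming)."""
    if len(token) > 3 and token.endswith("s") and not token.endswith("ss"):
        return token[:-1]
    return token

def tokens(text):
    """Lowercase, split on non-alphanumerics, and stem each token."""
    return [stem(t) for t in re.findall(r"[a-z0-9]+", text.lower())]

def count_phrase(doc_tokens, phrase):
    """Count how often the stemmed phrase occurs as a run of consecutive tokens."""
    p = tokens(phrase)
    return sum(doc_tokens[i:i + len(p)] == p
               for i in range(len(doc_tokens) - len(p) + 1))

def score(doc, query_terms):
    """Return 0 unless every term (or a variant) occurs; otherwise the total hit count."""
    doc_tokens = tokens(doc)
    hits = [sum(count_phrase(doc_tokens, v) for v in EXPANSIONS.get(term, [term]))
            for term in query_terms]
    return sum(hits) if all(hits) else 0

def search(docs, query_terms):
    """Boolean AND retrieval, ranked by descending hit count."""
    scored = [(score(d, query_terms), d) for d in docs]
    return [d for s, d in sorted(scored, key=lambda x: -x[0]) if s > 0]

DOCS = [
    # Made-up toy documents for the example.
    "Yeasts progress through the cell cycle in distinct phases.",
    "S. cerevisiae cell cycle regulation: the cell cycle is driven by cyclins.",
    # The sentence from the slide: relevant, but contains neither query term.
    "Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 "
    "and this modification served as a priming step to promote subsequent "
    "Cdc5-dependent Swe1 hyperphosphorylation and degradation",
]

results = search(DOCS, ["yeast", "cell cycle"])
for doc in results:
    print(doc)
```

Stemming lets "Yeasts" match "yeast", and expansion lets "S. cerevisiae" match "yeast", yet the Swe1 sentence is still not retrieved: it names only genes, so no amount of query-side processing on "yeast AND cell cycle" reaches it.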
Entity recognition
identify the substance(s)
Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1 hyperphosphorylation and degradation
good synonyms list
orthographic variation
CDC28
Cdc28p
disambiguation
Cdc2
APC
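A dictionary-based tagger shows how the three ingredients above fit together: a synonyms list, folding of orthographic variants, and flagging of ambiguous names. This is a sketch under stated assumptions, not the method behind any specific tool. The synonym table and the ambiguity table are small hand-written toy data, and the normalization handles only case and the trailing "p" protein suffix.

```python
import re

# Toy synonyms list (assumption for illustration): canonical name -> other names.
SYNONYMS = {
    "CDC28": ["Cdk1"],
    "CLB2": [],
    "SWE1": [],
    "CDC5": [],
}

# Toy ambiguity table (assumption): names with more than one plausible meaning.
AMBIGUOUS = {
    "APC": ["anaphase-promoting complex", "adenomatous polyposis coli"],
    "Cdc2": ["S. pombe cdc2 (Cdk1)", "S. cerevisiae CDC2 (POL3)"],
}

def normalize(name):
    """Fold orthographic variants: ignore case and a trailing protein suffix 'p'."""
    key = name.lower()
    if re.fullmatch(r"[a-z]{3}\d+p", key):
        key = key[:-1]  # Cdc28p -> cdc28
    return key

# Lookup from normalized name to a list of candidate identifiers.
LOOKUP = {}
for canonical, names in SYNONYMS.items():
    for name in [canonical] + names:
        LOOKUP[normalize(name)] = [canonical]
for name, meanings in AMBIGUOUS.items():
    LOOKUP[normalize(name)] = meanings

def tag(text):
    """Return (mention, candidates) pairs; more than one candidate means ambiguous."""
    found = []
    for match in re.finditer(r"[A-Za-z0-9]+", text):
        candidates = LOOKUP.get(normalize(match.group()))
        if candidates:
            found.append((match.group(), candidates))
    return found

SENTENCE = ("Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated "
            "Swe1 and this modification served as a priming step to promote subsequent "
            "Cdc5-dependent Swe1 hyperphosphorylation and degradation")

for mention, candidates in tag(SENTENCE):
    print(mention, "->", candidates)
```

Because both the dictionary and the text go through the same `normalize` function, "CDC28", "Cdc28" and "Cdc28p" all resolve to one entry. Names such as "APC" are returned with several candidates instead of being resolved, since choosing among them needs context that a plain dictionary lookup does not have.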
still too much to read
Information extraction
formalize the facts
co-mentioning
NLP (Natural Language Processing)
Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1 hyperphosphorylation and degradation
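Co-mentioning, the simpler of the two extraction approaches named above, amounts to counting how often two entities appear in the same text unit. The sketch below is an assumption-laden toy: the entity table is hand-written, each document is treated as one unit, and the second and third documents are invented for the example. Real systems typically weight such counts rather than use them raw.

```python
import re
from collections import Counter
from itertools import combinations

# Toy entity table (assumption): lowercase token -> canonical name.
ENTITIES = {"clb2": "CLB2", "cdc28": "CDC28", "swe1": "SWE1", "cdc5": "CDC5"}

def entities_in(text):
    """Return the set of known entities mentioned in the text."""
    return {ENTITIES[t] for t in re.findall(r"[a-z0-9]+", text.lower())
            if t in ENTITIES}

def comention_counts(docs):
    """Count, for each unordered entity pair, the number of documents mentioning both."""
    counts = Counter()
    for doc in docs:
        for pair in combinations(sorted(entities_in(doc)), 2):
            counts[pair] += 1
    return counts

DOCS = [
    # The sentence from the slide.
    "Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 "
    "and this modification served as a priming step to promote subsequent "
    "Cdc5-dependent Swe1 hyperphosphorylation and degradation",
    # Made-up toy sentences for the example.
    "Swe1 is degraded in a Cdc5-dependent manner.",
    "Clb2 binds Cdc28.",
]

counts = comention_counts(DOCS)
for pair, n in counts.most_common():
    print(pair, n)
```

The counts say that two entities are associated, but not how: the pair (CDC28, SWE1) is found, while the fact that Cdc28 phosphorylates Swe1, and not the reverse, is lost. Recovering the type and direction of the relation is what the NLP approach adds.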
database
integration
STRING & STITCH
Acknowledgments
STRING & STITCH
– Christian von Mering
– Michael Kuhn
– Manuel Stark
– Samuel Chaffron
– Philippe Julien
– Tobias Doerks
– Jan Korbel
– Berend Snel
– Martijn Huynen
– Peer Bork
The movie “Brazil”
Reflect
– Evangelos Pafilis
– Michael Kuhn
– Heiko Horn
– Peer Bork
– Sean O’Donoghue
– Reinhardt Schneider
NLP pipeline
– Jasmin Saric
– Rossitza Ouzounova
– Isabel Rojas
– Peer Bork