Literature mining: what is it, and should I care?

60
Literature mining

description

EMBL Lab Day, European Molecular Biology Laboratory, Heidelberg, Germany, June 10, 2008

Transcript of Literature mining: what is it, and should I care?

Page 1: Literature mining: what is it, and should I care?

Literature mining

Page 2: Literature mining: what is it, and should I care?

Explosion

Page 3: Literature mining: what is it, and should I care?

exponential increase

Page 4: Literature mining: what is it, and should I care?
Page 5: Literature mining: what is it, and should I care?
Page 6: Literature mining: what is it, and should I care?

some things never change

Page 7: Literature mining: what is it, and should I care?
Page 8: Literature mining: what is it, and should I care?

“graph calculus”

Page 9: Literature mining: what is it, and should I care?

=

Page 10: Literature mining: what is it, and should I care?

~50 seconds per paper

Page 11: Literature mining: what is it, and should I care?

Information retrieval

Page 12: Literature mining: what is it, and should I care?

find the relevant papers

Page 13: Literature mining: what is it, and should I care?

ad hoc retrieval

Page 14: Literature mining: what is it, and should I care?

user-specified query

Page 15: Literature mining: what is it, and should I care?

“yeast AND cell cycle”

Page 16: Literature mining: what is it, and should I care?

stemming

Page 17: Literature mining: what is it, and should I care?

yeast / yeasts

Page 18: Literature mining: what is it, and should I care?

dynamic query expansion

Page 19: Literature mining: what is it, and should I care?

yeast / S. cerevisiae

Page 20: Literature mining: what is it, and should I care?

ranking

Page 21: Literature mining: what is it, and should I care?
Page 22: Literature mining: what is it, and should I care?
Page 23: Literature mining: what is it, and should I care?
Page 24: Literature mining: what is it, and should I care?
Page 25: Literature mining: what is it, and should I care?
Page 26: Literature mining: what is it, and should I care?
Page 27: Literature mining: what is it, and should I care?
Page 28: Literature mining: what is it, and should I care?
Page 29: Literature mining: what is it, and should I care?

Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1

hyperphosphorylation and degradation

Page 30: Literature mining: what is it, and should I care?

no tool will find it

Page 31: Literature mining: what is it, and should I care?

Entity recognition

Page 32: Literature mining: what is it, and should I care?

identify the substance(s)

Page 33: Literature mining: what is it, and should I care?

Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1

hyperphosphorylation and degradation

Page 34: Literature mining: what is it, and should I care?

good synonyms list

Page 35: Literature mining: what is it, and should I care?

orthographic variation

Page 36: Literature mining: what is it, and should I care?

CDC28

Page 37: Literature mining: what is it, and should I care?

Cdc28p

Page 38: Literature mining: what is it, and should I care?

disambiguation

Page 39: Literature mining: what is it, and should I care?

Cdc2

Page 40: Literature mining: what is it, and should I care?

APC

Page 41: Literature mining: what is it, and should I care?
Page 42: Literature mining: what is it, and should I care?
Page 43: Literature mining: what is it, and should I care?
Page 44: Literature mining: what is it, and should I care?
Page 45: Literature mining: what is it, and should I care?

still too much to read

Page 46: Literature mining: what is it, and should I care?

Information extraction

Page 47: Literature mining: what is it, and should I care?

formalize the facts

Page 48: Literature mining: what is it, and should I care?

co-mentioning

Page 49: Literature mining: what is it, and should I care?

NLPNatural Language Processing

Page 50: Literature mining: what is it, and should I care?

Mitotic cyclin (Clb2)-bound Cdc28 (Cdk1 homolog) directly phosphorylated Swe1 and this modification served as a priming step to promote subsequent Cdc5-dependent Swe1

hyperphosphorylation and degradation

Page 51: Literature mining: what is it, and should I care?

database

Page 52: Literature mining: what is it, and should I care?
Page 53: Literature mining: what is it, and should I care?

integration

Page 54: Literature mining: what is it, and should I care?
Page 55: Literature mining: what is it, and should I care?
Page 56: Literature mining: what is it, and should I care?
Page 57: Literature mining: what is it, and should I care?
Page 58: Literature mining: what is it, and should I care?

STRING & STITCH

Page 59: Literature mining: what is it, and should I care?
Page 60: Literature mining: what is it, and should I care?

Acknowledgments

STRING & STITCH– Christian von Mering

– Michael Kuhn

– Manuel Stark

– Samuel Chaffron

– Philippe Julien

– Tobias Doerks

– Jan Korbel

– Berend Snel

– Martijn Huynen

– Peer Bork

The movie “Brazil”

Reflect– Evangelos Pafilis

– Michael Kuhn

– Heiko Horn

– Peer Bork

– Sean O’Donoghue

– Reinhardt Schneider

NLP pipeline– Jasmin Saric

– Rossitza Ouzounova

– Isabel Rojas

– Peer Bork