Introduction to text mining


Page 1: Introduction to text mining

Introduction to text mining

Lars Juhl Jensen

>10 km

Page 2: Introduction to text mining

exponential growth

Page 3: Introduction to text mining
Page 4: Introduction to text mining

~45 seconds per paper

Page 5: Introduction to text mining

text mining

Page 6: Introduction to text mining

information retrieval

Page 7: Introduction to text mining

find the relevant papers

Page 8: Introduction to text mining

user-specified query

Page 9: Introduction to text mining

“yeast AND cell cycle”
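
A minimal sketch of how a user-specified Boolean query such as the one above might be evaluated against a set of abstracts. The function names (`matches_all`, `search`) and the three toy abstracts are hypothetical and are not taken from the slides; the sketch only handles a flat chain of AND terms and is not how any particular search engine works.

```python
import re


def matches_all(text: str, terms: list[str]) -> bool:
    """Return True if every query term occurs in the text as a whole word or phrase."""
    lowered = text.lower()
    return all(
        re.search(r"\b" + re.escape(term.lower()) + r"\b", lowered) is not None
        for term in terms
    )


def search(docs: dict[str, str], query: str) -> list[str]:
    """Evaluate a flat 'A AND B AND ...' query and return the matching document ids."""
    terms = [t.strip() for t in query.split(" AND ") if t.strip()]
    return [doc_id for doc_id, text in docs.items() if matches_all(text, terms)]


# Hypothetical toy abstracts, made up for illustration only.
docs = {
    "d1": "Regulation of the cell cycle in budding yeast.",
    "d2": "Yeast two-hybrid screening of human kinases.",
    "d3": "Cell cycle checkpoints in mammalian cells.",
}

print(search(docs, "yeast AND cell cycle"))
```

Only the first abstract mentions both "yeast" and "cell cycle", so it is the only one retrieved. A real system would add an index, stemming and ranking on top of this.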

Page 10: Introduction to text mining
Page 11: Introduction to text mining

entity recognition

Page 12: Introduction to text mining

identify the concepts

Page 13: Introduction to text mining

comprehensive lexicon

Page 14: Introduction to text mining

orthographic variation

Page 15: Introduction to text mining

“black list”
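
The three ingredients named on the preceding slides (a comprehensive lexicon, tolerance for orthographic variation, and a black list) can be sketched as a small dictionary-based tagger. Everything here is an assumption made for illustration: the lexicon entries, the placeholder identifiers, the black-listed name "SPT" and the normalization rule are hypothetical and are not the method described in the talk.

```python
import re


def normalize(name: str) -> str:
    """Collapse case, hyphens and spaces so 'Cdc-28', 'CDC 28' and 'cdc28' coincide."""
    return re.sub(r"[\s\-]+", "", name.lower())


# Hypothetical lexicon mapping names to placeholder identifiers.
LEXICON = {
    normalize(name): ident
    for name, ident in {"CDC28": "gene:1", "HAP1": "gene:2", "SPT": "gene:3"}.items()
}

# Hypothetical black list: names judged too ambiguous to tag.
BLACKLIST = {normalize("SPT")}


def tag(text: str, max_tokens: int = 2) -> list[tuple[str, str]]:
    """Return (surface form, identifier) pairs, preferring the longest match."""
    tokens = [(m.start(), m.end()) for m in re.finditer(r"[A-Za-z0-9]+", text)]
    hits = []
    i = 0
    while i < len(tokens):
        step = 1
        for n in range(min(max_tokens, len(tokens) - i), 0, -1):
            surface = text[tokens[i][0]:tokens[i + n - 1][1]]
            key = normalize(surface)
            if key in LEXICON and key not in BLACKLIST:
                hits.append((surface, LEXICON[key]))
                step = n  # skip the tokens consumed by this match
                break
        i += step
    return hits


print(tag("Cdc-28 and HAP1 interact; SPT was excluded."))
```

The hyphenated variant "Cdc-28" resolves to the same entry as "CDC28", while "SPT" is in the lexicon but suppressed by the black list.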

Page 16: Introduction to text mining

Reflect

Page 17: Introduction to text mining

augmented browsing

Page 18: Introduction to text mining

Pafilis, O’Donoghue, Jensen et al., Nature Biotechnology, 2009

Page 19: Introduction to text mining

used by publishers

Page 20: Introduction to text mining
Page 21: Introduction to text mining

information extraction

Page 22: Introduction to text mining

formalize the facts

Page 23: Introduction to text mining

co-mentioning
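
Co-mentioning can be illustrated by counting how often two recognized entities appear in the same document. This is a rough sketch under stated assumptions: the function name `comention_counts` and the toy input are hypothetical, and the raw counts shown here would normally be followed by some scoring or statistical correction that the slides do not specify.

```python
from collections import Counter
from itertools import combinations


def comention_counts(docs_entities: list[list[str]]) -> Counter:
    """Count, for each unordered entity pair, the number of documents mentioning both."""
    counts: Counter = Counter()
    for entities in docs_entities:
        # Deduplicate within a document and sort so each pair has one canonical order.
        for a, b in combinations(sorted(set(entities)), 2):
            counts[(a, b)] += 1
    return counts


# Hypothetical output of an entity recognition step, one list per document.
docs_entities = [
    ["CYC1", "HAP1"],
    ["HAP1", "CYC7", "CYC1"],
    ["CDC28"],
]

counts = comention_counts(docs_entities)
for pair, n in counts.most_common():
    print(pair, n)
```

Pairs that are mentioned together in more documents rank higher; a document with a single entity contributes no pairs.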

Page 24: Introduction to text mining

NLP: Natural Language Processing

Page 25: Introduction to text mining

Gene and protein names

Cue words for entity recognition

Verbs for relation extraction

[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]] is controlled by [nxpg HAP1]
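
To make the idea of verbs for relation extraction concrete, here is a deliberately simplified sketch that uses a regular expression in place of the chunk-based parse shown above. It is not the NLP pipeline from the talk: the verb list, the `PASSIVE` pattern and the `extract` function are assumptions, and the gene names are taken to be already recognized.

```python
import re

# Gene names assumed to have been found by a prior entity recognition step.
GENES = {"CYC1", "CYC7", "HAP1"}

# Hypothetical passive-voice pattern: "<targets> is <verb> by <agent>".
PASSIVE = re.compile(
    r"(?P<targets>.+?)\s+is\s+"
    r"(?P<verb>controlled|regulated|repressed|activated)\s+by\s+"
    r"(?P<agent>.+)"
)


def genes_in(fragment: str) -> list[str]:
    """Return the known gene names in a text fragment, in order of appearance."""
    return [tok for tok in re.findall(r"[A-Za-z0-9]+", fragment) if tok in GENES]


def extract(sentence: str) -> list[tuple[str, str, str]]:
    """Return (agent, verb, target) triples for a passive regulation sentence."""
    m = PASSIVE.search(sentence)
    if m is None:
        return []
    return [
        (agent, m["verb"], target)
        for agent in genes_in(m["agent"])
        for target in genes_in(m["targets"])
    ]


sentence = (
    "The expression of the cytochrome genes CYC1 and CYC7 is controlled by HAP1"
)
print(extract(sentence))
```

For the example sentence this yields one triple per target gene, with HAP1 as the regulator of both CYC1 and CYC7. A pattern like this breaks on negation, coordination across clauses and active-voice phrasing, which is part of why a proper syntactic analysis is used instead.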

Page 26: Introduction to text mining

molecular networks

Page 27: Introduction to text mining
Page 28: Introduction to text mining

information on side effects

Page 29: Introduction to text mining

Campillos & Kuhn et al., Science, 2008

Page 30: Introduction to text mining

Acknowledgments

Sean O’Donoghue

Sune Frankild

Heiko Horn

Evangelos Pafilis

Michael Kuhn

Reinhardt Schneider