Introduction to text mining
-
Upload
lars-juhl-jensen -
Category
Documents
-
view
1.545 -
download
3
description
Transcript of Introduction to text mining
Introduction to text mining
Lars Juhl Jensen
>10 km
exponential growth
~45 seconds per paper
text mining
information retrieval
find the relevant papers
user-specified query
“yeast AND cell cycle”
entity recognition
identify the concepts
comprehensive lexicon
orthographic variation
“black list”
Reflect
augmented browsing
Pafilis, O’Donoghue, Jensen et al., Nature Biotechnology, 2009
used by publishers
information extraction
formalize the facts
co-mentioning
NLPNatural Language Processing
Gene and protein names
Cue words for entity recognition
Verbs for relation extraction
[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]]is controlled by[nxpg HAP1]
molecular networks
information on side effects
Campillos & Kuhn et al., Science, 2008
Acknowledgments
Sean O’Donoghue
Sune Frankild
Heiko Horn
Evangelos Pafilis
Michael Kuhn
Reinhardt Schneider