Network biology: Large-scale data and text mining
-
Upload
lars-juhl-jensen -
Category
Science
-
view
167 -
download
1
description
Transcript of Network biology: Large-scale data and text mining
![Page 1: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/1.jpg)
Network biologyLarge-scale data and text
mining
Lars Juhl Jensen
![Page 2: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/2.jpg)
association networks
![Page 3: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/3.jpg)
guilt by association
![Page 4: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/4.jpg)
![Page 5: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/5.jpg)
biological systems
![Page 6: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/6.jpg)
protein networks
![Page 7: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/7.jpg)
STRING
![Page 8: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/8.jpg)
1100+ genomes
![Page 9: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/9.jpg)
computational predictions
![Page 10: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/10.jpg)
gene fusion
![Page 11: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/11.jpg)
Korbel et al., Nature Biotechnology, 2004
![Page 12: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/12.jpg)
gene neighborhood
![Page 13: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/13.jpg)
Korbel et al., Nature Biotechnology, 2004
![Page 14: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/14.jpg)
phylogenetic profiles
![Page 15: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/15.jpg)
Korbel et al., Nature Biotechnology, 2004
![Page 16: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/16.jpg)
experimental data
![Page 17: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/17.jpg)
gene coexpression
![Page 18: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/18.jpg)
![Page 19: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/19.jpg)
protein interactions
![Page 20: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/20.jpg)
Jensen & Bork, Science, 2008
![Page 21: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/21.jpg)
curated knowledge
![Page 22: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/22.jpg)
complexes
![Page 23: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/23.jpg)
pathways
![Page 24: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/24.jpg)
Letunic & Bork, Trends in Biochemical Sciences, 2008
![Page 25: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/25.jpg)
many databases
![Page 26: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/26.jpg)
different formats
![Page 27: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/27.jpg)
different identifiers
![Page 28: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/28.jpg)
variable quality
![Page 29: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/29.jpg)
not comparable
![Page 30: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/30.jpg)
not same species
![Page 31: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/31.jpg)
hard work
![Page 32: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/32.jpg)
(Ph.D. students)
![Page 33: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/33.jpg)
common identifiers
![Page 34: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/34.jpg)
quality scores
![Page 35: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/35.jpg)
von Mering et al., Nucleic Acids Research, 2005
![Page 36: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/36.jpg)
score calibration
![Page 37: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/37.jpg)
von Mering et al., Nucleic Acids Research, 2005
![Page 38: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/38.jpg)
homology-based transfer
![Page 39: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/39.jpg)
Franceschini et al., Nucleic Acids Research, 2013
![Page 40: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/40.jpg)
missing most of the data
![Page 41: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/41.jpg)
text mining
![Page 42: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/42.jpg)
>10 km
![Page 43: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/43.jpg)
too much to read
![Page 44: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/44.jpg)
computer
![Page 45: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/45.jpg)
as smart as a dog
![Page 46: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/46.jpg)
teach it specific tricks
![Page 47: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/47.jpg)
![Page 48: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/48.jpg)
![Page 49: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/49.jpg)
named entity recognition
![Page 50: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/50.jpg)
comprehensive lexicon
![Page 51: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/51.jpg)
CDC2
![Page 52: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/52.jpg)
cyclin dependent kinase 1
![Page 53: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/53.jpg)
expansion rules
![Page 54: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/54.jpg)
hCdc2
![Page 55: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/55.jpg)
CDC2
![Page 56: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/56.jpg)
flexible matching
![Page 57: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/57.jpg)
cyclin-dependent kinase 1
![Page 58: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/58.jpg)
cyclin dependent kinase 1
![Page 59: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/59.jpg)
“black list”
![Page 60: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/60.jpg)
SDS
![Page 61: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/61.jpg)
co-mentioning
![Page 62: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/62.jpg)
counting
![Page 63: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/63.jpg)
within documents
![Page 64: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/64.jpg)
within paragraphs
![Page 65: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/65.jpg)
within sentences
![Page 66: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/66.jpg)
natural language processing
![Page 67: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/67.jpg)
Gene and protein namesCue words for entity recognitionVerbs for relation extraction
[nxexpr The expression of [nxgene the cytochrome genes [nxpg CYC1 and CYC7]]]is controlled by[nxpg HAP1]
![Page 68: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/68.jpg)
text corpus
![Page 69: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/69.jpg)
~2 million full-text articles
![Page 70: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/70.jpg)
~22 million abstracts
![Page 71: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/71.jpg)
wait there’s more
![Page 72: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/72.jpg)
general approach
![Page 73: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/73.jpg)
curated knowledge
![Page 74: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/74.jpg)
experimental data
![Page 75: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/75.jpg)
text mining
![Page 76: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/76.jpg)
computational predictions
![Page 77: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/77.jpg)
common identifiers
![Page 78: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/78.jpg)
quality scores
![Page 79: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/79.jpg)
score calibration
![Page 80: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/80.jpg)
visualization
![Page 81: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/81.jpg)
protein networks
![Page 82: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/82.jpg)
string-db.org
![Page 83: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/83.jpg)
chemical networks
![Page 84: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/84.jpg)
stitch-db.org
![Page 85: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/85.jpg)
subcellular localization
![Page 86: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/86.jpg)
compartments.jensenlab.org
![Page 87: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/87.jpg)
tissue expression
![Page 88: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/88.jpg)
tissues.jensenlab.org
![Page 89: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/89.jpg)
temporal expression
![Page 90: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/90.jpg)
de Lichtenberg, Jensen et al., Science, 2005
![Page 91: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/91.jpg)
disease associations
![Page 92: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/92.jpg)
medical informatics
![Page 93: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/93.jpg)
electronic health records
![Page 94: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/94.jpg)
drugs
![Page 95: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/95.jpg)
side effects
![Page 96: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/96.jpg)
biodiversity informatics
![Page 97: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/97.jpg)
British Heritage Library
![Page 98: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/98.jpg)
organisms
![Page 99: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/99.jpg)
environments
![Page 100: Network biology: Large-scale data and text mining](https://reader036.fdocuments.in/reader036/viewer/2022062514/558e71191a28ab5e638b46a9/html5/thumbnails/100.jpg)
AcknowledgmentsSTRING and
STITCHMichael Kuhn
Damian SzklarczykAndrea Franceschini
Milan SimonovicAlexander RothSune Pletscher-
FrankildJianyi Lin
Pablo MinguezChristian von Mering
Peer Bork
Localization and diseaseSune Pletscher-FrankildAlberto SantosJanos BinderKalliopi TsafouChristian StolteAlbert PallejaHeiko HornEvangelos PafilisReinhardt SchneiderSean O’ Donoghue