A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight...
-
Upload
cuthbert-long -
Category
Documents
-
view
213 -
download
0
Transcript of A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight...
![Page 1: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/1.jpg)
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
Ali Hasnain et. al
Insight Center for Data Analytics National University of Ireland, Galway
![Page 2: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/2.jpg)
Agenda
• Motivation• Linked Life Sciences Roadmap• Cataloguing and Linking• Extending Catalogue – Metadata &
Provenance• Query Engine• Results
![Page 3: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/3.jpg)
Motivation
• Biomedical Data is heterogeneous and spread across multiple sources (SPARQL endpoints).
• Navigation is a challenge.
• Containing trillions of triples and represented with insufficient vocabulary reuse.
• Biologists sometimes want to get more information regarding the data including its source, creator, publisher and also statistics with respect to its size (Metadata & Provenance).
3
![Page 4: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/4.jpg)
How to deal heterogeneous data?
DrugBank
DailyMed
CheBI, KEGG
Reactome
Sider
BioPax
Medicare
![Page 5: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/5.jpg)
We want to query the content, not the source
Proteins
Molecules
Genes
Diseases
![Page 6: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/6.jpg)
A Linked Life Sciences Roadmap
Proteins
Molecules
Genes
Diseases
:Protein:Molecule
:Gene
:Disease
Uniprot
PDB
Pfam PROSITE
ProDom
UnirefUniPark DailymedDrug
Bank ChemBL
PubChem KEGG
Gene Ontology
GeneID
Affymetrix
Homogene
MGI
Diseasome
SIDER
![Page 7: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/7.jpg)
2- Possible Solutions
• To assemble queries over multiple graphs at multiple endpoints, either:
• vocabularies and ontologies are reused, Or • translation maps between different terminologies
are created (“a posteriori integration”)
![Page 8: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/8.jpg)
a-priori v.s a-posteriori Integration
8
![Page 9: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/9.jpg)
Cataloguing and Linking
9
![Page 10: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/10.jpg)
Describing DataSets- an Extract from Catalogue
![Page 11: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/11.jpg)
Extending Catalogue – Metadata & Provenance
![Page 12: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/12.jpg)
![Page 13: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/13.jpg)
![Page 14: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/14.jpg)
Query Engine
http://srvgal86.deri.ie:8000/graph/Granatum
![Page 15: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/15.jpg)
Visual & Graphical View
![Page 16: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/16.jpg)
SPARQL Endpoints returning results per query
![Page 17: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/17.jpg)
Runtimes taken by different queries (Max, Min, Average, Median)
![Page 18: A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,](https://reader035.fdocuments.in/reader035/viewer/2022081519/56649f465503460f94c68bb6/html5/thumbnails/18.jpg)