Bioinformatics, Translational Bioinformatics, Personalized Medicine
Bioinformatics presentation to students University of Minho
-
Upload
introfini -
Category
Technology
-
view
464 -
download
3
Transcript of Bioinformatics presentation to students University of Minho
![Page 1: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/1.jpg)
PROTOFILWWA computational platform for the analysis of the relationships between
microorganisms and environmental parameters in activated sludge plants
José Fernandes
Bioinformatics Master Thesis
Prof. Anália Lourenço
Prof. Ana Nicolau
![Page 2: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/2.jpg)
System requirements
• Insertion and retrieval of data has to be done quickly and easily
• Should be possible to export the data so it can be analyzed with other informatics
systems
• Should support statistical assessments
• Have user-friendly visualization capabilities
• Controlled access to data, based on user roles, accounting for data privacy issues
• Easy dissemination of related studies and results
• Always online (web-based)
• Help finding additional information about the microorganisms present in the biological
samples
![Page 3: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/3.jpg)
Overview of the workflow of field and lab work
PROTOFILWWPROTOFILWW
![Page 4: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/4.jpg)
1.635 lines x 137 columns
![Page 5: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/5.jpg)
ProtoFilWW system major components
1. Content Management component: supports the
researchers managing and analyzing the data obtained
from the WWTP’s samples
2. Text Mining component: finding additional information
about the microorganisms present in the biological
samples
![Page 6: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/6.jpg)
High-level integration perspective of ProtoFilWW
Drupal core
PLUGINS
Import data
Reports Access control
Other services...
PROTOFILWW
SQL
XLS, TXT, CSV
Export dataXLS, TXT, CSV Solr/LuceneViews Solr Backend
Views
XML
Relational Database
UIMA
![Page 7: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/7.jpg)
Contend Management component
• Open source Content Management System (CMS) and
Framework (CMF)
• Highly modular and with high extensibility
• Built in the PHP scripting language
![Page 8: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/8.jpg)
WWTP Sample
1. Filamentous bacteria
2. Protozoa
3. Metazoa
4. Physical-chemical
5. Sample characterization
![Page 9: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/9.jpg)
User roles
use case visitors collaborators WWTP researchers administrators
Find studies and results x x x x
Contact researchers x x x
Analysis of available data x
Data insertion x x
Creation of reports x
Export data x
Managing users x
Backup data x
Text Mining x x x x
![Page 10: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/10.jpg)
![Page 11: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/11.jpg)
![Page 12: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/12.jpg)
![Page 13: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/13.jpg)
![Page 14: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/14.jpg)
![Page 15: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/15.jpg)
![Page 16: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/16.jpg)
![Page 17: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/17.jpg)
Dynamic reporting and charting
Reports creation Reports display
![Page 18: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/18.jpg)
Geolocation of the WWTPs
Address geocoding Map display
![Page 19: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/19.jpg)
Text Mining
componentListing the species
mentioned in a
document
![Page 20: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/20.jpg)
Major Text Mining technologies used
• Lucene is a high-performance text search engine
library.
• Solr is a standalone enterprise search server with a
REST-like API
• UIMA is a powerful infrastructure for the storage,
transport, and retrieval of document and annotation
knowledge accumulated in NLP pipeline systems
• LINNAEUS is a popular organism name identification
system for biomedical literature that is capable of
normalizing to unambiguous NCBI taxonomy identifiers
![Page 21: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/21.jpg)
Text Mining process in ProtoFilWW
Solr/Lucene
LINNAEUS
Solr UIMA
PMC Open Access SubsetPMC Open Access Subset Solr XML documentsSolr XML documents
XPath convertion
![Page 22: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/22.jpg)
Solr LINNAEUS Annotator
UIMA Component Descriptor Editor plugin
UIMA type system for LINNAEUS
![Page 23: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/23.jpg)
LINNAEUS UIMA wrapper running on CVD
![Page 24: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/24.jpg)
Drupal Views Solr Backend
![Page 25: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/25.jpg)
Major contributions
1. The Web-based computational system
www.protofilww.org
2. The Drupal module Views Solr Backend
3. The Solr UIMA plug-in for LINNAEUS Annotator
![Page 29: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/29.jpg)
![Page 30: Bioinformatics presentation to students University of Minho](https://reader033.fdocuments.in/reader033/viewer/2022052900/555e9756d8b42a0d738b522e/html5/thumbnails/30.jpg)
Preventive Medicine
Alert the user to the risk of Type 2 Diabetes.
How?
1. We know the user has a gene mutation associated with Type 2
Diabetes, because he gave us is genome!
2. We know what he has eaten, because he told us!
3. We know what exercise he’s been doing, because he told us!
4. Genehome connects the dots!