Building and Using Ontologies to do biology
-
Upload
robertstevens65 -
Category
Science
-
view
82 -
download
4
description
Transcript of Building and Using Ontologies to do biology
![Page 1: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/1.jpg)
Building and Using Ontologies to do biology
Robert StevensBioHealth Informatics GroupSchool of Computer Science
University of [email protected]
![Page 2: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/2.jpg)
Ernest Rutherford
“All science is either physics or stamp collecting”
Image: http://en.wikipedia.org/wiki/File:Ernest_Rutherford2.jpg
![Page 3: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/3.jpg)
The Basics
• Unless you know what you’ve got, it is not much use in analysis
• Unless you share that knowledge about what you’ve got, then other people’s data isn’t much use either
• If we can make our information more intelligent, then we can make more sophisticated analyses
![Page 4: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/4.jpg)
The Annotation Baseline
• Most uses of ontology is to supply a vocabulary for annotation
• “This is what my thing is”, “This is what my thing does”
• Get me all gene products labelled as being involved in “cell motility”
• Also gets those gene products labelled with kinds of “cell motility”
![Page 5: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/5.jpg)
What happens when we’ve annotated everything?
• Reliable querying across resources• Wherever one needs to know what something is…• Knowing how to recognise an entity is the heart of
ontology• Computational recognition of entities• Capturing some of human knowledge in computational
form• Ontological definitions are not just written in English
(though they should be this as well)• Write the definition in logical form for the computer to
use…
![Page 6: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/6.jpg)
AmiGO on “Cell motility”
![Page 7: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/7.jpg)
Associations on “Cell motility”
![Page 8: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/8.jpg)
Over expression analysis
Newrzella et al. BMC Genomics 2007 8:370 doi:10.1186/1471-2164-8-370
![Page 9: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/9.jpg)
Annotating more than gene products
• Ontologies spreading from genotype to phenotype
• Ontologies for describing the experiments that produce data and the information they produce
• All comes down to knowing what we’ve got and being able to compare
• Semantic description of bioinformatics services
![Page 10: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/10.jpg)
EMBRACE Data and Methods Ontology
(http://edamontology.sf.net)
![Page 11: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/11.jpg)
Making Deductions
• “All french people like wine”
• “Pierre is a Frenchman”
• Thus we know “Pierre likes wine”
• We’ve already seen the application of this in AmiGO
• …, but there’s a whole lot more “reasoning”
![Page 12: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/12.jpg)
The GOAL User Interface
• http://owl.cs.manchester.ac.uk/goal;• GWT application for building simple queries
against Gene Ontology Annotations (GOA);• Uses OWL API and uses HermiT reasoner;• Simply browse and “add” classes to a
conjunctive query;• Write your own DL queries (can be
dangerous…)
![Page 13: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/13.jpg)
The Gene Product Class
• Class: ‘Gene product’• SubClassOf:
• has_function some ‘molecular function’,
• participates_in some ‘biological process’,
• located_in some ‘cellular component’
![Page 14: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/14.jpg)
Two Mouse Gene Products as Primitive OWL classes
• Class: Frem1• SubClassOf:• has_function some 'sugar binding',• participates_in some ‘cell adhesion’,• located_in some ‘proteinaceous extracellular matrix’,• located_in some 'extracellular region'
• Class: Acan• SubClassOf:• has_function some 'hyaluronic acid binding',• participates_in some ‘collagen fibril organization’,• located_in some ‘proteinaceous extracellular matrix’
![Page 15: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/15.jpg)
Two Defined Classes
• Class: 'proteinaceous extracellular matrix gene product'
• EquivalentTo: • 'Gene product' and located_in some
‘proteinaceous extracellular matrix’
• Class: 'sugar binding gene product'• EquivalentTo:• 'Gene product' and has_function some 'sugar
binding'
![Page 16: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/16.jpg)
The Resulting Hierarchy
![Page 17: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/17.jpg)
Basic GOAL interface
![Page 18: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/18.jpg)
GOAL query
![Page 19: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/19.jpg)
GOAL results
![Page 20: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/20.jpg)
More complex GOAL query
![Page 21: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/21.jpg)
What does an OWL ontology look like?Class: 'Ranunculus Repens'
Annotations: [in flower] label "Ranunculus Repens", comment "Common buttercup"^^string SubClassOf: [in flower] * ActinomorphicFlower, * 'Apopetalous Flower', * 'Aposepalous Flower', * 'Heterosporangiate Flower', Flower and (hasFlowerSymmetry some 'Radial Flower Symmetry') and (hasPart some (Androecium and (hasAndroecialFusion some Apostemonous) and (hasPart some (Stamen and (hasPart some Filament) and (hasPart some (Anther and (hasAntherAttachment some 'Adnate Anther Attachment') and (hasDehiscenceType some 'Longitudinal Dehiscence'))))))) and (hasPart some (Gynoecium and (hasGynoecialFusion some Apocarpous) and (hasPart some (Pistil and (hasPart some Carpel) and (hasPart some Style) and (hasPart some (Stigma and (hasStickiness some Stickiness) and (hasStigmaShape some 'Hooked Stigma Shape'))) and (hasPart only (Carpel or Stigma or Style)))) and (hasSexualPartArrangement some 'Spiral Arrangement')))
![Page 22: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/22.jpg)
Actinomorphic Flower
Class: 'Actinomorphic flower'
Annotations:label "Actinomorphic flower"
EquivalentTo: [in flower] * 'Actinomorphic flower', Flower and (hasFlowerSymmetry some 'Radial Flower Symmetry')
![Page 23: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/23.jpg)
Who knows about biology?
• Biologists know about biology;
• Easier to turn a biologist into an ontologist, rather than a CS person to an ontologists then a biologist
• Do we need to craft axioms to deliver domain knowledge to an ontology?
![Page 24: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/24.jpg)
Can biologists write OWL?
• Yes, but why should they?
• Once we have the ontology design patterns, domain experts can fill them in
• Spreadsheets often used – tablulation of relationships between entities;
• Map contents of spreadsheets to the underlying ontology’s axioms
![Page 25: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/25.jpg)
How can we enable biologists to semantically describe data?
• Scientists like using spreadsheets (and for good reasons)
• Put the data into spreadsheets and suck it out into ontologies
• Constrain the descriptions in the spreasheet to conform to a particular description style
• Semantic Spreadsheets – RightField (http://www.rightfield.org) and Populous (http://www.populous.org)
![Page 26: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/26.jpg)
RightField Application
![Page 27: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/27.jpg)
Ontology terms for marked-up cells in drop-down boxes
The User View
![Page 28: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/28.jpg)
Kidney and Urinary Knowledge Base and Ontology
KUP KB(RDF store)
• Specialised repository of KUP related data• KUP ontology for integration, query and inference• Background knowledge for data mining experiments• Collaborative update by the community
![Page 29: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/29.jpg)
KUPKB: KUP ontology
Anatomy (MAO)Anatomy (MAO) Gene Biological processes(GO)Gene Biological processes(GO)
Cells (CTO)Cells (CTO)
part-of
participate-in
Renal proximal
tubule
Renal proximal
tubule
Proximal straight tubule
Proximal straight tubule
Proximal convoluted
tubule
Proximal convoluted
tubule
Assertion
Inference
subClassOf
Proximal tubule
epithelial cell
Proximal tubule
epithelial cell
Proximal straighttubule
epithelial cell
Proximal straighttubule
epithelial cell
Proximal convoluted
tubule epithelial cell
Proximal convoluted
tubule epithelial cell
subClassOf
part-of
Renal sodium absorption
Renal sodium absorption
Renal sodium ion absorptionRenal sodium ion absorption
participates-in
part-of
participates-in
Kidney CortexKidney Cortex
part-of
part-of
Each kidney cell is currently described by its localisation and function
![Page 30: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/30.jpg)
Populous
• Generic tool for populating ontology templates• Spreadsheet style interface• Supports validation at the point of data entry• Expressive pattern language for OWL Ontology
generation
http://www.e-lico.eu/populous
![Page 31: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/31.jpg)
KUPO cells in PopulousCell type Ontology Mouse anatomy Ontology Biological Process (GO)
![Page 32: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/32.jpg)
Connecting the graphs
GEO:028364GEO:028364
sample
Differentially expressed genes
Differentially expressed genes
observation
contains
Higgins et alHiggins et al
contributor geneid:17638geneid:17638
FaslFasl
symbol
AC18765AC18765 has:00527has:00527participates-in
Fas-ligandFas-ligand
symbol
ApoptosisApoptosis
symbol
Renal proximal
tubule
Renal proximal
tubule
Proximal straight tubule
Proximal straight tubule
Proximal convoluted
tubule
Proximal convoluted
tubule
subClassOf
Proximal tubule
epithelial cell
Proximal tubule
epithelial cell
Proximal straighttubule
epithelial cell
Proximal straighttubule
epithelial cell
Proximal convoluted
tubule epithelial cell
Proximal convoluted
tubule epithelial cell
subClassOf
part-of
Renal sodium absorption
Renal sodium absorption
Renal sodium ion absorptionRenal sodium ion absorption
participates-inpart-of
participates-in
![Page 33: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/33.jpg)
www.kupkb.org
![Page 34: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/34.jpg)
iKUP search results
![Page 35: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/35.jpg)
Doing some Biology
• With the iKUP interface biologists find things to test
• These facts already existed, but difficult to find
• Easy to find if you know to look
• Integration and presentation allows one to see things you weren’t looking for….
![Page 36: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/36.jpg)
Uses of Ontology in Bioinformatics
![Page 37: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/37.jpg)
Is all this doing biology?• Ontologies enable biology;• Ontological machinery can be used to “find new things”;• Most often through a level of indirection to a real biologist• Enabling things to be found more easily, enables hypothesis
generation to be easier• Unforeseen connections enabled through common language and
common definition• “I see these genes differentially regulated across a range of
diseases”• Can only be asked or found through common language and
common description• Can automate the application of knowledge through deduction, but
need a whole lot more…
![Page 38: Building and Using Ontologies to do biology](https://reader031.fdocuments.in/reader031/viewer/2022013011/555d04c8d8b42a08668b57ad/html5/thumbnails/38.jpg)
Acknowledgements
• Simon Jupp, Julie Klein and Joost Schanstra for KUPKB
• Carole Goble and Simon Jupp for slides
• Katy Wolstencroft, Stuart Owen, Matt Horridge for RightField
• Simon Jupp for Populous
• … and the rest of the legion