GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.
![Page 1: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/1.jpg)
GO: The Gene Ontology
Pascale Gaudet
dictyBase curator
Northwestern University,
Chicago, IL
![Page 2: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/2.jpg)
Outline
1. Introduction to the Gene Ontology
2. Gene Ontology annotations
3. Editing the Gene Ontology
4. Practical applications for the Gene Ontology
5. The Gene Ontology as one of many biological ontologies
![Page 3: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/3.jpg)
Sequence databases: GenBank, EMBL, DDBJ

| Year | Number of records |
| --- | --- |
| 1982 | 602 |
| 2005 | 44,202,133 |
![Page 4: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/4.jpg)
Genome Databases
• Mouse Genome Informatics *
• FlyBase: Drosophila *
• WormBase: C. elegans *
• The Arabidopsis Information Resource *
• dictyBase: Dictyostelium discoideum *
• Saccharomyces Genome Database: Budding Yeast *
• ZFIN: Zebrafish *
• EcoGene: E. coli *
• GeneCards
• Human Ensembl
• NCBI human genome resources
* manually curated by scientists
![Page 5: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/5.jpg)
![Page 6: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/6.jpg)
Published Literature
• PubMed: over 15 million citations
• Basic search: rad51 → 1038 articles
• Limit search: rad51, Human (organism) → 485 articles
• Boolean operators: rad51 AND cancer → 234 articles
![Page 7: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/7.jpg)
Gene Ontology
- Gene annotation system
- Controlled vocabulary that can be applied to all organisms
- Used to describe gene products
![Page 8: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/8.jpg)
What’s in a name?
• What is a cell?
![Page 9: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/9.jpg)
Cell
![Page 10: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/10.jpg)
Cell
![Page 11: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/11.jpg)
Cell
![Page 12: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/12.jpg)
Cell
![Page 13: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/13.jpg)
Cell
Image from http://microscopy.fsu.edu
![Page 14: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/14.jpg)
What’s in a name?
• The same name can be used to describe different concepts
![Page 15: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/15.jpg)
What’s in a name?
![Page 16: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/16.jpg)
What’s in a name?
• Glucose synthesis
• Glucose biosynthesis
• Glucose formation
• Glucose anabolism
• Gluconeogenesis
• All refer to the process of making glucose from simpler components
![Page 17: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/17.jpg)
What’s in a name?
• The same name can be used to describe different concepts
• A concept can be described using different names
Comparison is difficult – in particular across species or across databases
![Page 18: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/18.jpg)
What is the Gene Ontology?
A (part of the) solution:
- A controlled vocabulary that can be applied to all organisms
- Used to describe gene products - proteins and RNA - in any organism
![Page 19: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/19.jpg)
Ontology
• In philosophy, the most fundamental branch of metaphysics. It studies being or existence as well as the basic categories thereof—trying to find out what entities and what types of entities exist. – Wikipedia
• Ontologies provide controlled, consistent vocabularies to describe concepts and relationships, thereby enabling knowledge sharing – Gruber 1993
![Page 20: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/20.jpg)
Ontology
Includes:
1. A vocabulary of terms (names for concepts)
2. Definitions
3. Defined logical relationships to each other
![Page 21: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/21.jpg)
Ontology Structure
Ontologies can be represented as graphs, where the nodes are connected by edges
• Nodes = concepts in the ontology
• Edges = relationships between the concepts
[Figure: a small graph with labeled nodes and an edge]
![Page 22: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/22.jpg)
Ontology Structure
• The Gene Ontology is structured as a hierarchical directed acyclic graph (DAG)
• Terms can have more than one parent and zero, one or more children
• Terms are linked by two relationships
– is-a
– part-of
![Page 23: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/23.jpg)
| Simple hierarchies (Trees) | Directed Acyclic Graphs |
| --- | --- |
| Single parent | One or more parents |
![Page 24: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/24.jpg)
Directed Acyclic Graphs (DAG)
[Figure: a DAG with is-a and part-of edges. Nodes: protein complex, organelle, [other protein complexes], [other organelles], mitochondrion, fatty acid beta-oxidation multienzyme complex]
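A DAG like the one in this figure can be stored as a list of typed edges. The sketch below is only an illustration: the exact edges are assumed from the node names on the slide (the figure itself is not recoverable here), and `EDGES`, `build_parents` and `ancestors` are hypothetical names.

```python
from collections import defaultdict

# (child, relationship, parent). These edges are assumptions made for
# illustration; they are not taken from a GO release.
EDGES = [
    ("mitochondrion", "is_a", "organelle"),
    ("fatty acid beta-oxidation multienzyme complex", "is_a", "protein complex"),
    ("fatty acid beta-oxidation multienzyme complex", "part_of", "mitochondrion"),
]


def build_parents(edges):
    """Index the edge list as child -> [(relationship, parent), ...]."""
    parents = defaultdict(list)
    for child, relationship, parent in edges:
        parents[child].append((relationship, parent))
    return parents


def ancestors(term, parents):
    """Return every term reachable by following edges upward from `term`."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        for _relationship, parent in parents.get(current, []):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


PARENTS = build_parents(EDGES)
```

Unlike a tree, the complex here has two parents, so a walk upward from it reaches both the protein complex branch and the organelle branch.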
![Page 25: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/25.jpg)
True Path Rule
• The path from a child term all the way up to its top-level parent(s) must always be true
[Figure: example paths with is-a and part-of edges among cell, cytoplasm, chromosome, nucleus and nuclear chromosome]
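One practical consequence of the true path rule is that an annotation to a child term also implies every term above it. The sketch below shows that propagation; the parent links are assumed from the terms named on this slide, and the gene name used in the usage note is hypothetical.

```python
# child -> list of parents (is-a and part-of edges are treated alike here).
# The links are assumptions for illustration only.
PARENTS = {
    "nuclear chromosome": ["chromosome", "nucleus"],
    "chromosome": ["cell"],
    "nucleus": ["cell"],
    "cytoplasm": ["cell"],
    "cell": [],
}


def implied_terms(term, parents=PARENTS):
    """Return the term itself plus every ancestor on any path to the top."""
    result = {term}
    for parent in parents[term]:
        result |= implied_terms(parent, parents)
    return result


def propagate(annotations, parents=PARENTS):
    """Expand each gene's direct annotations to all implied terms."""
    return {
        gene: set().union(*(implied_terms(term, parents) for term in terms))
        for gene, terms in annotations.items()
    }
```

For example, `propagate({"geneA": ["nuclear chromosome"]})` would also place `geneA` under chromosome, nucleus and cell. If any of those implied statements were false for some gene product, the term arrangement would break the rule.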
![Page 26: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/26.jpg)
How does GO work?
What information might we want to capture about a gene product?
• What does the gene product do?
• Why does it perform these activities?
• Where does it act?
![Page 27: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/27.jpg)
GO: Three ontologies
• Molecular Function: What does it do?
• Biological Process: What processes is it involved in?
• Cellular Component: Where does it act?
[Figure: a gene product at the center, linked to the three ontologies]
![Page 28: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/28.jpg)
Cellular Component
• where a gene product acts
![Page 29: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/29.jpg)
Mitochondrial membrane
![Page 30: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/30.jpg)
Biological Process
![Page 31: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/31.jpg)
Gluconeogenesis
![Page 32: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/32.jpg)
Molecular Function
• A single reaction or activity, not a gene product
• A gene product may have several functions
• Sets of functions make up a biological process
![Page 33: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/33.jpg)
Molecular Function
![Page 34: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/34.jpg)
Carbonate dehydratase activity
![Page 35: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/35.jpg)
What’s in a GO term?
term: gluconeogenesis
id: GO:0006094
definition: The formation of glucose from noncarbohydrate precursors, such as pyruvate, amino acids and glycerol.
![Page 36: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/36.jpg)
![Page 37: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/37.jpg)
Content of GO
As of October 2005

| Ontology | Terms |
| --- | --- |
| Molecular Function | 7,309 |
| Biological Process | 10,041 |
| Cellular Component | 1,629 |
| Total | 18,975 |

Definitions: 94.9 %
Obsolete terms: 992
![Page 38: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/38.jpg)
Outline
1. Introduction to the Gene Ontology
2. Gene Ontology annotations
3. Editing the Gene Ontology
4. Practical applications for the Gene Ontology
5. The Gene Ontology as one of many biological ontologies
![Page 39: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/39.jpg)
Mitochondrial P450
Annotation of gene products with GO terms
![Page 40: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/40.jpg)
Cellular component: mitochondrial inner membrane GO:0005743
Biological process: electron transport GO:0006118
Molecular function: monooxygenase activity GO:0004497
substrate + O2 = CO2 + H2O product
![Page 41: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/41.jpg)
Other gene products annotated to monooxygenase activity (GO:0004497)
- monooxygenase, DBH-like 1 (mouse)
- prostaglandin I2 (prostacyclin) synthase (mouse)
- flavin-containing monooxygenase (yeast)
- ferulate-5-hydroxylase 1 (Arabidopsis)
![Page 42: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/42.jpg)
Two types of GO Annotations:
Electronic Annotation
Manual Annotation
All annotations must:
• be attributed to a source
• indicate what evidence was found to support the GO term-gene/protein association
![Page 43: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/43.jpg)
Manual Annotations
• High–quality, specific gene/gene product associations made, using:
• Peer-reviewed papers
• Evidence codes to grade evidence
BUT – is very time consuming and requires trained biologists
![Page 44: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/44.jpg)
Electronic Annotations
• Provide large coverage
• High-quality
BUT – annotations tend to use high-level GO terms and provide little detail.
![Page 45: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/45.jpg)
Electronic Annotations: Methods
1. Database entries
• Manual mapping of GO terms to concepts external to GO (‘translation tables’)
• Proteins then electronically annotated with the relevant GO term(s)
2. Automatic sequence similarity analyses to transfer annotations between highly similar gene products
![Page 46: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/46.jpg)
Electronic Annotations

| External concept | Source | GO term | GO ID |
| --- | --- | --- | --- |
| Fatty acid biosynthesis | Swiss-Prot Keyword | fatty acid biosynthesis | GO:0006633 |
| EC:6.4.1.2 | EC number | acetyl-CoA carboxylase activity | GO:0003989 |
| IPR000438: Acetyl-CoA carboxylase carboxyl transferase beta subunit | InterPro entry | acetyl-CoA carboxylase activity | GO:0003989 |
![Page 47: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/47.jpg)
Mappings of external concepts to GO
EC:1.1.1.1 > GO:alcohol dehydrogenase activity ; GO:0004022
EC:1.1.1.10 > GO:L-xylulose reductase activity ; GO:0050038
EC:1.1.1.104 > GO:4-oxoproline reductase activity ; GO:0016617
EC:1.1.1.105 > GO:retinol dehydrogenase activity ; GO:0004745
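Mapping lines of this shape can be turned into a lookup table and used to assign GO terms electronically. The following is a minimal sketch, assuming one mapping per line and that lines starting with `!` are comments; `parse_mappings`, `annotate` and the protein identifier in the usage note are hypothetical.

```python
MAPPING_TEXT = """\
EC:1.1.1.1 > GO:alcohol dehydrogenase activity ; GO:0004022
EC:1.1.1.10 > GO:L-xylulose reductase activity ; GO:0050038
EC:1.1.1.104 > GO:4-oxoproline reductase activity ; GO:0016617
EC:1.1.1.105 > GO:retinol dehydrogenase activity ; GO:0004745
"""


def parse_mappings(text):
    """Return {external ID: [(GO ID, GO term name), ...]}."""
    mappings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("!"):  # assumed comment marker
            continue
        external, target = line.split(" > ", 1)
        name, go_id = target.rsplit(" ; ", 1)
        if name.startswith("GO:"):
            name = name[len("GO:"):]
        mappings.setdefault(external, []).append((go_id, name))
    return mappings


def annotate(protein_to_ec, mappings):
    """Return (protein, GO ID, evidence code) for every mapped EC number."""
    annotations = []
    for protein, ec_numbers in protein_to_ec.items():
        for ec in ec_numbers:
            for go_id, _name in mappings.get(ec, []):
                # Annotations produced this way carry the IEA evidence code.
                annotations.append((protein, go_id, "IEA"))
    return annotations
```

Calling `annotate({"protX": ["EC:1.1.1.105"]}, parse_mappings(MAPPING_TEXT))` would yield one IEA annotation to GO:0004745. EC numbers without a mapping are skipped.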
![Page 48: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/48.jpg)
Manual Annotations: Methods
1. Extract information from published literature
2. Curators perform manual sequence similarity analyses to transfer annotations between highly similar gene products (BLAST, protein domain analysis)
![Page 49: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/49.jpg)
Finding GO terms
In this study, we report the isolation and molecular characterization of the B. napus PERK1 cDNA, that is predicted to encode a novel receptor-like kinase. We have shown that like other plant RLKs, the kinase domain of PERK1 has serine/threonine kinase activity. In addition, the location of a PERK1-GFP fusion protein to the plasma membrane supports the prediction that PERK1 is an integral membrane protein…these kinases have been implicated in early stages of wound response…
PubMed ID: 12374299
Highlighted phrases and the GO terms chosen for B. napus PERK1 protein (Q9ARH1):
• "serine/threonine kinase activity" → Function: protein serine/threonine kinase activity GO:0004674
• "integral membrane protein" → Component: integral to plasma membrane GO:0005887
• "wound response" → Process: response to wounding GO:0009611
![Page 50: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/50.jpg)
Additional points
• A gene product can have several functions, cellular locations and be involved in many processes
• Annotation of a gene product to one ontology is independent from its annotation to other ontologies
• Annotations are only to terms reflecting a normal activity or location
• Usage of ‘unknown’ GO terms
![Page 51: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/51.jpg)
Unknown vs. Unannotated
• “Unknown” is used when the curator has determined that there is no existing literature to support an annotation.
– Biological process unknown GO:0000004
– Molecular function unknown GO:0005554
– Cellular component unknown GO:0008372
• NOT the same as having no annotation at all
– No annotation means that no one has looked yet
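The difference between "unknown" and "unannotated" can be made explicit when summarizing a gene's annotations. This is a minimal sketch: the three GO IDs come from the slide, while the function name, the status labels and the per-aspect dictionary layout (keyed by P, F and C) are assumptions.

```python
# The "unknown" term for each aspect, as listed on the slide.
UNKNOWN_TERMS = {
    "P": "GO:0000004",  # biological process unknown
    "F": "GO:0005554",  # molecular function unknown
    "C": "GO:0008372",  # cellular component unknown
}


def annotation_status(gene_annotations, aspect):
    """Classify one aspect of one gene.

    gene_annotations maps an aspect letter to a list of GO IDs.
    """
    go_ids = gene_annotations.get(aspect, [])
    if not go_ids:
        return "unannotated"  # no one has looked yet
    if all(go_id == UNKNOWN_TERMS[aspect] for go_id in go_ids):
        return "unknown"  # a curator looked and found no supporting literature
    return "annotated"
```

A gene with `{"P": ["GO:0000004"]}` is "unknown" for process but still "unannotated" for function and component, which matches the point that each ontology is annotated independently.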
![Page 52: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/52.jpg)
GO Evidence Codes
| Code | Definition |
| --- | --- |
| IEA | Inferred from Electronic Annotation |
| NAS | Non-traceable Author Statement |
| TAS | Traceable Author Statement |
| ND | No Data (use with annotation to unknown) |
| IDA | Inferred from Direct Assay |
| *IPI | Inferred from Physical Interaction |
| *IGI | Inferred from Genetic Interaction |
| IMP | Inferred from Mutant Phenotype |
| IEP | Inferred from Expression Pattern |
| *IC | Inferred from Curator |
| *ISS | Inferred from Sequence Similarity |

(Figure label: "Manually annotated")
![Page 53: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/53.jpg)
GO Evidence Codes
| Code | Definition |
| --- | --- |
| *IEA | Inferred from Electronic Annotation |
| IDA | Inferred from Direct Assay |
| IEP | Inferred from Expression Pattern |
| *IGI | Inferred from Genetic Interaction |
| IMP | Inferred from Mutant Phenotype |
| *IPI | Inferred from Physical Interaction |
| *ISS | Inferred from Sequence Similarity |
| TAS | Traceable Author Statement |
| NAS | Non-traceable Author Statement |
| *IC | Inferred from Curator |
| RCA | Inferred from Reviewed Computational Analysis |
| ND | No Data |

* With column required
(Figure label: "Manually annotated")

IDA:
• Enzyme assays
• In vitro reconstitution (transcription)
• Immunofluorescence
• Cell fractionation
TAS:
• In the literature source the original experiments referred to are traceable (referenced).
![Page 54: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/54.jpg)
GO Evidence Codes: with/from
Additional information required for certain evidence codes

| Code | Definition |
| --- | --- |
| *IEA | Inferred from Electronic Annotation |
| IDA | Inferred from Direct Assay |
| IEP | Inferred from Expression Pattern |
| *IGI | Inferred from Genetic Interaction |
| IMP | Inferred from Mutant Phenotype |
| *IPI | Inferred from Physical Interaction |
| *ISS | Inferred from Sequence Similarity |
| TAS | Traceable Author Statement |
| NAS | Non-traceable Author Statement |
| *IC | Inferred from Curator |
| RCA | Inferred from Reviewed Computational Analysis |
| ND | No Data |

* With column required
(Figure label: "Manually annotated")

IGI:
• a gene identifier for the "other" gene involved in the interaction
IPI:
• a gene or protein identifier for the "other" protein involved in the interaction
IC:
• GO term from another annotation used as the basis of a curator inference
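The with/from requirement lends itself to a simple consistency check. The sketch below encodes the starred codes from the table above; the function name and the wording of the messages are hypothetical, and it also checks the earlier rule that every annotation must be attributed to a source.

```python
# Evidence codes from the slide; the starred ones need a with/from value.
KNOWN_CODES = {
    "IEA", "IDA", "IEP", "IGI", "IMP", "IPI",
    "ISS", "TAS", "NAS", "IC", "RCA", "ND",
}
WITH_REQUIRED = {"IEA", "IGI", "IPI", "ISS", "IC"}


def check_annotation(evidence_code, with_from, reference):
    """Return a list of problems found; an empty list means none."""
    problems = []
    if evidence_code not in KNOWN_CODES:
        problems.append(f"unknown evidence code {evidence_code}")
    if not reference:
        problems.append("missing source reference")
    if evidence_code in WITH_REQUIRED and not with_from:
        problems.append(f"{evidence_code} requires a with/from value")
    return problems
```

For instance, an IPI annotation with an empty with/from value would be flagged, while an IDA annotation with only a reference would pass.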
![Page 55: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/55.jpg)
Term Hierarchy
1. TAS/IDA
2. IMP/IGI/IPI
3. ISS/IEP
4. NAS
5. IEA
![Page 56: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/56.jpg)
Modifying the interpretation of an annotation: the Qualifier column
1. NOT
• a gene product is NOT associated with the GO term
• to document conflicting claims in the literature
2. Contributes to
• distinguishes between individual subunit functions and whole complex functions
• used with GO Function Ontology
3. Colocalizes with
• transiently or peripherally associated with an organelle or complex
• used with GO Component Ontology
![Page 57: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/57.jpg)
Annotation of a genome
• GO annotations are always a work in progress
• Part of normal curation process
– More specific information
– Better evidence code
• Replace obsolete terms
• “Last reviewed” date
![Page 58: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/58.jpg)
How to access the Gene Ontology and its annotations
1. Downloads
• Ontologies
• Annotations: Gene association files
• Ontologies and Annotations
2. Web-based access
• AmiGO (http://www.godatabase.org)
• QuickGO (http://www.ebi.ac.uk/ego)
among others…
![Page 59: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/59.jpg)
GO ontology (gene_ontology.obo)
format-version: 1.0
date: 20:10:2005 17:32
saved-by: jlomax
auto-generated-by: DAG-Edit 1.419 rev 3
default-namespace: gene_ontology
remark: cvs version: $Revision: 3.1176 $

[Term]
id: GO:0000001
name: mitochondrion inheritance
namespace: biological_process
def: "The distribution of mitochondria\, including the mitochondrial genome\, into daughter cells after mitosis or meiosis\, mediated by interactions between mitochondria and the cytoskeleton." [PMID:10873824, PMID:11389764, SGD:mcc]
is_a: GO:0048308 ! organelle inheritance
is_a: GO:0048311 ! mitochondrion distribution

[Term]
id: GO:0000002
name: mitochondrial genome maintenance
namespace: biological_process
def: "The maintenance of the structure and integrity of the mitochondrial genome." [GO:ai]
is_a: GO:0007005 ! mitochondrion organization and biogenesis

[Term]
id: GO:0000003
name: reproduction
alt_id: GO:0019952
namespace: biological_process
def: "The production by an organism of new individuals that contain some portion of their genetic material inherited from that organism." [GO:curators, ISBN:0198506732]
subset: goslim_generic
subset: goslim_plant
subset: gosubset_prok
is_a: GO:0008150 ! biological_process

[Term]
id: GO:0000004
name: biological process unknown
namespace: biological_process
def: "Used for the annotation of gene products whose process is not known or cannot be inferred." [SGD:curators]
subset: goslim_generic
subset: goslim_goa
subset: goslim_plant
subset: goslim_yeast
subset: gosubset_prok
is_a: GO:0008150 ! biological_process
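A file in this format can be read with a few lines of code. The sketch below is an illustration rather than a full OBO parser: it assumes one `tag: value` pair per line and `[Term]` stanza headers, keeps every tag as a list because tags such as `subset` and `is_a` repeat, and uses an abridged copy of the stanzas shown above. The name `parse_obo_terms` is hypothetical.

```python
OBO_TEXT = """\
format-version: 1.0

[Term]
id: GO:0000002
name: mitochondrial genome maintenance
namespace: biological_process
is_a: GO:0007005 ! mitochondrion organization and biogenesis

[Term]
id: GO:0000003
name: reproduction
alt_id: GO:0019952
namespace: biological_process
subset: goslim_generic
subset: goslim_plant
is_a: GO:0008150 ! biological_process
"""


def parse_obo_terms(text):
    """Return {GO ID: {tag: [values, ...]}} for every [Term] stanza."""
    terms = {}
    current = None  # None while reading the header or a non-Term stanza
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if line.startswith("["):
            current = {} if line == "[Term]" else None
            continue
        if current is None or not line:
            continue
        tag, _, value = line.partition(": ")
        if tag == "is_a":
            # Drop the human-readable name that follows " ! ".
            value = value.split(" ! ")[0]
        current.setdefault(tag, []).append(value)
        if tag == "id":
            terms[value] = current
    return terms
```

With this, `parse_obo_terms(OBO_TEXT)["GO:0000003"]["subset"]` would list both subsets of the term, and the `is_a` values give the parent IDs needed to rebuild the graph.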
![Page 60: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/60.jpg)
Viewing GO terms (DAG-Edit)
![Page 61: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/61.jpg)
http://www.geneontology.org/GO.current.annotations.shtml
Gene Association Files
![Page 62: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/62.jpg)
Anatomy of a gene association file

| Column | Content | Example |
| --- | --- | --- |
| 1 | DB | SGD, MGI |
| 2 | DB_Object_ID | MGI:1234568 |
| 3 | DB_Object_Symbol | Gras3 |
| 4 | Qualifier | NOT, colocalizes_with, contributes_to |
| 5 | GO_ID | GO:0001515 |
| 6 | DB_Ref | PMID:234567 |
| 7 | Evidence_Code | IDA, etc. |
| 8 | With/From | |
| 9 | GO_aspect | P (process), C (component), F (function) |
| 10 | DB_Object_Name | Grasshopper 3 homolog |
| 11 | DB_Object_Synonym | Locust III, 0122345E12Rik |
| 12 | DB_Object_Type | Gene, transcript, or protein |
| 13 | Taxon | taxon:4932 |
| 14 | Date | 20050101 |
| 15 | Assigned_by | DB (usually same as column 1) |
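The fifteen columns map naturally onto a record with named fields. The sketch below assumes a tab-delimited line, `!` as the comment marker and `|` as the separator for multi-valued columns; the field names are hypothetical, and the example row is assembled from the example column above (its aspect value is made up for illustration).

```python
GAF_COLUMNS = [
    "db", "db_object_id", "db_object_symbol", "qualifier", "go_id",
    "db_reference", "evidence_code", "with_from", "aspect",
    "db_object_name", "db_object_synonym", "db_object_type",
    "taxon", "date", "assigned_by",
]
MULTI_VALUED = ("qualifier", "db_reference", "with_from", "db_object_synonym")

# Hypothetical row built from the slide's example values.
EXAMPLE_LINE = "\t".join([
    "MGI", "MGI:1234568", "Gras3", "", "GO:0001515", "PMID:234567",
    "IDA", "", "F", "Grasshopper 3 homolog", "Locust III|0122345E12Rik",
    "gene", "taxon:4932", "20050101", "MGI",
])


def parse_gaf_line(line):
    """Return a dict for one annotation line, or None for a comment line."""
    if line.startswith("!"):
        return None
    fields = line.rstrip("\n").split("\t")
    if len(fields) != len(GAF_COLUMNS):
        raise ValueError(f"expected {len(GAF_COLUMNS)} columns, got {len(fields)}")
    record = dict(zip(GAF_COLUMNS, fields))
    for key in MULTI_VALUED:
        # Empty columns become empty lists.
        record[key] = [value for value in record[key].split("|") if value]
    return record
```

Parsing `EXAMPLE_LINE` would give a record whose `go_id` is GO:0001515, whose `qualifier` list is empty, and whose synonyms are split into two entries.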
![Page 63: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/63.jpg)
![Page 64: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/64.jpg)
Viewing Annotations
• AmiGO Browser: http://www.godatabase.org
– A GO browser that tracks contributed GO annotations across species.
– Uses annotation sets supplied in a specific format.
![Page 65: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/65.jpg)
AmiGO: http://www.godatabase.org
![Page 66: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/66.jpg)
![Page 67: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/67.jpg)
![Page 68: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/68.jpg)
| Symbol | Information | Source | Evidence | Reference |
| --- | --- | --- | --- | --- |
| Anxa6 | annexin A6, gene from Rattus norvegicus | RGD | TAS | RGD:724802 |
![Page 69: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/69.jpg)
![Page 70: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/70.jpg)
![Page 71: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/71.jpg)
Querying the GO
• Search for GO terms or by Gene symbol/name
• Filter queries by organism, data source or evidence
![Page 72: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/72.jpg)
Querying the GO
![Page 73: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/73.jpg)
Querying the GO
![Page 74: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/74.jpg)
![Page 75: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/75.jpg)
http://www.ncbi.nlm.nih.gov/entrez
![Page 76: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/76.jpg)
www.uniprot.org/
![Page 77: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/77.jpg)
www.ensembl.org/
![Page 78: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/78.jpg)
dictyBase Gene Page
![Page 79: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/79.jpg)
Outline
1. Introduction to the Gene Ontology
2. Gene Ontology annotations
3. Editing the Gene Ontology
4. Practical applications for the Gene Ontology
5. The Gene Ontology as one of many biological ontologies
![Page 80: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/80.jpg)
How is GO maintained?
• Several full-time editors
• Requests from community
– database curators, researchers, software developers
– SourceForge tracker
• GO Consortium meetings for large changes
• Mailing lists
![Page 81: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/81.jpg)
Reactome
![Page 82: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/82.jpg)
Ensuring Stability in a Dynamic Ontology
• Terms become obsolete when they are removed or redefined
• GO IDs are never deleted
• For each term, a comment is added to explain why the term is now obsolete
[Figure labels: Biological Process, Molecular Function, Cellular Component; Obsolete Biological Process, Obsolete Molecular Function, Obsolete Cellular Component]
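The obsoletion rules on this slide can be sketched in code. The example below is a minimal illustration, assuming terms are stored as OBO-style `[Term]` stanzas that carry `is_obsolete` and `comment` tags. The stanza, its ID `GO:9999999`, and the helper names are made up for illustration and are not taken from the GO files.

```python
# Hypothetical stanza: the ID, name, and comment are invented for this sketch.
EXAMPLE_STANZA = """
[Term]
id: GO:9999999
name: obsolete example process
comment: This term was made obsolete because its definition was ambiguous.
is_obsolete: true
"""


def parse_term_stanza(stanza: str) -> dict:
    """Parse one OBO-style [Term] stanza into {tag: [values]}."""
    term = {}
    for line in stanza.strip().splitlines():
        line = line.strip()
        # Skip blank lines, the [Term] header, and '!' comment lines.
        if not line or line.startswith("[") or line.startswith("!"):
            continue
        # Split on the first colon only, so 'id: GO:9999999' keeps its value whole.
        tag, _, value = line.partition(":")
        term.setdefault(tag.strip(), []).append(value.strip())
    return term


def is_obsolete(term: dict) -> bool:
    """A term counts as obsolete only if it is explicitly flagged."""
    return term.get("is_obsolete", ["false"])[0] == "true"


def obsoletion_comment(term: dict):
    """Return the first comment explaining the obsoletion, or None."""
    comments = term.get("comment")
    return comments[0] if comments else None


term = parse_term_stanza(EXAMPLE_STANZA)
if is_obsolete(term):
    print(term["id"][0], "is obsolete:", obsoletion_comment(term))
```

Because the ID stays in the file, an annotation that still points at an obsolete term can be detected and re-curated rather than silently breaking.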
![Page 83: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/83.jpg)
Why modify the GO?
• GO reflects current knowledge of biology
• Adding new organisms can make existing term arrangements incorrect
• Not everything was perfect from the outset
![Page 84: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/84.jpg)
Example - parasites
• Original GO:
![Page 85: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/85.jpg)
Example - parasites
• Annotation of P. falciparum
– protozoan cellular parasite
– intracellular infection (erythrocytes)
• Parasite proteins located in host nucleus
• What cellular component term to annotate to?
– ‘nucleus’ refers to parasite nucleus when annotating parasite
![Page 86: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/86.jpg)
Example - parasites
• Added new term ‘host’:
![Page 87: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/87.jpg)
Example - parasites
parasite gene products located in host nucleus annotated here
parasite gene products located in parasite nucleus annotated here
![Page 88: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/88.jpg)
Requesting changes to GO - curator requests tracker
• Common changes suggested:
– new term requests
– reporting errors (typos, etc)
– obsoletion/merge requests
– add synonym
– queries
– term move (change parents)
![Page 89: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/89.jpg)
The GO editorial office
• Primary responsibility to edit ontologies in response to community needs
• Also:
– website
– documentation
– outreach
  • GO in other systems
  • new annotation groups
– training
![Page 90: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/90.jpg)
Outline
1. Introduction to the Gene Ontology
2. Gene Ontology annotations
3. Editing the Gene Ontology
4. Practical applications for the Gene Ontology
5. The Gene Ontology as one of many biological ontologies
![Page 91: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/91.jpg)
What can scientists do with GO?
• Access gene product functional information
• Find how much of a proteome is involved in a process/function/component in the cell
• Map GO terms and incorporate manual annotations into own databases
• Provide a link between biological knowledge and …
– gene expression profiles
– proteomics data
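As an illustration of the second point, the share of a proteome involved in a process can be estimated by counting distinct annotated gene products. This is a minimal sketch, assuming annotations are available as (gene product, GO ID) pairs. The gene names, the proteome size, and the function name are hypothetical, and the caller is assumed to pass in the term of interest together with its descendant terms.

```python
def fraction_annotated(annotations, term_ids, proteome_size):
    """Fraction of the proteome with at least one annotation to any ID in term_ids.

    annotations   -- iterable of (gene_product, go_id) pairs
    term_ids      -- set of GO IDs: the term of interest plus its descendants
    proteome_size -- total number of gene products in the proteome
    """
    if proteome_size <= 0:
        raise ValueError("proteome_size must be positive")
    # A set ensures each gene product is counted once, however many annotations it has.
    products = {gp for gp, go_id in annotations if go_id in term_ids}
    return len(products) / proteome_size


# Hypothetical toy data: four gene products, two annotated to the terms of interest.
toy_annotations = [
    ("geneA", "GO:0006955"),
    ("geneA", "GO:0006955"),  # duplicate annotation, counted once
    ("geneB", "GO:0006955"),
    ("geneC", "GO:0005634"),
]
share = fraction_annotated(toy_annotations, {"GO:0006955"}, proteome_size=4)
print(f"{share:.0%} of the toy proteome is annotated to the process")
```

Gene products without any annotation count toward the proteome size but not the numerator, so the result is a lower bound on the true share.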
![Page 92: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/92.jpg)
MicroArray data analysis
…analysis of high-throughput data according to GO
[Figure: selected gene tree (Pearson), gene list: all genes (14010); axis labels: attacked, control, time. Branch labels:]
• Puparial adhesion, Molting cycle, hemocyanin
• Defense response, Immune response, Response to stimulus, Toll regulated genes, JAK-STAT regulated genes
• Immune response, Toll regulated genes
• Amino acid catabolism, Lipid metabolism
• Peptidase activity, Protein catabolism, Immune response
Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.
![Page 93: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/93.jpg)
Color indicates up/down regulation
GoMiner Tool, John Weinstein et al, Genome Biol. 4 (R28) 2003
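Analyses of this kind usually ask whether a GO term occurs more often among the regulated genes than chance would predict. The sketch below uses a one-sided hypergeometric test, a common choice for this question. It is an assumption for illustration and not necessarily the statistic computed by the tool cited on the slide; the function and parameter names are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """P(X >= hits) for a GO term under the hypergeometric distribution.

    population -- number of genes on the array
    annotated  -- genes on the array annotated to the GO term
    sample     -- number of regulated genes (the selected list)
    hits       -- regulated genes annotated to the GO term
    """
    if not (0 <= annotated <= population and 0 <= sample <= population):
        raise ValueError("annotated and sample must lie between 0 and population")
    upper = min(annotated, sample)
    total = comb(population, sample)
    # Sum the probability of observing `hits` or more annotated genes in the sample.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Toy numbers: 10 genes, 3 annotated to the term, 3 regulated, all 3 regulated are annotated.
p = hypergeom_enrichment_p(population=10, annotated=3, sample=3, hits=3)
print(f"p = {p:.4f}")
```

When many GO terms are tested at once, the resulting p-values need a multiple-testing correction before they are interpreted.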
![Page 94: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/94.jpg)
http://www.geneontology.org/GO.tools
![Page 95: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/95.jpg)
Outline
1. Introduction to the Gene Ontology
2. Gene Ontology annotations
3. Editing the Gene Ontology
4. Practical applications for the Gene Ontology
5. The Gene Ontology as one of many biological ontologies
![Page 96: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/96.jpg)
Beyond GO – Open Biomedical Ontologies
• Orthogonal to existing ontologies to facilitate combinatorial approaches
– Share unique identifier space
– Include definitions
• Anatomies
• Cell Types
• Sequence Attributes
• Temporal Attributes
• Phenotypes
• Diseases
• More…
http://obo.sourceforge.net
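A shared identifier space means a term ID alone tells you which ontology it belongs to. The sketch below assumes IDs follow a `PREFIX:LOCAL` pattern such as `GO:0005634`. The helper names are hypothetical and the example IDs are only illustrative.

```python
from collections import defaultdict


def split_obo_id(identifier: str) -> tuple[str, str]:
    """Split 'PREFIX:LOCAL' into (prefix, local part); reject anything else."""
    prefix, sep, local = identifier.partition(":")
    if not sep or not prefix or not local:
        raise ValueError(f"not a PREFIX:LOCAL identifier: {identifier!r}")
    return prefix, local


def group_by_ontology(identifiers):
    """Group term IDs by their ontology prefix, preserving input order."""
    groups = defaultdict(list)
    for identifier in identifiers:
        prefix, _ = split_obo_id(identifier)
        groups[prefix].append(identifier)
    return dict(groups)


# Illustrative IDs from three ontologies that share the same ID pattern.
EXAMPLE_IDS = ["GO:0005634", "SO:0000704", "CHEBI:15377", "GO:0008150"]
print(group_by_ontology(EXAMPLE_IDS))
```

Because prefixes do not collide, terms from orthogonal ontologies can be combined in one annotation without ambiguity.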
![Page 97: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/97.jpg)
Sequence Ontology
http://song.sourceforge.net
![Page 98: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/98.jpg)
• Ontology of ‘small molecular entities’
http://www.ebi.ac.uk/chebi
![Page 99: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/99.jpg)
http://www.fruitfly.org/cgi-bin/ex/go.cgi
![Page 100: GO: The Gene Ontology Pascale Gaudet dictyBase curator Northwestern University, Chicago, IL.](https://reader030.fdocuments.in/reader030/viewer/2022032805/56649ee95503460f94bfa5d1/html5/thumbnails/100.jpg)
[Figure: Ontologies, with labels Anatomy, Physiology, Phenotype, Pathway, Disease, Molecular, Metabolic, Developmental Stage]