Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for...
Transcript of Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for...
![Page 1: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/1.jpg)
1 of 19
Genomes with EnsemblGenomes with Ensembl
![Page 2: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/2.jpg)
2 of 19
SubjectsSubjectsAn introduction to genome browsers.
What is an Ensembl gene?
How can we extract data from Ensembl?
![Page 3: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/3.jpg)
3 of 19
Genome BrowsersGenome Browsers
• Ensembl Genome browserhttp://www.ensembl.org
• NCBI Map Viewerhttp://www.ncbi.nlm.nih.gov/mapview/
• UCSC Genome Browserhttp://genome.ucsc.edu
![Page 4: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/4.jpg)
4 of 19
Ensembl Genome BrowserEnsembl Genome Browser
Protein/ mRNA Sequence Assembly
Ensembl Genes
![Page 5: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/5.jpg)
5 of 19
NCBI Map ViewerNCBI Map Viewer
![Page 6: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/6.jpg)
6 of 19
UCSC Genome BrowserUCSC Genome Browser
![Page 7: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/7.jpg)
7 of 19
To meet a challengeTo meet a challenge……Ensembl’s AIM: To provide annotation for the biological
community that is freely available and of high quality
• Started in 2000
• Joint project between EBI and Sanger
• Funded primarily by the Wellcome Trust, additional funding by EMBL, NIH-NIAID, EU, BBSRC and MRC
![Page 8: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/8.jpg)
8 of 19
What Distinguishes Ensembl from What Distinguishes Ensembl from the UCSC and NCBI Browsers?the UCSC and NCBI Browsers?
• The gene set. Automatic annotation based on mRNA and protein information.
• Programmatic access via the Perl API (open source)
• BioMart
• Integration with other databases (DAS)
• Comparative analysis (gene trees)
![Page 9: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/9.jpg)
9 of 19
Ensembl Genes Ensembl Genes –– biological basisbiological basis
• UniProt/Swiss-Prot (manually curated)• UniProt/TrEMBLwww.uniprot.org
• NCBI RefSeq (manually curated)www.ncbi.nlm.nih.gov/RefSeq
All Ensembl transcripts are based on proteins and mRNAs in:
![Page 10: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/10.jpg)
10 of 19
Ensembl and VEGA/HavanaEnsembl and VEGA/Havana
• Automatic annotation pipeline: Gene building all at once (whole genome) Ensembl
• Manual curation: case-by-case basisVEGA: Vertebrate Genome Annotation Havana
![Page 11: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/11.jpg)
11 of 19
Genes and Transcripts in EnsemblGenes and Transcripts in Ensembl
• Ensembl known transcripts• Ensembl novel transcripts• Ensembl merged transcripts (Havana)
• EST clusters
• More manual curation (SGD, WormBase, FlyBase)
![Page 12: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/12.jpg)
12 of 19
What annotation is available?What annotation is available?• Gene/transcript/peptide models (coding and
noncoding (ncRNAs))• IDs in other databases• Mapped cDNAs, peptides, micro array probes,
BAC clones etc.• Other features of the genome:
cytogenetic bands, markers, repeats etc.• Comparative data:
orthologues and paralogues, protein families, whole genome alignments, syntenic regions
• Variation data:SNPs
• Regulatory data:“best guess” set of regulatory elements from ENCODE
• Data from external sources (DAS)
![Page 13: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/13.jpg)
13 of 19
What species are available What species are available (v52)? Chordate focus.(v52)? Chordate focus.
![Page 14: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/14.jpg)
14 of 19
How is this information organised?How is this information organised?
• Ensembl Views (Website)
• Ensembl Database (open source)
• BioMart ‘DataMining tool’
![Page 15: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/15.jpg)
15 of 19
Names in EnsemblNames in Ensembl
• ENSG### Ensembl Gene ID• ENST### Ensembl Transcript ID• ENSP### Ensembl Peptide ID• ENSE### Ensembl Exon ID
• For other species than human a suffix is added:MUS (Mus musculus) for mouse: ENSMUSG### DAR (Danio rerio) for zebrafish: ENSDARG###, etc.
![Page 16: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/16.jpg)
16 of 19
Gene Structure in EnsemblGene Structure in Ensembl
Calmodulin Human
Calmodulin ChickenNo UTRs
UTRs annotated
![Page 17: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/17.jpg)
17 of 19
Help and InformationHelp and Information• Comments and questions?
• View our help videos
• Mailing lists:[email protected]@ebi.ac.uk
• Come visit our blog! http://ensembl.blogspot.com/
![Page 18: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/18.jpg)
18 of 19© Sean T. McHugh (http://www.cambridgeincolour.com)
Cambridge
![Page 19: Genomes with Ensembl · 7o 1f 9 To meet a challenge… Ensembl’s AIM: To provide annotation for the biological community that is freely available and of high quality • Started](https://reader035.fdocuments.in/reader035/viewer/2022071119/601975bdbe52bf479029dbca/html5/thumbnails/19.jpg)
19 of 19
Ensembl TeamEnsembl Team
Xosé M Fernández, Bert Overduin, Michael Schuster (QC), Giulietta SpudichOutreach
Ian Dunham, Damian Keefe, Alison Meynert, Dace Ruklisa, Guy Slater, Daniel ZerbinoResearch
Ewan Birney, Richard Durbin, Tim HubbardEnsembl Strategy
Nathan Johnson, Stefan Gräf, Steven WilderFunctional Genomics
Javier Herrero, Kathryn Beal, Benoît Ballester, Stephen Fitzgerald, Leo Gordon, Albert VilellaComparative Genomics
Fiona Cunningham, Yuan Chen Variation
Mario Caccamo, Laura Clark, Jonathan Hinton, Zam Iqbal, Vasudev Kumanduri, IlkkaLappalainenVertebrate Genomics
Bronwen Aken, Julio Banet, Susan Fairley, Jan-Hinnerck Vogel, Simon White, Amonida ZadissaAnalysis and Annotation
Paul Flicek (EBI), Steve Searle (Sanger Institute)Ensembl
Guy Coates, Tim Cutts, Shelley GoddardSystems & Support
Dan Lawson, Martin Hammond, Karyn MegyVectorBase
Kerstin Howe, Britt Reimholz, James TorranceZebrafish
James Smith, Eugene Bragin, Anne Parker, Bethan Pritchard, Steve Trevanion (VEGA)Web Team
Glenn Proctor, Syed Haider, Andrew Jenkinson, Andreas Kähäri, Stephen Keenan, Rhoda Kinsella, Eugene Kulesha, Ian Longden, Daniel Rios Software