Page 1 Center for Biological Sequence Analysis The Technical University of Denmark DTU Comparison of...
Center for Biological Sequence Analysis, The Technical University of Denmark, DTU
Comparison of sequenced Bacterial Genomes
Comparative Microbial Genomics Group
Dave Ussery
Advanced Bioinformatics course lecture
21 November, 2003
overview
Overview
Last week: DNA structures in Bacterial Genomes
Today: How do you compare 150 genomes?
1. Introduction & a look at 4 Pseudomonas genomes
2. 5 epsilon Proteobacteria (Campylobacter, Helicobacter, and Related Organisms)
3. Comparison of 5 Clostridium genomes
title
Part 1: Introduction and a look at 4 Pseudomonas genomes
OVERVIEW
1. Introduction to Bacterial Genome comparison
2. An example of “Current methods” (Bordetella)
3. Comparison of Four Pseudomonas Species
[CBS tools for bacterial genome comparison]
overview
3 Bordetella genomes
Introduction Example: 3 Bordetella genomes
Parkhill et al., Nature Genetics, 35:32-40, (2003).
Wheel plots
Genome alignment
circles
Comparing Genomes
Comparing genomes:
• Genome Alignment (sometimes useful)
1. Table for comparisons
2. Atlas plots for visualisation of the whole chromosome
3. Proteome comparisons
Comparing 100 genomes
How do you compare more than 100 bacterial genomes?
Organism                               % AT   Length (bp)
Pseudomonas aeruginosa, strain PAO1    33.4   6,264,403
Pseudomonas fluorescens SBW25          40.0   6,703,654
Pseudomonas putida KT2440              38.5   6,181,863
P. syringae pv. tomato strain DC3000   41.0   6,397,126
Average of 150 genomes                 52.8   3,004,048
http://www.cbs.dtu.dk/services/GenomeAtlas/Bacteria
Comparison tool #1 - The table
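The two single-value features in the table (% AT and chromosome length) are easy to compute once a genome sequence is in hand. The following is a minimal Python sketch, not the code behind the CBS Genome Atlas pages; the function names `at_content` and `table_row` and the toy sequence are assumptions made for illustration only.

```python
def at_content(sequence: str) -> float:
    """Return the percentage of A and T among the unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    total = at + gc
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * at / total


def table_row(name: str, sequence: str) -> tuple:
    """Build one (organism, % AT, length) row in the style of the comparison table."""
    return (name, round(at_content(sequence), 1), len(sequence))


# Hypothetical toy sequence, standing in for a real chromosome.
toy = "ATGCGCGCTA"
print(table_row("toy genome", toy))
```

For a real genome the sequence string would be read from a FASTA file first; ambiguous bases such as N are left out of the % AT calculation but still count towards the length.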
Repeats table
Organism                 Global Direct   Global Inverted   Local Direct   Local Inverted
P. aeruginosa PAO1       3.6             2.5               12.1           7.7
P. fluorescens SBW25     3.8             3.2               6.5            4.5
P. putida KT2440         4.7             3.8               7.1            5
P. syringae DC3000       6.9             6.2               5.2            3.8
Average of 150 genomes   4.1             2.9               8.3            6.1
repeats
Purine stretches
Genome Letters
Di and Tetranucleotides
GL tables 1 and 2
Repetitive sequences
REP logos
GL Figure 10
GL Table 3
Know your insects
Paer. Genome atlas
Comparison tool #2 - DNA atlases
Pput Genome Atlas
Psyr. Genome Atlas
3 genome alignment
Table 2
Organism                               % AT   Length (bp)   rRNA operons   # genes   # sigmas
Pseudomonas aeruginosa, strain PAO1    33.4   6,264,403     4              5566      23
Pseudomonas fluorescens SBW25          40.0   6,703,654     5              5480      29
Pseudomonas putida KT2440              38.5   6,181,863     7              5350      23
P. syringae pv. tomato strain DC3000   41.0   6,397,126     5              5471      14
Average of 150 genomes                 52.8   3,004,048     3              2890      8
sigmas
Sig70 tree
Blast table
Comparison tool #3 - Proteome comparison
Blast atlas
Paer blast atlas
Pput blast atlas
Psyr blast atlas
zoom
Table 3.1
Phylome 1 atlas
phylome2atlas
An aside
An aside…
Bacteria have a periodicity of 11 bp
Archaea have a periodicity of 10 bp
Peder Worning et al., Nucl. Acids Res., 28:706-709, 2000
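One generic way to look for such a periodicity is to autocorrelate the positions of a chosen set of bases and see which lag stands out. The sketch below is an illustrative assumption, not the procedure used by Worning et al.; the function names, the default choice of A and T as the tracked bases, and the synthetic test sequence are all hypothetical.

```python
def autocorrelation(sequence: str, symbols: str = "AT", max_lag: int = 20) -> dict:
    """For each lag k in 1..max_lag, return the fraction of positions i where
    both sequence[i] and sequence[i + k] belong to `symbols`."""
    seq = sequence.upper()
    hits = [1 if base in symbols else 0 for base in seq]
    result = {}
    for lag in range(1, max_lag + 1):
        pairs = len(hits) - lag
        if pairs <= 0:
            break  # sequence too short for this lag
        matches = sum(hits[i] * hits[i + lag] for i in range(pairs))
        result[lag] = matches / pairs
    return result


def strongest_lag(sequence: str, symbols: str = "AT",
                  min_lag: int = 2, max_lag: int = 20) -> int:
    """Return the lag (>= min_lag) with the highest autocorrelation value."""
    corr = autocorrelation(sequence, symbols, max_lag)
    candidates = {lag: value for lag, value in corr.items() if lag >= min_lag}
    return max(candidates, key=candidates.get)


# Synthetic sequence with an A every 11 bases (hypothetical test data).
toy = ("A" + "G" * 10) * 30
print(strongest_lag(toy))  # 11 for this synthetic sequence
```

On real genomes the signal is far weaker than in this toy case, so the autocorrelation values would normally be compared against a shuffled-sequence baseline or examined through a Fourier transform of the curve.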
E.coli autocorrel.
E.coli autocorrel
Fourier transform
E.coli O157
E.coli periodicity
Haemophilus
C. jejuni
H.pylori J99
M. jannaschii
T. maritima
Comparison of 3 genomes
Pseudomonas periodicity
Summary
3 ways to compare genomes:
1. Table (gives single-value, general features)
2. DNA Atlases (circular maps of the whole chromosome)
3. Proteome comparison (Blast, Phylome-atlas, etc.)