Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

63
Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann

Transcript of Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Page 1: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Metagenomics of Microbial Communities

Scott SprouleCam MacMillanDaniel Hann

Page 2: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

OutlineContext- A brief history of microbiology- A brief history of genomics- Defining metagenomicsMetagenomics - Transcending genomics- Accurate diversity measurementsTwo approaches of metagenomics- Sequence based approach- Function based approachEnvironmental analysis- Marine- SoilApplications of Metagenomics- Industrial - Agriculture & Renewable Energy- Environmental remediation- Life sciencesConclusion

Page 3: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Context

Page 4: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Date Contributor Contribution

1703 Robert Hooke Observed Cells

1677 Antonie van Veeuwenhoek Observed microbes

1776 Edward Jenner Vaccine

1862 Louis Pasteur Germ Theory

1875 Ferdinand J. Cohn Classification of Bacteria

1881 Robert Koch Bacteria & Disease

1928 Frederick Griffith Transformation

1950’s

Jonas Salk Advances in cell culturing

1963 Jacob & Monod Operon concept

1973 Cohen, Chang, Helling, & Boyer

Plasmids as vectors

1986 Kary Mullis Polymerase chain reaction

Microbiology: Perspective

Page 5: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Main Techniques

Culturing Techniques Microscopic Techniques

- Direct observation- Combine with

- Staining- Isotopes- Flourescence

- Growing- Isolation- Examination- Manipulation- experimentation

Inconsistent estimates of diversity and organisms numbers

Page 6: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Accounting for Inconsistencies

- How much were they missing?

- What were they missing?

- Why were they missing it?

“The Great Plate Count Anomaly”

It was clear that there were many viable cells that could not be cultured

Page 7: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

“Unculturability”

Environmental- Nutritional factors- Signaling factors- Essential factors- Artificial reproduction

difficult

Bacteria- Specific requirements- Competition- Community structure- Defense mechanisms

“Can we culture the unculturable?

Page 8: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Genome

- Common all living things on earth

- A universal language

- An instruction manual

- A toolbox

- A record of history

Page 9: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Genomics

the study of whole genomes

Nucleotides DNA Genes Genomes

Page 10: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

History of Genomics

Date Contributor Contribution

1858 Darwin Natural selection

1865 Mendel Genetic inheritance

1941 Beagle and Tatum One gene, one enzyme

1944 Avery, MacLeod, & McCarty

DNA genetic material

1946 Lederberg & Tatum Bacterial recombination

1953 Watson & Crick Double helix

1969 Bell laboratories UNIX

1974 Cerg & Kahn TCP protocol

1977 Sanger Sanger sequencing

1982 GenBank Online database

1990 Altshul BLAST

1995 Venter & Celera Corp Shotgun sequencing

Page 11: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Rapid sequencing of whole genomes

The pinnacle of genomics?

Metagenomics transcends genomics

Multiple genome level

Genomics Today

Page 12: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Metagenomics

Meta-analysis – combination of separate analysis

Genomics – analysis of an organisms genetic material

Page 13: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

What is metagenomics?Study of the collection of microbial genomes or genome fragments through direct extraction

A culture independent technique that can provide meta-analytic level information about:

- Population structure- Genetic Diversity- Functional elements- Novel genetic material

A synthesis of a number of fields:- Molecular Genetics- Microbiology- Bioinformatics- Population Genetics- Computer science

Page 14: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Applying Metagenomics to Microbial Communities

Traditional methods of quantifying microbial diversity- Sample environments independently- Isolate each species through culturing

techniques- Characterize through biochemical &

sequencing techniques

Realistic?- Very labour intensive- Time estimate: 100’s of years- Incredibly expensive $$$$$$- Most organisms can not currently be be

isolated through culture

Page 15: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Microorganisms

Are Everywhere!

Page 16: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Scratching the surface

Page 17: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Tools of Metagenomics- Cloning techniques- PCR- Cutting edge sequencing techniques- Bioinformatics- Open source databases

- Genbank- Protein Database- TreeBASE

Page 18: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

GenBank - “An annotated collection of all publicly available nucleotide and amino acid sequences.” --NCBI

Growth of GenBank

Steep hill to climb!

Doubling every 18 months!

Founded: 1982

Shotgun Sequencing: 1995

Page 19: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Approaches to metagenomics?

Page 20: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Questions for metagenomics

Page 21: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Analysis

Two main approaches

(1) Sequence driven

- What genes are there

(2) Function driven

- What the genes do

Page 22: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sequence driven approaches- Data collection- Relies on conserved DNA- Phylogenic analysis- Used to measure biological

diversity

Functional driven approaches- Functional screening- Can identify novel genes- Relies on gene expression

- Proteins

Extract DNA

Metagenomic Process

Determine what the genes are

Determine what the genes do

Page 23: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sequence-Based Approach

Page 24: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sequencing

• One way to classify metagenomic fragments• Relies on nucleotide diversity analysis

– Discriminate between species

Seq. A GACTACGATCCGTATACGCACA--GGTTCAGAC|| ||||| ||||||||||||| |||||||||

Seq. B. GAATACGAGCCGTATACGCACACAGGTTCAGA

• Requires use of online databases– Ex: BLAST in GenBank

Compares “unknown to known”

Page 25: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Restrictions

Genomics Metagenomics

Whole Genome Sequenced

Know Species of Origination

Many DNA elements Identified

✔ ✖

✖✖

Page 26: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sequence Metagenomics

• Not necessary to determine species of origin

• Obtain large volume of data ~ Less redundant

• Fragment’s = 20bp – 700bp

• Assembled sequence reads don’t exceed 5000 bp

Page 27: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Random Shotgun Sequencing

1. Library construction

a. isolate DNA

b. fragment DNA

c. clone DNA

2. Random Sequencing Phase

a. Automated pyrosequencing DNA

VECTORACTGTTC...

3. Assembly

a. assemble sequences

b. close gaps

C. edit sequence4. Annotation

AndPublication

Missassemblies?

Page 28: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Random Sequencing

Objective– To estimate bacterial biodiversity ~ Species Richness– Identify 1000’s of prokaryotic, viral & eukaryotic species– Mass amounts of genomic data obtained– Does not depend on PCR– Put sequence in computer BLAST

– Studies in:• Sea water• Soil microbial mats• Dead whale carcass• Feces etc.• Organism level (microbiome)

– Microorganisms are Everywhere!

Page 29: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sequence Specific: Phylogenic

• Look at evolutionary relationships

Page 30: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Key Challenge

• Analysis based on evolutionarily conserved marker sequences

Want– High conservation across species– Slight & measureable changes over millions of

years

What to look for?

16S rRNA

Page 31: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

16S rRNA

Value• Vital for translation

– Essential• Short• Conserved within a species• Different between different species• Very slow mutation rate

Species Concept- Sequence based arbitrary - ~ Consensus: ~97% identity = species- Changing all the time

Page 32: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Screen for 16S rRNA sequence

• Extract DNA• Construct clone

library• Ex BAC cloning

• Screen using sequence specific primers

• When desired fragment is found• Sequence &

compare

Alignment represents a hypothesis

Page 33: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Function Based Approach

Page 34: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sample

Genomic DNA extraction

DNA sheared

Plasmid vector

Transformation

Functional screening

Functional Approach: Overview

Page 35: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

DNA Extraction and Isolation

Removal of contaminants

Aspects of Sample

Blender Centrifugation

Cell purification

Cell lysis

DNA Isolation & Restriction Digest

Page 36: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Cloning & Transformation

Random Fragmentscloned into expressionvectors

Expression vectorstransfected into broad expression hosts

Page 37: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Plasmid Expression vectors

Page 38: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Functional Screening

Page 39: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Treasure Hunting

Looking for novel genes

Biological tool boxes

Detergents- Proteases- Lipases- Esterase’s

Antibiotics- Novel antibiotics- Not synthetic- Mutate the antibiotics

Page 40: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

- Microbes = 90% of marine biomass

Marine Metagenomics

- 98% of Primary Producers in Sea

- .001-.1% are cultivable

Page 41: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Craig Venter

Ocean Exploration

Page 42: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

• Ocean exploration genome project in aims of assessing the microbial diversity of marine microorganisms

• 7.7 million sequence reads 44 different samples 41 sites

• Surface water at 320km intervals

Page 43: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.
Page 44: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Sample Collection

• Determine physical characteristics of sample site – Salinity, pH, depth, dissolved O2,

temperature– Filtered & storage

• Characterize Genetic Material– DNA Isolation– Constructing a Library– Automated Sequencing

• Metagenomic Analysis

Page 45: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Discoveries

• First twelve hours in Sargasso Sea– Tripled the number of known prokaryotes on earth

• Six million new genes – 1.3 million new genes + 50 000 species from single site

• Tens of thousands of new protein families

• 782 rhodopsin-like photoreceptors– Previously only found in Archaea

• Unexpected links between genetics & environment– Different rhodopsin proteins in open ocean vs coast line

Better understanding of key biological processes?– New ideas for alternative energy production?– Solutions to deal with climate change?

Page 46: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Components

Page 47: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Soil Metagenomics

Soil is very diverse- Nutrients- Moisture- pH- Organic- Oxygen- Temperature- Surface vs. Subsurface

Elusive- Poor recovery rate- Nuclease- Cell bias to lysis techniques- Different methods yield

dramatically different estimates of diversity and organism number

Page 48: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Soil Diversity

The number of prokaryotic species found in a single soil sample exceeds the number of known cultured prokaryotes

40X more diverse than marine

Soil is very diverse- Nutrients- Moisture- pH- Organic- Oxygen- temperature

Estimates:

Page 49: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Accounting for different extraction methods

Soil Requires more Complicated Approach

Variability in Extraction Methods

How much are we still missing?

Page 50: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Other Metagenomic Hot Spots…

Whale Carcass

Wastewater

Human Gut

Feces

Page 51: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Applications of Metagenomics

“The metagenome provides a potentially inexhaustible genetic resource for biomolecules of potential utility in a variety of industries”

Page 52: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Applications of Metagenomics- Metagenomics offers potential solutions to some of the most complex

medical, environmental, agricultural and economic challenges of today

- Biotechnological potential of uncultivated bacteria might be accessible by directly cloning DNA sequences retrieved from the environment

Information on why certain organisms

are unculturable

Culture these organisms

Discover novel Pathways

Page 53: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Industrial Applications

• Novel enzymes rare through culturing

• Novelty: Avoid infringing on a competitor’s intellectual rights• Bacillus Protease Novo

• Hundreds of variations with a single AA substituted

• Enzymes are vital for many different industries and their sales are estimated at $2.3 billion in 2003.

• Food applications, detergents, textiles, agriculture, pulp/paper and other chemicals

Page 54: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Industrial Applications

- Temperature

- pH

- Pressure

- Speed

- Turnover

Search for the “Ideal” bio-catalyst

Page 55: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Environmental Remediation

Environmental Contamination • Toxic metals• Fossil fuels• Chemicals• Xenobiotics

Microorganisms can interact withcontaminants

• Oxidize• Bind• Transform• Immobilize

Metagenomic searches forgenes & proteins involved

Picking on Alberta: Tar Sands

Page 56: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Agriculture– Detecting diseases in livestock, crops and other

products

– Soils rich in microbial communities.

– Communities very complex, poorly understood and their intimacy with crops means they are of economic importance• nutrient cycling, nitrogen fixation, sequestering

metals– Understanding soil composition Enhanced farming

Page 57: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Renewable Energy

• Typically derived from biomass sources

• Cellulose and other non-edible parts of plants transformed into biofuels

• Transform cellulose into usable ethanol, methanol• Ex: Cellulosic ethanol

• Produce energy sources such as hydrogen and methane

• Capture and store these by-products

• Metagenomics approaches for new, efficient ways of producing energy sources

Page 58: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Renewable Energy

Searching mircobial communities for biomolecules that can be used as energy

Looking in unlikely places

Cow Rumen:

- Cellulose digestion Methane

- Cleaner methane

- Metagenomic analysis for compounds involved in this reaction

Ex: Bio- Alcohols, Bio-Diseases, Oils, etc….

Page 59: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.
Page 60: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Human Health

– The microbiome: The relationship between the human body and the microbial communities will lead to new methods for diagnosing, treating and preventing diseases

– Being used to sequence the microbial communities from ~18 body sites from 250 individuals to determine if changes to the human microbiome can be correlated with human health.

– Drugs from microbe-derived compounds: Look for function• metagenomics searches

Page 61: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Metagenomic Approach to Microbiome

- Microbiome very influential to human health

- What do know know about the microbiome?

- Metagenomic approach tells us not very much

- Comparing microbiome of healty and non. Healthy

- Microbiome transplants?

Page 62: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Future Directions

• New enzymes, antibiotics, and other reagents identified

• More exotic habitats can be intently studied

• Can only progress as library technology progresses, including sequencing technology

• Improved bioinformatics will quicken library profile analysis

• Investigating ancient DNA remnants

• Discoveries such as phylogenic tags (rRNA genes, etc)

• Learning novel pathways will lead to knowledge about the current nonculturable bacteria to then learning to culture these systems

Page 63: Metagenomics of Microbial Communities Scott Sproule Cam MacMillan Daniel Hann.

Conclusion