Computers are to biology what mathematics is to physics ...

Page 1: Computers are to biology what mathematics is to physics ...

“Computers are to biology what mathematics is to physics” – Harold Morowitz

Page 2: Computers are to biology what mathematics is to physics ...

Dowell Lab Overview

Transcriptomics Individual Differences

Page 3: Computers are to biology what mathematics is to physics ...

Techniques of the Trade

Page 4: Computers are to biology what mathematics is to physics ...

Techniques: Comparative Genomics

Page 6: Computers are to biology what mathematics is to physics ...

Techniques: Systems Biology

Page 8: Computers are to biology what mathematics is to physics ...

Techniques: Machine Learning

Page 9: Computers are to biology what mathematics is to physics ...

Transcriptomics

• Unbiased detection and characterization of transcripts (Today)

• Transcriptional Regulation (Tues)

• Noncoding Transcripts (Tues)

Page 10: Computers are to biology what mathematics is to physics ...

Conventional Microarray Design

Page 11: Computers are to biology what mathematics is to physics ...

Example data from tiled microarray of two yeast strains

Page 12: Computers are to biology what mathematics is to physics ...

Normalization critical for multiple array comparison

$Y_{ij} \leftarrow Y_{ij} - b_j$

$y_{ij} = Q(Y_{ij})$
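
The two steps on this slide can be sketched in a few lines. This is a minimal illustration, not the lab's pipeline: it assumes that $b_j$ is a single background estimate per array $j$ and that $Q$ denotes quantile normalization across arrays, neither of which the slide states explicitly. The function names and the toy intensities are made up for the example.

```python
import numpy as np

def background_subtract(Y, b):
    """Subtract a per-array background b[j] from raw intensities Y[i, j]."""
    return Y - np.asarray(b, dtype=float)[np.newaxis, :]

def quantile_normalize(Y):
    """Map every array (column) onto the same reference distribution.

    The reference is the mean, across arrays, of the k-th smallest value.
    Assumption: this is what Q stands for on the slide.
    """
    order = np.argsort(Y, axis=0)                 # probe order within each array
    ranks = np.argsort(order, axis=0)             # rank of each probe in its array
    reference = np.sort(Y, axis=0).mean(axis=1)   # shared quantiles
    return reference[ranks]

# Toy data: 4 probes (rows) on 3 arrays (columns); values are invented.
Y = np.array([[5.0, 4.0, 3.0],
              [2.0, 1.0, 4.0],
              [3.0, 3.0, 6.0],
              [4.0, 2.0, 8.0]])
b = np.array([1.0, 0.0, 2.0])   # hypothetical per-array background levels

y = quantile_normalize(background_subtract(Y, b))
print(y)
```

After this step every column holds the same set of values, so differences between arrays reflect probe rank rather than array-wide intensity shifts.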

Page 14: Computers are to biology what mathematics is to physics ...

Probe Affinity Correction and Gene Averaging

Page 16: Computers are to biology what mathematics is to physics ...

Identify segment boundaries by dynamic programming algorithm

$S(a, b) \to S(a, k) + L(k + 1, b) \mid L(a, b)$

$L(a, b) \to f_B(a, b) \mid f_T(a, b)$
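
Read as a recurrence, the rules above say that the best segmentation of probes a..b is either a single segment, or the best segmentation of a..k followed by one last segment k+1..b, and that each segment is scored as background or as transcript, whichever fits better. The sketch below implements that recurrence with a fixed left end. The slide does not define the fit functions, so the scores here are placeholders: a Gaussian log-likelihood around a fixed background level for f_B, the same around the segment's own mean for f_T, plus hypothetical penalties `seg_penalty` and `fit_penalty` to discourage over-segmentation.

```python
import numpy as np

def segment(y, mu_bg=0.0, sigma=1.0, seg_penalty=3.0, fit_penalty=2.0):
    """Return a list of (start, end, label) segments, label 'B' or 'T'."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    # Prefix sums give any segment's sum and sum of squares in O(1).
    c1 = np.concatenate(([0.0], np.cumsum(y)))
    c2 = np.concatenate(([0.0], np.cumsum(y * y)))

    def f_B(a, b):
        # Background fit (assumed form): flat line at mu_bg.
        m, s1, s2 = b - a + 1, c1[b + 1] - c1[a], c2[b + 1] - c2[a]
        sse = s2 - 2.0 * mu_bg * s1 + m * mu_bg ** 2
        return -sse / (2.0 * sigma ** 2) - seg_penalty

    def f_T(a, b):
        # Transcript fit (assumed form): flat line at the segment mean,
        # charged an extra penalty for the fitted level.
        m, s1, s2 = b - a + 1, c1[b + 1] - c1[a], c2[b + 1] - c2[a]
        sse = s2 - s1 * s1 / m
        return -sse / (2.0 * sigma ** 2) - seg_penalty - fit_penalty

    def L(a, b):
        fb, ft = f_B(a, b), f_T(a, b)
        return (fb, "B") if fb >= ft else (ft, "T")

    best = [0.0] * n    # best[b] plays the role of S(0, b)
    back = [None] * n   # (start of last segment, its label)
    for b in range(n):
        best[b], label = L(0, b)            # case S -> L(a, b)
        back[b] = (0, label)
        for k in range(b):                  # case S -> S(a, k) + L(k + 1, b)
            score, label = L(k + 1, b)
            if best[k] + score > best[b]:
                best[b], back[b] = best[k] + score, (k + 1, label)

    segments, b = [], n - 1
    while b >= 0:                           # trace back the chosen boundaries
        a, label = back[b]
        segments.append((a, b, label))
        b = a - 1
    return segments[::-1]

# Toy signal: background, a raised block, background again (invented values).
y = [0.1, -0.2, 0.0, 0.1, 3.0, 3.2, 2.9, 3.1, 0.0, -0.1, 0.2, 0.1]
print(segment(y))
```

The double loop visits every candidate last segment once, so the sketch runs in time quadratic in the number of probes; a real tiling array would need a cap on segment length or a coarser grid of candidate boundaries.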

Page 23: Computers are to biology what mathematics is to physics ...

Random priming versus poly-dT priming

Page 24: Computers are to biology what mathematics is to physics ...

What regions are differentially transcribed?

$y_{ij} = \alpha_i + g_{ij} + \varepsilon_{ij}$

$g_{ij} \Rightarrow g_{sj} + \beta \vec{x}_i$

Page 25: Computers are to biology what mathematics is to physics ...

Estimate posterior probability that the samples are different

$y_{ij} = g_{sj} + \beta \vec{x}_i + \alpha_i + \varepsilon_{ij}$

$P(E_{sj} > E_{sj'} \mid y, \Theta)$
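
One way to make the posterior comparison concrete is the simplified sketch below. It is an illustration under several assumptions that the slides do not confirm: the parameters Θ (β, the probe effects α_i, and a noise standard deviation σ) are treated as known, x_i is taken to be a single scalar covariate per probe, E_sj is taken to be the segment level g_sj, and a flat prior with Gaussian noise gives a normal posterior for each sample. All function names and numbers are hypothetical.

```python
import math
import numpy as np

def segment_posterior(y, x, alpha, beta, sigma):
    """Posterior mean and variance of the segment level g_sj.

    Assumes y_i = g + beta * x_i + alpha_i + noise, noise ~ N(0, sigma^2),
    with beta, alpha, sigma known and a flat prior on g.
    """
    resid = np.asarray(y) - beta * np.asarray(x) - np.asarray(alpha)
    return resid.mean(), sigma ** 2 / resid.size

def prob_greater(post_a, post_b):
    """P(E_a > E_b) for two independent normal posteriors (mean, variance)."""
    (mean_a, var_a), (mean_b, var_b) = post_a, post_b
    z = (mean_a - mean_b) / math.sqrt(var_a + var_b)
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))   # standard normal CDF

# Simulated segment of 10 probes measured in two samples (invented values).
rng = np.random.default_rng(0)
x = np.arange(10.0)                      # hypothetical probe covariate
alpha = rng.normal(0.0, 0.2, size=10)    # hypothetical probe effects
beta, sigma = -0.05, 0.3
y1 = 2.0 + beta * x + alpha + rng.normal(0.0, sigma, size=10)
y2 = 1.0 + beta * x + alpha + rng.normal(0.0, sigma, size=10)

post1 = segment_posterior(y1, x, alpha, beta, sigma)
post2 = segment_posterior(y2, x, alpha, beta, sigma)
p = prob_greater(post1, post2)
print(p)
```

Because the probe effects and the slope are shared between samples, they cancel in the comparison, which is the practical reason for modelling them explicitly rather than comparing raw intensities.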

Page 26: Computers are to biology what mathematics is to physics ...

Example region with differential expression

Page 27: Computers are to biology what mathematics is to physics ...

Differential expression hints at transcript interactions

Page 28: Computers are to biology what mathematics is to physics ...

In general, antisense transcripts consistent with interference

Page 29: Computers are to biology what mathematics is to physics ...

A screen for same strand overlapping transcription

Danford et al. (RECOMB 2010)

Page 30: Computers are to biology what mathematics is to physics ...

SER3: Canonical same strand transcription

Martens et al. Nature (2004)

Page 31: Computers are to biology what mathematics is to physics ...

Variety of transcriptional architectures detected

Page 32: Computers are to biology what mathematics is to physics ...

Verifying same strand transcription

Page 33: Computers are to biology what mathematics is to physics ...

RNA-Seq: Nucleotide resolution

Wang et al. Nature Reviews Genetics 10, 57-63 (2009)

Page 34: Computers are to biology what mathematics is to physics ...

GRO-Seq: Nuclear Run-On Sequencing

Supplementary figures S1-S8, Table S1

Figure S1: Overview of the GRO-seq method. Polymerases are allowed to run-on ~100 bases in isolated nuclei in the presence of sarkosyl and Br-UTP. The RNA is then base hydrolyzed to ~100 bases and bound to agarose beads that are coated with an α-BrdUTP antibody. 5’-7meG caps are then removed, and the ends of the RNA are prepared for adapter ligations. Illumina small RNA adapters are added to the 5’ end followed by the 3’ end, with an additional round of immuno-enrichment after each adapter ligation. The RNAs are then reverse transcribed, amplified, and PAGE purified prior to sequencing from the 5’ end on the Illumina 1G genome analyzer. See SOM for detailed description and protocol.

Core, Waterfall, Lis. Science 322:1845-1848 (2008)

Page 35: Computers are to biology what mathematics is to physics ...

Clustering: Tissue Expression as Bag of “Words”

Gerber et al. PLoS Comp. Biol. (2007)

Page 36: Computers are to biology what mathematics is to physics ...

Hierarchical Dirichlet Clustering

Page 37: Computers are to biology what mathematics is to physics ...

Expression largely conserved across species

Page 38: Computers are to biology what mathematics is to physics ...

Related Projects in the Lab ...

• To what extent are patterns of transcription conserved?

• How does transcription respond to perturbation by a drug (Nutlin)?

• Over a developmental time course, how and when are transcripts made?

Page 39: Computers are to biology what mathematics is to physics ...

Acknowledgments

MIT

• David Gifford

• Timothy Danford

• Alex Rolfe

Whitehead Institute

• Gerry Fink

• Stacie Bumgarner

• Sudeep Agarwala

• An Jansen

• Paula Grisafi

• Doug Bernstein

ABI Sequencing and Initial Assembly by Broad Institute

Page 40: Computers are to biology what mathematics is to physics ...

Transcriptomics

• Unbiased detection and characterization of transcripts

• Transcriptional Regulation (Tues)

• Noncoding Transcripts (Tues)