A Simple Approach to Study Designs in Complex Biochemical ...

35
Indian Institute of Technology, New Delhi, 31 October 2008 Indian Academy of Sciences, 74 th Annual Meeting, New Delhi Oct. 31 – Nov. 2, 2008 SOMDATTA SINHA Centre for Cellular and Molecular Biology (CSIR) Hyderabad A Simple Approach to Study A Simple Approach to Study Designs in Complex Designs in Complex Biochemical Pathways Biochemical Pathways Symposium on “Complexity & Computation in the Natural Sciences”

Transcript of A Simple Approach to Study Designs in Complex Biochemical ...

Page 1: A Simple Approach to Study Designs in Complex Biochemical ...

Indian Institute of Technology,New Delhi, 31 October 2008

Indian Academy of Sciences,74th Annual Meeting, New Delhi

Oct. 31 – Nov. 2, 2008

SOMDATTA SINHACentre for Cellular and Molecular Biology (CSIR)

Hyderabad

A Simple Approach to StudyA Simple Approach to Study

Designs in ComplexDesigns in Complex

Biochemical PathwaysBiochemical Pathways

Symposium on“Complexity & Computation in the Natural Sciences”

Page 2: A Simple Approach to Study Designs in Complex Biochemical ...

(Machta, Complexity, 2006: courtesy NOAA and NASA)

Hurricane approaching Florida

The Spiral Galaxy M51

LIVING SYSTEMS ARE COMPLEX SYSTEMSLIVING SYSTEMS ARE COMPLEX SYSTEMS

Complexity arises from selective and nonlinear interactions offunctionally diverse components to produce a coherent structure/function

Page 3: A Simple Approach to Study Designs in Complex Biochemical ...

gcgtacgtacgtagagtgctagtctagtcgtagcgccgtagtcgatcgtgtgggtagtagctgatatgatgcgaggtaggggataggatagcaacagatgagcggatgctgagtgcagtggcatgcgatgtcgatgatagcggtaggtagacttcgcgcataaagctgcgcgagatgattgcaaagragttagatgagctgatgctagaggtcagtgactgatgatcgatgcatgcatggatgatgcagctgatcgatgtagatgcaataagtcgatgatcgatgatgatgctagatgatagctagatgtgatcgatggtaggtaggatggtaggtaaattgatagatgctagatcgtaggtagtagctagatgcagggataaacacacggaggcgagtgatcggtaccgggctgaggtgttagctaatgatgagtacgtatgaggcaggatgagtgacccgatgaggctagatgcgatggatggatcgatgatcgatgcatggtgatgcgatgctagatgatgtgtgtcagtaagtaagcgatgcggctgctgagagcgtaggcccgagaggagagatgtaggaggaaggtttgatggtagttgtagatgattgtgtagttgtagctgatagtgatgatcgtag …….

Our genome encodes an enormous amount ofinformation about our beings.

our looksour sizehow our bodies workour healthour behaviorswho we are

Information plus regulationbio-complexity

proteins

30 000+genes

complexes

pathways

whole cell

community

organs

biological data

The genomic information andits regulation is organised atmultiple inter-connectedmodular and hierarchicallevels of increasingcomplexity

Page 4: A Simple Approach to Study Designs in Complex Biochemical ...

Bacterium E. coli divide in 20 min

Yeast cell cycle - 90 -120 min

Rapidly dividingmammalian cell cycle ~ 24 hours

Large cells -nerve cells in giraffe’s neck ~ 3 m

(9.7 ft) in length.

Smallest cell -Mycoplasma ~ 10-7 cm diameter

Cell is the basic unit of life

Living systems are made up of cells

– single or multi-cellular

Human RBC(7µm)

Bacteria(E.coli 3µm) Muscle Cell (50µm)

Acetabularia(3cm)

Amoeba (330µm)

Page 5: A Simple Approach to Study Designs in Complex Biochemical ...

Individual cells need to have mechanisms tomonitor their environmental composition.

They need to discern and synchronize their responsesaccording to variations in external and internal

conditions.

To achieve this level of coordination, metabolites andchemical compounds are used by the cell asmessages to know the composition of these

environments.

AND

it requires computation and information transferacross breadth and depth of processes

at all levels of organisation

Page 6: A Simple Approach to Study Designs in Complex Biochemical ...

http://www.expasy.ch/cgi-bin/show_thumbnails.pl

Cellular functions are controlled by networks of biochemical reactions

Cellular behaviour is the emergent property of many biochemical reactionsnetworked through feedback/feed-forward processes

Negative Feedback ensures stability and conservation of energy bydesensitizing the system to perturbations - are naturally selected to be the

most common form of regulation in pathways

Positive Feedback is potentially destabilizing, and primarily employed forexcitable, amplification, and switching processes.

Intricate networks of inter-connected chemicalreactions between molecular species in the cell.

Page 7: A Simple Approach to Study Designs in Complex Biochemical ...

Complex network of biochemical reactions in cells co-ordinate and control cellular functions

two interacting sets

GENETIC REACTIONS

Gene induction,repression,replication,

transcription

METABOLICREACTIONS

Conversion of substratemolecules by enzymes,

enzyme inhibition oractivation

Gene & Transcription FactorGene & Transcription Factornetworksnetworks

Metabolic reactionMetabolic reactionnetworksnetworks

Page 8: A Simple Approach to Study Designs in Complex Biochemical ...

Part of the network of direct transcriptional interactions in the E.coli data set, represented using network motifs

Network Motifs: Simple Building Blocks of Complex Networks. Science, (2002)

Page 9: A Simple Approach to Study Designs in Complex Biochemical ...

http://regulondb.ccg.unam.mx/; byF. Sanchez and E. Diaz

Escherichia coli transcriptional regulatory network for sensing the extra-cellular andintracellular environment

TFs:Green: extracellular sensingBlue: internal sensingPink: sensing intracellular conditions using endogenous signals synthesized inside the cellLight orange: unknown mechanisms to modulate their activities.

Lines: Green-activation; Red-repression;Blue: dual (activation & repression)Loops in the TFs: auto-regulation positive, negative or dual.Yellow: genes do not code for TF productsAbbreviations:S, substrate; E, enzyme and P, product.

Global TFs

Page 10: A Simple Approach to Study Designs in Complex Biochemical ...

Srinivasan & Morowitz, Biological Bulletin, 2008

Core Metabolic Network of reductive chemoautotrophs(bacteria that fix carbon by the reductive TCA cycle)

(Minimal metabolome - 287 metabolites)

Page 11: A Simple Approach to Study Designs in Complex Biochemical ...

How can one do predictive studies of

these complex information processingthese complex information processingunits/pathwaysunits/pathways

&&

understand the role of pathway designs inunderstand the role of pathway designs intheir function ?their function ?

“Multiple steps in every pathway”&

“Regulation - Positive/Negative”

The dynamical consequences of these designs canbe quite opposite in pathway functions.

Page 12: A Simple Approach to Study Designs in Complex Biochemical ...

MODELLING BIOCHEMICAL PATHWAYSThree complementary approaches

Construction & analysis offunctionally related

pathways from large scalegene expression and

protein interaction datausing network theory

Model existing pathways based oninformation derived from –

• Genome sequences

• Protein sequences

• Biochemical & Genetic information

REVERSE ENGINEERING LARGE NETWORKS

FORWARD ENGINEERING

All designs that are not physicallyAll designs that are not physicallyforbidden are realizable,forbidden are realizable,

but not all realizable designs arebut not all realizable designs arefunctionally effectivefunctionally effective

(in relation to context and constraintsof the system and environment).

Synthetic oscillatory circuit;Toggle switch in bacteria;

Amplifiers of gene expression.

‘Rational Network Design’

Artificial genetic and enzymaticnetworks with specific

properties constructed basedon mathematical models

Page 13: A Simple Approach to Study Designs in Complex Biochemical ...

FLOW

of

INFORMATION

Transcription rate - ~ 1,000 nucleotides/minuteTranslation rate - ~ 900 amino acids/minute

Production of the proteinto the binding of dimer -~ 3 min

THE INHERENT DELAY IN THE

TRANSCRIPTION-TRANSLATION PROCESS

Starting from gene expression to cellular function involves a sequence ofreactions over a period of time - a common feature

Page 14: A Simple Approach to Study Designs in Complex Biochemical ...

Question

A generic feature in all intracellular biochemical processes is the timerequired to complete the whole sequence of reactions to yield anyobservable quantity - widespread presence of time delay in biological functions.

Theoretically time delay is known to be a source of instability, and hasbeen attributed to lead to oscillations or transient dynamics in several biological functions.

Negative feedback loops, common in biochemical pathways, are known to provide stability, and withstand considerable variations and random perturbations of biochemical parameters.

(Savageau, 1974; Becskei and Serrano, 2000).

Interaction of these two opposing factors- instability and homeostasis -

are common features in intracellular processes

Effect of these divergent forces in the dynamics of gene expression?

Page 15: A Simple Approach to Study Designs in Complex Biochemical ...

Forward Engineering of gene circuitsForward Engineering of gene circuits(Rational network design)(Rational network design)

Construction of desired network with specific properties predictedfrom mathematical models using knowledge from biochemistry,

molecular biology, and genetics.

“It is obvious from analysis of these [bacterial genetic regulatory]mechanisms that their known elements could be connected into a widevariety of ‘circuits’ endowed with any desired degree of stability”

Boolean/Logical Circuits in Biology :Organisms take decisions based on input signals and give a binary (0/1)

response in some cases.

Jacob & Monod Model of theprokaryotic operon (1961)

Gene APromoter Operator

RNAP

Repressor

Inducer

Page 16: A Simple Approach to Study Designs in Complex Biochemical ...

“ Rational Network Design ” can -

a) engineer new cellular behaviour, and

b) improve understanding of naturally occurring networks.

Genetic Circuit Engineering Paradigm

Design - Simulate - Implement & Test

A basic assumption underlying such ‘synthetic’ biology

The properties of individual genetic components can be used

to understand and quantitatively predict circuit-level

behaviors.

Page 17: A Simple Approach to Study Designs in Complex Biochemical ...

v Designed simple negatively auto-regulated transcriptional modules consisting of a basic regulator and transcriptional repressor - with and without delay in repression - and their controls.

Constructed gene circuits in E. coli.

v Developed mathematical models of a simple negativefeedback pathway based on the design of the negatively auto-regulated gene circuit -

deterministic and stochastic

v Compared the gene expression dynamics with theoretical predictions.

ApproachesApproaches

Page 18: A Simple Approach to Study Designs in Complex Biochemical ...

Designing negatively auto-regulated gene circuits -with and without delay in repression

S1 S2(-)

Basic Circuit

Delay Circuit

S1 S’ S2(-)

The presence of one or more genes (“Delay element”) increase the length ofthe transcript, thereby introducing a delay in establishment of negative

feedback by the repressor in Delay circuit compared to the Basic circuit.

Page 19: A Simple Approach to Study Designs in Complex Biochemical ...

Design of negatively auto-regulated gene circuits

(a) Basic Circuit (TG) (b) Delay (C2TG)

Control Delay circuit (TC2G): Position of repressor same as in TG, but position ofreporter is as in C2TG. This is identical to the Delay circuit (C2TG) in length, number ofcistrons, and position of the Reporter gene, except for the position of the repressor, TetR.

TG: tetR gene and reporter gene(gfp) after the promoter-operator

unit (pLtet-01).

C2TG: two copies of cI gene from λ phagebefore the repressor gene - production of the

repressor is delayed.

Page 20: A Simple Approach to Study Designs in Complex Biochemical ...

Deterministicmodel

gt = total no. of promoters;

α1, α2, α3 = degradation rates

β1, β2 = transcription & translation rates

K1, k2 = promoter-repressor complexreaction rate

τ1, τ2 = time delays on the production of pand f from the common mRNA;

m = mRNA

p = repressor

f = reporter

g = freepromoters

ϕ - degradation products of mRNA,TetR and GFP

Stochastic modelmolecular reactions

(Gillespie,1977;Bratsun et al.,2005)

Page 21: A Simple Approach to Study Designs in Complex Biochemical ...

Basic circuit (circles with dashed lines)Delay circuit (squares with solid lines)

Kinetics of TetR

Kinetics of GFP

Det

erm

inis

tic

mo

del

Sto

chastic m

od

el(average of 100 sim

ulations).

Page 22: A Simple Approach to Study Designs in Complex Biochemical ...

The circuit dynamics is stable in all conditions

considered, but with increasing delay time, the

system shows damped oscillations, and the steady

state is reached with progressively larger excursion

in the phase plane.

Our theoretical analysis predicts that the negatively

auto-regulated pathway, as represented in these

models, can show a transient overshoot in gene

expression and protein production due to the delay

in the kinetics of the repression process.

Page 23: A Simple Approach to Study Designs in Complex Biochemical ...

Basic (circles) and the Delay (triangles) circuits upon induction (25 ng/ml)in four independent experiments.

(a) Normalised fluorescence versus time (minute);

(b) Normalised fluorescence versus growth (OD600).

Error bars (one standard deviation) are for both fluorescence and growth.

Experimental kinetics of GFP

Delay circuit shows a large overshoot in gene expression.

Page 24: A Simple Approach to Study Designs in Complex Biochemical ...

Gene expression in population of E. coli cells

with the gene circuits.

Model prediction

and

Experimental measurements.

Page 25: A Simple Approach to Study Designs in Complex Biochemical ...

Frequency distributions of GFP in cell populations:(a) Basic (TG), and (b) Delay (C2TG) circuits at different time intervals.

Intra-population heterogeneity in gene expressionExperimental: FACS analysis

The population of cells with Delay circuit shows –

(i) A significantly higher fluorescence in time, which later return to lower levels;

(ii) Presence of bimodality in fluorescence distribution; and,

(iii) Broader distribution of fluorescence in cell population - larger heterogeneity ingene expression among the individual cells within a population

Page 26: A Simple Approach to Study Designs in Complex Biochemical ...

Intra-population heterogeneity in gene expressionTheoretical: GFP expression at different time points in model 1000 cells with both

circuits having plasmid copy number variation (50±10, normally distributed)

The population of cells with Delay circuit shows –

(i) A significantly higher fluorescence in time, which later return to lower levels;

(ii) No bimodality in fluorescence distribution; and,

(iii) Broader distribution of fluorescence in cell population - larger heterogeneity ingene expression among the individual cells within a population

Frequency distributions of GFP molecules in cell populations:(a) Basic (TG), and (b) Delay (C2TG) circuits at different time intervals.

Page 27: A Simple Approach to Study Designs in Complex Biochemical ...

Contour plots of GFP fluorescence distribution in cell populations.At 1, 2 and 5 hrs after induction with different inducer concentrations -

(i) 25ng/ml, (ii) 50 ng/ml, and (iii) 75ng/ml, of Doxycyline.

BimodalityTG C2TG

The presence of bimodality in Delay circuit cell populations, induced at 25ng/ml,is a consequence of, but not an inherent property of, the delay element in the

circuit. Removal of this low-expressing fraction of cells by gating shows that C2TGcontinues to have a greater spread than TG.

25ng/ml

50ng/ml

75ng/ml

Page 28: A Simple Approach to Study Designs in Complex Biochemical ...

Common measures of comparing variability (noise) in a system -

Coefficient of Variation CV = (standard deviation/mean)*100 and Fano Factor FF = variance/mean.

Basic (TG - dashed line + circle)

Delay (C2TG-solid line + square)

Inset:

Changes in Fano Factor (FF) forboth the circuits.

• During the time of the build-up of the overshoot (till 90 minutes):

CV of the Delay population > Basic population

• The initial decreasing trend in CV in both the circuits indicates reduction in theintrinsic noise levels with time due to the establishment of the repression.

• Fano Factor shows continuing difference between the two circuits. Delay circuitexhibits greater variability compared to the Basic circuit.

Heterogeneity of gene expression in a population of cells

Experiments

Page 29: A Simple Approach to Study Designs in Complex Biochemical ...

Model Delay and Basic circuits -No significant difference in their CV over time except at an

early time point.

Prediction is not consistent with the experimental results ?

Page 30: A Simple Approach to Study Designs in Complex Biochemical ...

Coefficient of Variation for both Basic (dashed lines) and Delay (solid lines) circuits: a) Experimental populations for inducer concentrations

(25 ng/ml - circles, 75ng/ml - triangles) till 60 min from three experiments;b) Theoretical simulation (Basic: dashed line and Delay: solid line).

This prediction is consistent with experimental results.

The experimental Delay circuit at 75ng/ml induction shows similardifference in CV as is seen in the model circuits.

The hypothesis that delay in repression is the primary factor forinducing increased inter-cellular heterogeneity in gene expression in

a population is shown theoretically and experimentally.

Page 31: A Simple Approach to Study Designs in Complex Biochemical ...

CONCLUSIONS

Robustness of the results

Experimental methodology used involved –

1) Population approach (Fluorimetry) – observations on ~ 109 cells

2) Cell level (FACS) – observation on ~ 104 cells

Theoretical model only highlighting the delay in repression – calculations on ~ 103 cells

(no consideration of real factors, e.g., cell size changes, growth, folding delaysof GFP, nonlinearities involved in degradation, etc)

All three approaches show“Overshoot” and “Heterogeneity” in gene expression

Page 32: A Simple Approach to Study Designs in Complex Biochemical ...

The generic origin of delay in biochemical pathways impliesthat there is a high likelihood that the two properties -

transient overshoot & generation of heterogeneity ingene expression in cell population -play important roles in gene regulation.

• The overshoot allows for gene products being available in large amount for multi-step pathways to function,

• It can also act as a dominant source of large deterministic variability paving way to increase the phenotypic diversity in a population ofcells before the negative regulation sets in.

CONCLUSIONS

Our theoretical and experimental results provide importantclues and give possible rationale for delayed feedbacks to

be such a generic feature in gene organisations in cells.

Page 33: A Simple Approach to Study Designs in Complex Biochemical ...

Development of Gene Circuits

Develop integrated computational infrastructure forComputer Aided Design (CAD) of genetic circuitsSimulation and dynamic analysis

Build increasingly complex genetic circuits using well-characterized parts

The Circuit Engineering Vision

Develop a standard library of interoperable “parts” that correspondsto various control functions (www.parts.mit.edu)

Page 34: A Simple Approach to Study Designs in Complex Biochemical ...

We have the “parts list”

How do these “parts” interact as a “whole”, andhow does this system function to create an organism?

Ultimate goal is to link

behaviour of cells, organisms, and populations

to the information encoded in the genome

“Systems Biology”is about identifying, characterizing, and integrating the

parts-lists of complex biological systemsto find the underlying design and working principles of the

biological computational units

Page 35: A Simple Approach to Study Designs in Complex Biochemical ...

Dr. R. Maithreye

Dr. R. R. Sarkar

Dr. Veena K. Parnaik

Dr. Gopal Pande

Council for Scientific & Industrial Research (CSIR)

Department of Biotechnology

Acknowledgements