Combinatorics of cis-regulatory elements in osmotic · PDF fileCombinatorics of cis-regulatory...
Transcript of Combinatorics of cis-regulatory elements in osmotic · PDF fileCombinatorics of cis-regulatory...
Combinatorics of cis-regulatory elements in osmotic stress response of
Ascomycetes. Aurian García-González"
Mentors: Sushmita Roy, Jay Konieczka, Dawn Thompson"Broad Institute- Summer Research Program 2011"
August 3, 2011"
~ 95 MYA ~ 300 MYA
Model system"
PNAS Vol. 100, No. 3, pp. 1056-1061, 2003
How do we study evolution of gene regulation?"
Phenotype: "KCl osmotic stress"
FEBS Letters 583 pp. 3959-3965, 2009.
Model system"How do we study evolution of gene regulation?"
FEBS Letters 583 pp. 3959-3965, 2009.
Nat Genet. 2008 Nov; 40(11):1300-6.
Phenotype: "
• Readily tractable experimentally. • Thoroughly described in S. cerevisiae as Hog1-directed.
KCl
Model system"
osmotic stress"
How do we study evolution of gene regulation?"
1) "Perform stress experiments "to obtain expression profiles"
2) "Cluster genes to identify "phenomenologs"
Most repressed"
Most induced" Clusters with similar
expression – regardless of gene content.
3) What are the cis-regulatory elements that regulate "these phenomenologs?"
5’ 3’ non-coding region coding region
Characterizing osmotic stress response"
Characterizing osmotic stress response "1) "Perform stress experiments
"to obtain expression profiles"
2) "Cluster genes to identify "phenomenologs"
Most repressed"
Most induced" Clusters with similar
expression – regardless of gene content.
Characterizing osmotic stress response "1) "Perform stress experiments
"to obtain expression profiles"
2) "Cluster genes to identify "phenomenologs"
Most repressed"
Most induced" Clusters with similar
expression – regardless of gene content.
5’ 3’ promoter coding region
3) What are the cis-regulatory elements present in these "phenomenologs?"
Characterizing osmotic stress response "1) "Perform stress experiments
"to obtain expression profiles"
2) "Cluster genes to identify "phenomenologs"
Most repressed"
Most induced" Clusters with similar
expression – regardless of gene content.
3) What are the cis-regulatory elements present in these "phenomenologs?"
5’ 3’ coding region single motif
Characterizing osmotic stress response "1) "Perform stress experiments
"to obtain expression profiles"
2) "Cluster genes to identify "phenomenologs"
Most repressed"
Most induced" Clusters with similar
expression – regardless of gene content.
5’ 3’ coding region multiple motifs
3) What are the cis-regulatory elements present in these "phenomenologs?"
Use ModuleDigger to find CRMs"
Uses motif data for genes in a cluster and computes hierarchical scores for modules."
ModuleDigger:"
BMC Bioinformatics 2009, 10(Suppl I):S30.
Use ModuleDigger to find CRMs"
BMC Bioinformatics 2009, 10(Suppl I):S30.
Uses motif data for genes in a cluster and computes hierarchical scores for modules."
ModuleDigger:"
Cis-Regulatory Module!
Use ModuleDigger to find CRMs"
BMC Bioinformatics 2009, 10(Suppl I):S30.
Uses motif data for genes in a cluster and computes hierarchical scores for modules."
ModuleDigger:"
…"
Studying combinations of motifs in CRMs
0 1 1
1 0 1
1 1 0
Are they conserved across all species?"
?"
…"
Studying combinations of motifs in CRMs
0 1 1
1 0 1
1 1 0
Are they conserved across all species?"
?"NO
Most induced" Most induced " Most induced"
Most repressed" Most repressed" Most repressed"
RCS1, SIP4
RCS1, SIP4 MBP1, TEC1 MBP1, PDR3
MIG1, MSN2/MSN4 MIG1, ADR1 MSN2/MSN4, ADR1
RCS1, SIP4 CRZ1, SIP4
RCS1, SIP4
Motif combinations are conserved in clades"
Pre-WGD
WGD: whole genome duplication
Most induced" Most induced"
Most repressed" Most repressed"
RCS1, SIP4
RCS1, SIP4 MBP1, TEC1 MBP1, PDR3
RCS1, SIP4 CRZ1, SIP4
RCS1, SIP4
Most induced "
MIG1, MSN2/MSN4 MIG1, ADR1 MSN2/MSN4, ADR1
Most repressed"
Pre-WGD
WGD: whole genome duplication
Post-WGD
Motif combinations are conserved in clades"
Most induced" Most induced " Most induced"
Most repressed" Most repressed" Most repressed"
RCS1, SIP4
RCS1, SIP4 MBP1, TEC1 MBP1, PDR3
MIG1, MSN2/MSN4 MIG1, ADR1 MSN2/MSN4, ADR1
RCS1, SIP4 CRZ1, SIP4
RCS1, SIP4
Pathogens Pre-WGD
WGD: whole genome duplication
Post-WGD
Motif combinations are conserved in clades"
RCS1, SIP4
MBP1, TEC1 MBP1, PDR3
MIG1, MSN2/MSN4 MIG1, ADR1 MSN2/MSN4, ADR1
RCS1, SIP4 CRZ1, SIP4
Stress Glucose Metabolism Cell-cycle progression Metabolite transport
Pathogens Pre-WGD Post-WGD
Stress-related regulatory elements co-occur with glucose-related elements"
Conclusions"• Very few motif pairs were species-
specific. Combinatorics are conserved in at least two species."
Conclusions"• Very few motif pairs were species-
specific. Combinatorics are conserved in at least two species."
• The most induced clusters showed more motif pairs than the repressed clusters."
Conclusions"• Very few motif pairs were species-
specific. Combinatorics are conserved in at least two species."
• The most induced clusters showed more motif pairs than the repressed clusters."
• Glucose responsive elements occur with stress elements."
Future work"• Explore higher order combinations. "• Develop more sophisticated algorithms
to discover significant CRMs."
Future work"• Explore higher order combinations. "• Develop more sophisticated algorithms
to discover significant CRMs."• Correlate trans interactions with
observed phenotype."