HUMIES presentation: Evolutionary design of energy functions for protein structure prediction

22
Evolutionary design of energy functions for protein structure prediction Natalio Krasnogor . k@ a o u n c k t . s t. nx c Pawel Widera, Jonathan Garibaldi 7th Annual HUMIES Awards 2010-07-09

description

 

Transcript of HUMIES presentation: Evolutionary design of energy functions for protein structure prediction

Page 1: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Evolutionary design of energy functionsfor protein structure prediction

Natalio Krasnogor.k@ ao unc kt .s t .nx c

Paweł Widera, Jonathan Garibaldi

7th Annual HUMIES Awards

2010-07-09

Page 2: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Protein structure predictionFrom 1D sequence to 3D structure

LFSKELRCMMYGFGDDQNPYTESVDILEDLVIEFITEMTHKAMSIFSEEQLNRYEMYRRSAFPKAAIKRLIQSITGTSVSQNVVIAMSGISKVFVGEVVEEALDVCEKWGEMPPLQPKHMREAVRRLKSKGQIP

Protein basics20 amino acid alphabetsequence encodesstructurestructure determinesactivityratio structures

sequences = 0.2%

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 2 / 15

Page 3: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

The algorithm of foldingAnfinsen’s thermodynamic hypothesis [Anfinsen, 1973]

[Dill and Chan, 1997]

Refolding experimentfolds to the samenative statenative state isenergetically stable

Energy funnelroll down freeenergy hillavoid local minimatraps

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 3 / 15

Page 4: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

The two aspects of foldingTowards practical prediction

[Dill and Chan, 1997]

Energy landscapeall-atom force fieldstatistical potential

Search methodrandom walkstructureoptimisation

[email protected] peta FLOPS

10 000 CPU daysfor 10µs of folding

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 4 / 15

Page 5: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

The two aspects of foldingTowards practical prediction

[Dill and Chan, 1997]

Energy landscapeall-atom force fieldstatistical potential

Search methodrandom walkstructureoptimisation

[email protected] peta FLOPS

10 000 CPU daysfor 10µs of folding

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 4 / 15

Page 6: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Community wide prediction experimentCritical Assessment of techniques for protein Structure Prediction

CASP factsbiannual competition started in 1994parallel prediction and experimental verificationmodel assessment by human experts

9th edition of CASP150 human groups140 server groups

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 5 / 15

Page 7: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How to find good quality models?Correlation between energy and distance to the native structure

distance

energy

native state

Requirementsenergy reflectsdistancedistance reflectssimilarity

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 6 / 15

Page 8: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How the best of CASP do it?Energy of models vs. distance to a target structure

Similarity measure

RMSD =

√√√√ 1N

i=N∑i=1

δ2i

Decoys generated byI-TASSER[Wu et al., 2007]

Robetta[Rohl et al., 2004]

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 7 / 15

Page 9: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How the best of CASP do it?Energy of models vs. distance to a target structure

Similarity measure

RMSD =

√√√√ 1N

i=N∑i=1

δ2i

Decoys generated byI-TASSER[Wu et al., 2007]

Robetta[Rohl et al., 2004]

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 7 / 15

Page 10: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How the energy function is designed?Weighted sum vs. free combination of terms

F (~T ) = w1 ∗ T1 + . . .wn ∗ Tn[Zhang et al., 2003]

F (~T ) =T1∗T3

w1∗log(T2)+ sin

„T4−w2∗T1

T5∗exp(cos(w1∗T3))

«

[Widera et al., 2010]

Decision supportlocal numericalapproximation

GP inputterminals:T1, . . . ,T8

functions:add sub mul divsin cos exp lograndom ephemeralsin range [0,1]

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 8 / 15

Page 11: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How the energy function is designed?Weighted sum vs. free combination of terms

F (~T ) = w1 ∗ T1 + . . .wn ∗ Tn[Zhang et al., 2003]

F (~T ) =T1∗T3

w1∗log(T2)+ sin

„T4−w2∗T1

T5∗exp(cos(w1∗T3))

«

[Widera et al., 2010]

Decision supportlocal numericalapproximation

GP inputterminals:T1, . . . ,T8

functions:add sub mul divsin cos exp lograndom ephemeralsin range [0,1]

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 8 / 15

Page 12: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

How the energy function is designed?Weighted sum vs. free combination of terms

F (~T ) = w1 ∗ T1 + . . .wn ∗ Tn[Zhang et al., 2003]

F (~T ) =T1∗T3

w1∗log(T2)+ sin

„T4−w2∗T1

T5∗exp(cos(w1∗T3))

«

[Widera et al., 2010]

Decision supportlocal numericalapproximation

GP inputterminals:T1, . . . ,T8

functions:add sub mul divsin cos exp lograndom ephemeralsin range [0,1]

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 8 / 15

Page 13: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

What is the GP objective?Beyond simple symbolic regression

1 construction of the reference ranking RR(structures sorted by distance to the native)

3 1 5 0 4 7 6 2

2 construction of the evolutionary ranking RE(structures sorted by the evolved energy function)

5 1 3 4 0 2 6 7

3 rankings comparison — RR vs. RE4 total fitness = average distance for m proteins

F =1m

m∑i=1

d(RRi ,REi)

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 9 / 15

Page 14: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Can GP improve over a weighted sum of terms?Nelder-Mead downhill simplex optimisation

spearman-sigmoid correlation

method d-100 all d-100 all

simplex 0.734 0.638 0.650 0.166GP 0.835 0.714 *0.740 *0.200

Computational cost of experiments55 proteins, 1000-2000 structures for each5 different ranking distance measures20 different configurations of GP parameterstotal of 150 CPU days

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 10 / 15

Page 15: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Can GP improve over a weighted sum of terms?Nelder-Mead downhill simplex optimisation

spearman-sigmoid correlation

method d-100 all d-100 all

simplex 0.734 0.638 0.650 0.166GP 0.835 0.714 *0.740 *0.200

Computational cost of experiments55 proteins, 1000-2000 structures for each5 different ranking distance measures20 different configurations of GP parameterstotal of 150 CPU days

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 10 / 15

Page 16: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Criteria for human-competitivness

CRITERION Fresult >= past achievement in the field

CRITERION Eresult >= most recent human-created

solution to a long-standing problem

CRITERION Hresult holds its own in a competition

involving human contestants

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 11 / 15

Page 17: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Criteria for human-competitivness

CRITERION Fresult >= past achievement in the field

CRITERION Eresult >= most recent human-created

solution to a long-standing problem

CRITERION Hresult holds its own in a competition

involving human contestants

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 11 / 15

Page 18: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Criteria for human-competitivness

CRITERION Fresult >= past achievement in the field

CRITERION Eresult >= most recent human-created

solution to a long-standing problem

CRITERION Hresult holds its own in a competition

involving human contestants

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 11 / 15

Page 19: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Comparison to the human made solution

1 automated method to discover the best combinationof the energy terms

2 human-competitive improvement to the solution of along-standing problem

3 challenge weighted sum of terms with expert-pickedweights

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 12 / 15

Page 20: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Potential impact

1 automated energy design using a free functionalcombination of terms haven’t been used before

2 energy functions determines the search landscapeand its smoothness is a key to the efficientprediction

3 long-term effects in protein science that theimprovement in prediction quality could bring

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 13 / 15

Page 21: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

Why this is the best entry?

1 innovates the field with a novel approach to along-standing problem

2 could be a step towards more accurate prediction andin a long-term improve drug design and identification ofdisease-causing mutations

3 represent a new and difficult challange for GPhttp://www.infobiotics.org/gpchallenge/

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 14 / 15

Page 22: HUMIES presentation: Evolutionary design of energy functions  for protein structure prediction

References

Anfinsen, C. (1973).Principles that Govern the Folding of Protein Chains.Science, 181(4096):223–30.

Dill, K. A. and Chan, H. S. (1997).From Levinthal to pathways to funnels.Nat Struct Mol Biol, 4(1):10–19.

Rohl, C. A., Strauss, C. E. M., Misura, K. M. S., and Baker, D. (2004).Protein Structure Prediction Using Rosetta.In Brand, L. and Johnson, M. L., editors, Numerical Computer Methods, Part D, volume Volume 383 of Methods inEnzymology, pages 66–93. Academic Press.

Widera, P., Garibaldi, J., and Krasnogor, N. (2009).Evolutionary design of the energy function for protein structure prediction.In IEEE Congress on Evolutionary Computation 2009, pages 1305–1312, Trondheim, Norway.

Widera, P., Garibaldi, J., and Krasnogor, N. (2010).GP challenge: evolving energy function for protein structure prediction.Genetic Programming and Evolvable Machines, 11(1):61–88.

Wu, S., Skolnick, J., and Zhang, Y. (2007).Ab initio modeling of small proteins by iterative TASSER simulations.BMC Biol, 5(1):17.

Zhang, Y., Kolinski, A., and Skolnick, J. (2003).TOUCHSTONE II: A New Approach to Ab Initio Protein Structure Prediction.Biophys. J., 85(2):1145–1164.

Natalio Krasnogor Evolutionary design of energy functions for PSP HUMIES 2010 15 / 15