Protein folding

Protein folding

Process of folding

Modeling the process of folding

Evolution vs. folding

Impact of function on protein evolution

Process

Local Interactions

Secondary Structure Elements (SSE)

Assembly of SSE

Equilibrium Structure

Protein folding

http://www.blueprint.org/proteinfolding/trades/details/trades_movies.html

Protein folding

Important thing to note

It is possible that residues that are not doing anything in the folded protein were

actually critically important to get the peptide folded in the

first place.

Protein folding

Simulation studies are demonstrating that the most common protein folds are those who can

withstand the most sequence variation over time without

affecting their topologies. The prion protein is a posterchild example

of the opposite.

Protein Evolution

Evolutionary meaning

Most common folds are those able to

withstand point mutations the best.

These are known as designable folds.

Protein folding

Marginal stability

The most stable folds are not necessarily these

with the lowest energy.

But these that maximally penalize switching to an alternative conformation.

Protein Evolution

Marginal stability

Evolutionary implication(s)

There is thus selective pressure on residues in

protein not only to maintain important

interaction, but also to make sure that some interaction NEVER

happen.

Summary

Proteins fold into energetically stable conformations.

For one chain, there are a large number of possible conformations, however.

The biological conformation is selected during folding: not necessarily the “best” conformation.

Role of biology on structures

A few examples using mapping of rate of evolution.

The fitness of a protein is ultimately its biological function, not its structure.

We’ll have a look at their structural requirements.

Structural Biology

Outline

How genetics encode structure.

What make a protein fold.

Role of biological function on preserving a fold.

Comparing two structures for similarities.

Genetic information and proteins

3D information is encoded into (1D) sequences.

STKKKPLTQEQLEDARRLKA IYEKKKNELGLSQESVADKM GMGQSGVGALFNGINALNAY NAALLAKILKVSVEEFSPSIAREIYEMYEA

Protein structure of CRO repressor in phage Lambda, PDB: 1LMB

?


The encoding can only be indirect

Because there is nothing in the DNA

that tells each amino acid where to go.


However,

There is a few types of physical interactions that are dominating

the process of protein folding.

Amino-acidsComponentsMain Chain

Side Chains

Side ChainsResponsible for the “name”.

Can be clustered based on:

- chemical properties

- Structure

This ultimately determine the evolutionary interchangeability.

Protein folding

Van der Waal forces

The electron clouds around the nuclei are more

stable if they can lightly interact with other electron clouds.

Makes atoms sticky relative to each other.

Protein folding

Electrostatic forces

Long range interactions.

Pull/Push over longer distances.

Protein folding

Hydrogen bonds

Electrostatic. Short range, not flexible

Can be seen as the velcro holding proteins

together.

Protein folding

Hydrophobic interactions

Water molecules in liquid pack as to minimize their

energies

This implies that water molecules are more than often are doing H-bond

with their neighbors.

Protein folding

Hydrophobic interactions

If you introduce a droplet of oil in solution, many hydrogen bonds will have to be broken at the interface, at an energy cost.

This is why hydrophobic and hydrophilic groups look like they are avoiding

each other.

Protein folding

During folding,

The polypeptide has to follow a strict

sequence of event in order to find the

correct conformation in a timely fashion.

Protein folding

Secondary Structures

Stable because of local h-bonds.

Makes larger block with fewer freedom of

movement

Protein folding

Geometry plays a very important role.

Because there are only a few angles that can

change along the backbone, there is a

limited number of ways a protein can

fold onto itself.

Protein structures are organized in a Hierarchical fashion

Secondary structures - Geometry

Dihedral AngleBecause most main chain atoms are constrained in a “amide bond”, the entire trajectory of the chain can be defined by the pair of angles (for each AA):

This can be represented with a

“Ramachandran Plot”.From which it is obvious that there are some kind of clustering going-on.

,

l

l


Secondary structures – The alpha helix

The Hydrogen BondAgain, a helix is an ideal setup to place our “velcro” H-bond always at the right place.

PeriodicityTo the delight of statisticians and computer scientists.


Secondary structures – The beta strand (beta sheets)

Another periodical pattern ( )Responsible for super-structure rigidity and some truly amazing patterns.

2f


Secondary structures – The myth of “random” coil.

Random structures in protein are extremely rare.Many uses the expression anyway to refer to the “rest” of the protein.

Other minor secondary structuresTurns, loops, bridges. Although these don’t have the critical periodicity found in and structures.


Tertiary structures – The reason why to care about 2nd structures.

Secondary structures are building blocksDetecting and predicting secondary structures is a key process in structural biology.

Other usesVisualization, classification…

Protein Diversity

The current release of PDB contains 28,000

structure entries.

26,000 are proteins

There is an estimated 600-8000 possible

unique protein folds.

http://www.jacquesdeshaies.com/expositions/virtual/new-virtual/uppsala-invit.html

PDB

Overview

Repository of structuresProteins, Nucleotides, complexes, mutants

Quality improve over timeData validation tools are getting better. More redundant structure are available for cross-reference.

Small number of folds

Does this means that all proteins are

coming from a small set of ancestor

molecule?

Perhaps, but not necessarily.