Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas...

63
Protein structure and modelling Orientation Protein structure Protein modelling Andreas Heger University of Helsinki Bioinformatics Group s will be available at: na.biocenter.helsinki.fi:8080/downloads/teaching/hut2004

Transcript of Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas...

Page 1: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein structure and modelling

● Orientation● Protein structure● Protein modelling

Andreas HegerUniversity of HelsinkiBioinformatics Group

Slides will be available at: ekhidna.biocenter.helsinki.fi:8080/downloads/teaching/hut2004/

Page 2: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Proteins

● Proteins are involved in all processes inside a cell– Gene regulation– Metabolism– Signalling– Development– Structure

http://www.websters-online-dictionary.org/definition/english/ce/cell.html

Page 3: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Chemistry

● Proteins are linear hetero-polymers of amino acids– twenty different amino acids (building blocks)

ARG LYS VAL ILE PRO ARG GLU LYS

R K V I P R E K

3-letter code

1-letter code

Page 4: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Peptide bond

http://www.imb-jena.de/~rake/Bioinformatics_WEB/basics_peptide_bond.html

The peptide bond is planar

2 angles freely rotatable1 is fixed

Peptide ~ 2-10 amino acidsPolypeptide ~ 10-50 amino acidsProtein ~ 50- amino acids

Double bond character of the peptide bond

Page 5: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Amino acids

● Side chain properties– Size– Charge– Polarity

http://www.ch.cam.ac.uk/SGTL/Structures/amino/

Page 6: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Proteins are very special polymers:● A given protein has always the same amino acid

sequence– Protein sequence is determined by DNA sequence

● A given protein has always a unique three- dimensional structure.– Protein structure is determined by protein sequence.

always = biological always (there are exceptions)

Page 7: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein evolutionSequence – Structure - Function

DNA sequence

Protein sequence Protein structure

Protein functionSelection

Page 8: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Summary

● Protein structure is the key to understanding protein function

● Topics in protein structure

1.Protein structure determination

2.Protein architecture

3.Protein function

4.Protein folding● Protein modelling and computational methods

Page 9: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein structure determination

● Protein expression– membrane proteins– aggregation

● X-Ray crystallography● NMR (nuclear magnetic resonance)● Cryo-EM (electron microscopy)

Page 10: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Structures by X-ray crystallography

➔ Crystallize protein● Collect diffraction patterns● Improve iteratively:

– Calculate electron density map● Phase problem

– Fit amino acid trace through map

Page 11: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

X-ray crystallography

● Crystallization

● “An art as much as a science”Charges

http://crystal.uah.edu/~carter/protein/crystal.htm

Page 12: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Diffraction and electron density maps

Diffraction pattern

X-ray source Crystal

Intensities

Page 13: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Iterative refinement

http://www.sci.sdsu.edu/TFrey/Bio750/Bio750X-Ray.html

Higher resolution =more accurate positioning of atoms

Resolution

Page 14: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

NMR

● Create highly concentrated protein solution● Record spectra● Assign peaks to residues● Calculate constraints● Compute structure

Page 15: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

NMR spectra

1D 2D

http://www.cryst.bbk.ac.uk/PPS2/projects/schirra/html/2dnmr.htm

Page 16: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Distance constraints from NMR

● From the sequence– Topology– Bond angles– Bond lengths

● From the NMR experiment– Torsion angles– Distance constraints

HαR

CO

H

CO

Torsion angle

Page 17: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Ensemble of structures

SH3-domain

1aey

Page 18: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

What is the true protein structure?

● X-Ray– “frozen” state of a protein

● crystal contacts✔ large protein structure

● NMR✔ protein in solution– limited in size

Page 19: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Molecular complexesvia X-ray

1fjg

30 S subunit of the ribosome

Protein

RNA

Page 20: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Cryo-EMSingle particle image reconstruction

Koning et al. (2003)

Bacteriophage MS2

Page 21: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Fitting X-Ray structures into density maps

Page 22: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

GroEL-complex

1gr6

Hemoglobin

Page 23: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein structure databases

http://www.wwpdb.org/index.html

Page 24: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein architecture

● Protein structure is the key to understanding protein function

● Topics in protein structure

1.Protein structure determination

2.Protein architecture

3.Protein function

4.Protein folding● Protein modelling and computational methods

Page 25: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Topics in protein architecture

● Principles of protein architecture– Secondary structure– Supersecondary structure– Tertiary structure– Quarternary structure

● Classification of protein structures

Page 26: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

The big surprise

DNA is a regular structure Watson & Crick (1953)

Page 27: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Myoglobin

Kendrew and Perutz1957

1mbn

Page 28: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Secondary structure● backbone

– no amino acid side chains● regular patterns

– of hydrogen-bonds– backbone torsion angles

● types of secondary structure

– α-helix– β-sheet– ...

Page 29: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

α-Helix

β-Sheethydrogen bond pattern: n, n+4

Page 30: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

β-sheet

http://broccoli.mfn.ki.se/pps_course_96

view from the top view from the side

β-strands

Page 31: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Cartoon representation

2TRX 2AAC

Page 32: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Supersecondary structures

● local arrangments of secondary structure elements

http://www.expasy.org/swissmod/course/text/chapter2.htm

Page 33: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Tertiary structure

1coh

Page 34: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Quaternary structure

1coh

Page 35: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein structure

● Primary structure

● Secondary structure

● Super-secondary structure

● Tertiary structure

● Quaternary structure

Page 36: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein domains/modules

● globular● independently foldable● occur in different contexts

Page 37: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Domains via the contact matrix

Page 38: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Structure classification

● 24908 structures in the Protein Databank (PDB)● major classifications of proteins:

– SCOPhttp://scop.mrc-lmb.cam.ac.uk/scop/

– CATHhttp://www.biochem.ucl.ac.uk/bsm/cath/

– DALI DOMAIN DICTIONARY/FSSPhttp://ekhidna.biocenter.helsinki.fi:8080/dali/index.html

Page 39: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Hierachical description of protein architecture

1.Class:

α, β, α/β, α+β

2.FoldStructural similarity

3.SuperfamilyEvolutionary relationship

4.FamilySequence similarity

1.Class

α, β, α&β

2.ArchitectureSS: Spatial arrangement

3.TopologySS: Topology

4.Homologystructural/sequence similarity

SCOP CATH

Page 40: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

CATH

http://www.biochem.ucl.ac.uk/bsm/cath/cath_info.html

Class

Architecture

Topology

Page 41: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Dali Domain Dictionary

1.Fold space attractor region

Secondary structure composition and supersecondary structural motifs

2.Globular folding topology

Structural comparison

3.Functional family

Neural network

4.Sequence family

Sequence comparison

Page 42: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.
Page 43: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Deviation from globularity

● Domain swapping● Repetitive structures● Open/closed conformations

1bsr

5rsa

1amy

1d0b

Page 44: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein function

● Protein structure is the key to understanding protein function

● Topics in protein structure

1.Protein structure determination

2.Protein architecture

3.Protein function

4.Protein folding● Protein modelling and computational methods

Page 45: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Topics in protein function

● How does structure determine function?– Structural proteins– Enzymes– Transcription factors– ...

Page 46: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Structural proteins

● Collagen

1K6F http://www.aw-bc.com/mathews/ch06/fi6p13ad.htm

Page 47: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Actin and muscles

Page 48: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Enzymes

● Catalytic triad: Asp, Ser, His

1CHO

Page 49: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Mechanism

● Enzymes speed up chemical reactions● Enzymes are not consumed by the reaction● Stabilization of the transition state● Charge-relay cascade

Page 50: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Convergent evolution in serine proteases

● same reaction● same mechanism● same orientation of

catalytic residues● different structures

– Chymotrypsin:● His-57, Asp-102, Ser-195

– Subtilisin:● Asp-32, His-64, Ser-221

1cho / 1sib

Page 51: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Substrate specificity

Perona & Craik (1997)

Page 52: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Transcription factors

1L3L

Ligand

DNA

Page 53: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Hydrogen bonding pattern

Vannini (2002)

Page 54: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein folding

● Protein structure is the key to understanding protein function

● Topics in protein structure

1.Protein structure determination

2.Protein architecture

3.Protein function

4.Protein folding● Protein modelling and computational methods

Page 55: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein denaturation

● Denatured state = unfolded state● Native state = folded state● Denaturation = heat, urea, salts

Reaction coordinate

Energy

FoldedUnfolded

Reaction coordinate

Energy

FoldedUnfolded

Page 56: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein stability

● Native state only marginally more stable than denatured state

● Contributions to protein stability– hydrophobic effect: entropic effect– hydrogen bonds: net effect = 0– others

● salt bridges● disulphide bonds● aromatic-aromatic interactions● metal binding

Page 57: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Hydrophobic core of lysozyme

1HELHydrophobic amino acid

Hydrophilic amino acid

Page 58: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein folding

● Folding Funnel● Energy landscape

guides protein towards native structure

Dobson (2004)

C: total contacts

Q: native contacts

Page 59: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Energy landscape for the folding of lysozyme

Fast trackSlow track

Dobson (2004)

Page 60: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Misfolded proteins

● Disulfid-isomerases, Prolin-isomerases● Chaperones: unfold misfolded proteins● Protein folding diseases

– BSE– Alzheimer's disease– Parkinson's disease– ...

Page 61: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

GroEL – a chaperone

1gr6

Page 62: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Wang & Weissmann (1999)

Roseman et al. (1996)

GroEL mechanism

Page 63: Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.

Protein structure

● Protein structure is the key to understanding protein function

● Topics in protein structure

1.Protein structure determination

2.Protein architecture

3.Protein function

4.Protein folding● Protein modelling and computational methods