Protein structure
description
Transcript of Protein structure
![Page 1: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/1.jpg)
Protein structure
Anne Mølgaard, Center for Biological Sequence Analysis
![Page 2: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/2.jpg)
“Could the search for ultimate truth really have revealed so hideous and visceral-looking an object?”
Max Perutz, 1964on protein structure
John Kendrew, 1959with myoglobin model
![Page 3: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/3.jpg)
Sep. 2001 Feb. 2005X-ray 13116 25350NMR 2451 4383 theoretical 338 0total 15905 29733
Holdings of the Protein Data Bank (PDB):
![Page 4: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/4.jpg)
Methods for structure determination
X-ray crystallography
Nuclear Magnetic Resonance (NMR)
Modeling techniques
![Page 5: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/5.jpg)
Modeling
Only applicable to ~50% of sequences
FastAccuracy poor for low sequence id.
• There is still need for experimental structure determination!
![Page 6: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/6.jpg)
Structual genomics consortium (SGC)• The SGC deposited its 275th structure into the Protein Data Bank in August 2006 • currently operating at a pace of 170 structures per year
• at a cost of USD$125,000 per structure. • Scientific highlights include:
• several (> 1!!) novel structures of protein kinases
• completing the structural descriptions of the human 14-3-3
• adenylate kinase and cytosolic sulfotransferase protein families
• human chromatin modifying enzymes; human inositol phosphate signaling
• and a significant number of structures from human parasites.
![Page 7: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/7.jpg)
http://www.ch.cam.ac.uk/magnus/molecules/amino/
Amino acids
![Page 8: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/8.jpg)
Amino acids
Livingstone & Barton, CABIOS, 9, 745-756, 1993
A – AlaC – CysD – AspE – GluF – PheG – GlyH – HisI – IleK – LysL – LeuM – MetN – AsnP – ProQ – GlnR – ArgS – SerT – ThrV – ValW – TrpY - Tyr
![Page 9: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/9.jpg)
•Primary
•Secondary
•Tertiary
•Quarternary
Levels of protein structure
![Page 10: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/10.jpg)
Primary structure
MKTAALAPLFFLPSALATTVYLAGDSTMAKNGGGSGTNGWGEYLASYLSATVVNDAVAGRSAR…(etc)
![Page 11: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/11.jpg)
-helix
-sheetleft-handed -helix
Ramachandran plot
![Page 12: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/12.jpg)
Hydrophobic core
Hydrophobic side chains go into the core of the molecule – but the main chain is highly polar
The polar groups (C=O and NH) are neutralized through formation of H-bonds
![Page 13: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/13.jpg)
Secondary structure
-helix C=O(n)…HN(n+4)
-sheet(anti-parallel)
![Page 14: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/14.jpg)
… and all the rest
310 helices (C=O(n)…HN(n+3)), -helices
(C=O(n)…HN(n+5))
-turns and loops (in old textbooks sometimes referred to as random coil)
![Page 15: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/15.jpg)
The -helix has a dipole moment
+-C
N
![Page 16: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/16.jpg)
Two types of -sheet:
anti-parallel parallel
![Page 17: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/17.jpg)
Tertiary structure (domains, modules)
Rhamnogalacturonan lyase (1nkg) Rhamnogalacturonan acetylesterase (1k7c)
![Page 18: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/18.jpg)
Quaternary structure
B.caldolyticus UPRTase (1i5e) B.subtilis PRPP synthase (1dkr)
![Page 19: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/19.jpg)
Classification schemes
SCOP– Manual classification (A. Murzin)
CATH– Semi manual classification (C. Orengo)
FSSP– Automatic classification (L. Holm)
![Page 20: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/20.jpg)
Levels in SCOP
Class # Folds # Superfamilies # FamiliesAll alpha proteins 202 342 550All beta proteins 141 280 529Alpha and beta proteins (a/b) 130 213 593Alpha and beta proteins (a+b) 260 386 650Multi-domain proteins 40 40 55Membrane and cell surface
proteins 42 82 91Small proteins 72 104 162Total 887 1447 2630
http://scop.berkeley.edu/count.html#scop-1.67
![Page 21: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/21.jpg)
Major classes in SCOP
Classes– All alpha proteins– Alpha and beta proteins (a/b)– Alpha and beta proteins (a+b)– Multi-domain proteins– Membrane and cell surface proteins– Small proteins
![Page 22: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/22.jpg)
All : Hemoglobin (1bab)
![Page 23: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/23.jpg)
All : Immunoglobulin (8fab)
![Page 24: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/24.jpg)
Triose phosphate isomerase (1hti)
![Page 25: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/25.jpg)
a+b: Lysozyme (1jsf)
![Page 26: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/26.jpg)
Folds*• Proteins which have >~50% of their secondary structure elements arranged the in the same order in the protein chain and in three dimensions are classified as having the same fold
• No evolutionary relation between proteins
*confusingly also called fold classes
![Page 27: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/27.jpg)
Superfamilies
Proteins which are (remote) evolutionarily related– Sequence similarity low– Share function– Share special structural features
Relationships between members of a superfamily may not be readily recognizable from the sequence alone
![Page 28: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/28.jpg)
Families
Proteins whose evolutionarily relationship is readily recognizable from the sequence (>~25% sequence identity)
Families are further subdivided into Proteins
Proteins are divided into Species– The same protein may be found in several species
![Page 29: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/29.jpg)
Links
PDB (protein structure database)– www.rcsb.org/pdb/
SCOP (protein classification database) – scop.berkeley.edu
CATH (protein classification database)– www.biochem.ucl.ac.uk/bsm/cath
FSSP (protein classification database)– www.ebi.ac.uk/dali/fssp/fssp.html
![Page 30: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/30.jpg)
Why are protein structures so interesting?
They provide a detailed picture of interestingbiological features, such as active site, substratespecificity, allosteric regulation etc.
They aid in rational drug design and protein engineering
They can elucidate evolutionary relationshipsundetectable by sequence comparisons
![Page 31: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/31.jpg)
COOH
NH2
Asp His Ser Topological switchpoint
Inferring biologicalfeatures from the structure
1deo
![Page 32: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/32.jpg)
Inferring biological features from the structure
Active site
Triose phosephate isomerase (1ag1) (Verlinde et al. (1991) Eur.J.Biochem. 198, 53)
![Page 33: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/33.jpg)
Engineering thermostability in serpins
OverpackingBuried polar groupsCavities
Im, Ryu & Yu (2004) Engineering thermostability in serine protease inhibitorsPEDS, 17, 325-331.
![Page 34: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/34.jpg)
Evolution...
Structure is conserved longer thanboth sequence and function
![Page 35: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/35.jpg)
Rhamnogalacturonanacetylesterase(A. aculeatus) (1k7c)
Platelet activating factor acetylhydrolase(Bos Taurus) (1wab)
Serine esterase (S. scabies)(1esc)
![Page 36: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/36.jpg)
Platelet activatingfactor acetylhydrolase
Serine esterase
Rhamnogalacturonanacetylesterase
Mølgaard, Kauppinen & Larsen (2000) Structure, 8, 373-383.
![Page 37: Protein structure](https://reader035.fdocuments.in/reader035/viewer/2022070500/56816866550346895ddec2f8/html5/thumbnails/37.jpg)
"We wish to suggest a structure for the salt of deoxyribose nucleic acid (D.N.A.). This structure has novel features which are of considerable biological interest….…It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material."
J.D. Watson & F.H.C. Crick (1953) Nature, 171, 737.