I Characterization of non-planar peptide groups in protein...

8
Indian Journal of Biochemistry and Biophysics Vol. 4l, October 2004, pp. 233-240 Characterization of non-planar peptide groups in protein crystal structures Anjan Kumar Dasgupta 1 , Rabi Majumdar" and Dhananjay Bhattacharyya'< 'Department of Biochemistry, University of Calcutta, 35 Ballygunge Circular Road, Kolkata 700019 2Department of Biophysics, Molecular Biology and Genetics, University of Calcutta 92 Acharya Prafulla Chandra Road, Kolkata 700009 3Biophysics Division, Saha Institute of Nuclear Physics, 11 AF Bidhan Nagar, Kolkata 700064 Received 1 January 2004; revised 4 August 2004 Peptide groups are generally assumed to be planar in protein structure, due to 'rigid' partial double bond character of peptide bonds, thus the value of peptide torsion angle ro should be restricted to 180 0 for the usual/rans form of peptide unit. However, on analyzing the ultra-high resolution protein crystal database, we find that in some cases, ro deviates> l O? from its usual value of 180 0 , indicating significant non-planarity of peptide groups. Moreover, the non-planarity for most of the amino acids is found to be 'biased' towards values of co smaller than 180 0 Similar trend for ro is confirmed by the neutron diffraction data for proteins. The neutron diffraction database also reveals that non-planar peptide groups are generally correlated to 'pyramidal' structure of the peptide-nitrogen bonds. Consequently, the hydrogen atom of peptide group deviates from its planar position, as measured by the 'improper' torsion angle 8. Thus, we find that both the angles (j) and 8 point towards a significant amount of non-planarity of peptide groups, which cannot be ignored. The role of peptide non- planarity in protein function is, however, not yet clear. Keywords: X-ray diffraction, database analysis, neutron diffraction, non-planar peptide, peptide bond, protein crystal structure. IPC Code: C 07 K Protein structure has enormous flexibility due to the possibility of alteration of the two main chain torsion angles, <!> and ur, as well as the side chain torsion angles X in some cases. This gives rise to its spatial stereo structure and folding, directed by its amino acid sequence'. Peptide units are believed to be the planar components of a protein, as a result of the partial double bond character of peptide bonds. This imparts rigidity to the peptide torsion angle 00, the value of which is restricted to 180 0 for the usual trans peptide unit having staggered geometry. This hypothesis, originally put forward by Pauling, was first questioned by Ramachandran et al. 2 on the basis of semi-empirical quantum chemical calculations. Experimental evidence of non-planar amide groups in cyclic peptides also established a similar picture.". Also, in folded proteins, cis peptide units are often found to exist, indicating the possibility of cis-trans *Author for correspondence E-mail: [email protected] Fax: +91-33-2337-4637 Tel: +91-33-2337-0379 isomerization for peptide units under folding strain. It was first shown experimentally by Watenpaugh et al. 9 that 00 in a protein structure can have values different from 180° and it is biased towards values lower than 180°. This was later extensively studied using structural database analysis!", as well as using ab initio quantum chemical calculations 11. These studies indicate that the peptide groups are generally non- planar having 00 closer to 175° rather than 185°, and have recently been reviewed12. The literature on protein structure analysis, however, does not indicate any cause or effect of such non-planar peptides'", It is not even clear whether these non-planarities are due to errors in the atomic positions of the crystal structures. We have, therefore, studied the ultra-high resolution crystal structures obtained from protein data bank (PDB)13, for several unrelated proteins to find out if the torsion angle w indeed shows a systematic and asymmetric deviation from 180°. We have further analyzed all the available protein crystal structures solved by using neutron diffraction data to confirm our results for the peptide torsion angle 00. In addition, the neutron diffraction

Transcript of I Characterization of non-planar peptide groups in protein...

  • plMi-

    ashi H

    ada T,9

    I l76,

    6488-

    ") Arch

    II R J

    ,799-

    Indian Journal of Biochemistry and Biophysics

    Vol. 4l, October 2004, pp. 233-240

    Characterization of non-planar peptide groups in protein crystal structuresAnjan Kumar Dasgupta 1, Rabi Majumdar" and Dhananjay Bhattacharyya'<

    'Department of Biochemistry, University of Calcutta, 35 Ballygunge Circular Road, Kolkata 700019

    2Department of Biophysics, Molecular Biology and Genetics, University of Calcutta92 Acharya Prafulla Chandra Road, Kolkata 700009

    3Biophysics Division, Saha Institute of Nuclear Physics, 11AF Bidhan Nagar, Kolkata 700064

    Received 1 January 2004; revised 4 August 2004Peptide groups are generally assumed to be planar in protein structure, due to 'rigid' partial double bond character of

    peptide bonds, thus the value of peptide torsion angle ro should be restricted to 1800 for the usual/rans form of peptide unit.However, on analyzing the ultra-high resolution protein crystal database, we find that in some cases, ro deviates> l O? fromits usual value of 1800, indicating significant non-planarity of peptide groups. Moreover, the non-planarity for most of theamino acids is found to be 'biased' towards values of co smaller than 1800• Similar trend for ro is confirmed by the neutrondiffraction data for proteins. The neutron diffraction database also reveals that non-planar peptide groups are generallycorrelated to 'pyramidal' structure of the peptide-nitrogen bonds. Consequently, the hydrogen atom of peptide groupdeviates from its planar position, as measured by the 'improper' torsion angle 8. Thus, we find that both the angles (j) and 8point towards a significant amount of non-planarity of peptide groups, which cannot be ignored. The role of peptide non-planarity in protein function is, however, not yet clear.

    Keywords: X-ray diffraction, database analysis, neutron diffraction, non-planar peptide, peptide bond, protein crystalstructure.

    IPC Code: C 07 K

    Protein structure has enormous flexibility due to thepossibility of alteration of the two main chain torsionangles, and ur, as well as the side chain torsionangles X in some cases. This gives rise to its spatialstereo structure and folding, directed by its aminoacid sequence'. Peptide units are believed to be theplanar components of a protein, as a result of thepartial double bond character of peptide bonds. Thisimparts rigidity to the peptide torsion angle 00, thevalue of which is restricted to 1800 for the usual transpeptide unit having staggered geometry. Thishypothesis, originally put forward by Pauling, wasfirst questioned by Ramachandran et al.2 on the basisof semi-empirical quantum chemical calculations.Experimental evidence of non-planar amide groups incyclic peptides also established a similar picture.".Also, in folded proteins, cis peptide units are oftenfound to exist, indicating the possibility of cis-trans

    *Author for correspondenceE-mail: [email protected]: +91-33-2337-4637Tel: +91-33-2337-0379

    isomerization for peptide units under folding strain. Itwas first shown experimentally by Watenpaugh et al.9

    that 00 in a protein structure can have values differentfrom 180° and it is biased towards values lower than180°. This was later extensively studied usingstructural database analysis!", as well as using abinitio quantum chemical calculations 11. These studiesindicate that the peptide groups are generally non-planar having 00 closer to 175° rather than 185°, andhave recently been reviewed12.

    The literature on protein structure analysis,however, does not indicate any cause or effect of suchnon-planar peptides'", It is not even clear whetherthese non-planarities are due to errors in the atomicpositions of the crystal structures. We have, therefore,studied the ultra-high resolution crystal structuresobtained from protein data bank (PDB)13, for severalunrelated proteins to find out if the torsion angle windeed shows a systematic and asymmetric deviationfrom 180°. We have further analyzed all the availableprotein crystal structures solved by using neutrondiffraction data to confirm our results for the peptidetorsion angle 00. In addition, the neutron diffraction

  • 234 INDIAN J. BIOCHEM. BIOPHYS., VOL4I, OCTOBER 2004

    database helped us to determine the improper'dihedral angle e (value of which is also 1800 for aplanar peptide unit), as a measure of pyramidalcharacter of the amide nitrogen atom.

    MethodIn analyzing ultra-high resolution crystal

    structures obtained from protein data bank, aresolution cut-off value of 1 A was first attempted,which gives only 82 structures. In order to obtain areasonably large number of such structures forstatistical analysis, the resolution cut-off criterion wasfinally relaxed to 1.2 A, which gives 236 structures,according to June 2002 update, some of theseconsisting of multiple polypeptide chains. Takingthese multiple chains into account and also removingthe much smaller peptide chains, such as thoseconsisting of 20 residues or less, 244 chains werefinally analyzed altogether. The details are availablein hUp:llwww.saha.ernet.in!cmb_page/htmllNonPlanarPeptides.html.

    Only 10 protein crystal structures are currentlyavailable in the protein data bank solved on the basisof neutron diffraction method, a relatively newtechnique in structural biologyl3. In view of such asmall number of protein structures solved by theneutron diffraction technique, so far, we analyzed allthe available structures, without any resolution cut-off. The peptide non-planarities are evaluated fromtorsion angles 00,while the resulting partial pyramidal

    Torsion Angle co

    s:Pl Ga·~~ I, --G"c~~Ni~\1 P2

    H

    Improper Angie 9

    Fig. l-Schematic representation of a peptide unit describingproper torsion angle (J) and the improper torsion angle on the

    nitrogen atom

    characters of peptide nitrogen bonds are evaluated interms of improper dihedral angles e, defined by(C'(i-l)--Ca(i)--N(i)--HN(i» as illustrated in Fig. 1.

    The software STRIDE'4 was used to analyzehelical and the other secondary structures of eachamino acid in different protein chains. The lessfrequently observed structures, such as the 310helixand some others, were not considered for furtheranalysis. The propensity values shown in Table 1were calculated, following Chou and Fasman 15 byusing the following expression:

    N( 00)straiNaaPaa(oo)=----- ...(1)

    where N(W)straa'is the number of occurrence of anamino acid 'aa' with large Wdeviation (either 00190°) in a given secondary structure, Nstraa thetotal number of occurrence of the same amino acid inthe same secondary structure, Naa the total number ofoccurrence of the same amino acid in the entiredatabase, and N the total number of all amino acidsoccurring in the database. Thus, the parametersPaaC170-)and Paa(i90+) represent the propensity of anamino acid 'aa' to satisfy our unusual torsion anglecriteria, namely w < 170° and co> 190°, respectively.

    All the four different moments of torsion angle00were calculated, following standard statisticaldescription" for large degrees of freedom. Thedimensionless parameters for testing of skew andkurtosis were computed by using the followingexpressions:

    ...(2)

    and

    I( OOi -

  • Table I-Number of Occurrence and propensity values (within parenthesis for each secondary structure type) of different amino acids indifferent secondary structures for adopting non-planar peptide geometry with peptide torsion angle (a) smaller than 170", (b) larger than 190"

    (a) Paa(170·)Occurrence

    Occurrence and propensity inAmino acid with W

  • 236 INDIAN 1. BIOCHEM. BIOPHYS., VOL 41, OCTOBER 2004

    Results and Discussion'Biased' peptide torsion angle deviations in protein structures

    In analyzing protein structures on the basis ofultra-high resolution crystal database, we have foundthat a sizable number of amino acids develop 'large'non-planarity, due to peptide torsion angle deviation~w = Iw-180ol = 10° or more (which we call 'large'deviations). This is irrespective of the protein size, itsamino acid content or its secondary structure, asshown in supplementary material available on-line fordetails (http://www.saha.emet.in/cmb_page/htrnl/NonPlanarPeptides.html), which also lists the number ofhighly non-planar pep tides in each of the proteinstrands and the parameters for testing skew andkurtosis. We have calculated the correlationcoefficient between the gl parameter (for testingskew) and the amount of non-planarity bias[N(w1900)] for all 244 proteins chains,as shown in Fig. 2. It is found to be -0.14. This valueof the correlation coefficient is about 98% significantfor 244 degrees of freedom, indicating that the twovariables are highly correlated.

    The number of occurrence and the propensity ofeach amino acid with such 'large' deviations in w asparts of proteins having different secondary structuresare shown in Table 1. The results clearly indicate thatthere is a significantly larger number of amino acidswith co< 170° (2395 cases), than those with w>190°(1571 cases), out of a total number of 42764 peptidesin the database. This asymmetry or 'biased' deviationtowards smaller values of co is definitely non-randomand is specially highlighted in Fig. 3, where the

  • its theTable 2-Statistical analysis of variation of peptide torsion angle (co) for different amino acid residues at C-terminal end

    ned in[Four different moments, mean, standard deviation, skew and kurtosis are calculated along with the corresponding dimensionless

    es are quantities for skew and kurtosis values]shows

    Amino Total Average Standard Skew, reflecting Kurtosis, reflecting Test of Skew Test of Kurtos-erage, acid occurrence co deviation asymmetry spread (gl) (g2)for all Ala 4140 179.60 5,10 -17,22 3468.90 -0.1294 2.1113inctive

    Cys 891 179.45 5.58 -11.41 3111.23 -0.0656 0.2002of the

    Asp 2611 179.22 6.06 -12.90 5331.67 -0.0578 0.9429

    Glu 2299 179.33 5.15 -9.50 3040.91 -0.0696 1,3228I, the Phe 1553 179.47 6,56 -9,59 6382.21 -0.0340 0.4556I atom

    Gly 3709 178.90 5.63 21.52 4365,94 0.1191 1.2741ith the

    likelyHis 923 179.76 6.61 44.86 7666.05 0.1551 1.0064

    sibility Ile 2216 178.62 6.07 -54.62 5378.33 -0.2439 0,9540

    ith the Lys 2232 179.255.49 -5.23 4193.44 -0.0316 1.6126

    acids.Leu 3117 179.17 5.25 -27,84 2873.25 -0.1923 0.7784

    able to Met 758 179.29 4.83 3.27 1993.13 0.0290 0.6603

    ~ atom. Asn 2097 179,39 5,63 1.54 3623.64 0.0086 0.6192

    Fids in Pro 1945 180.02 6.24 -57.07 7344.59 -0.2347 1.8390

    ro, Trp GIn 1545 179.18 5.40 -9,94 3156,29 -0.0633 0.7214

    lbit the Arg 1735 179.47 5.47 -1.67 3463.58 -0.0102 0.8640

    ~where Ser 2849 179.33 6.01 -8.12 5652.98 -0.03741.3345

    value Thr 2802 178.74 6.29 -8.12 5652.98 -0.0374 1.3345

    eptides Val 3018 178,78 5.61 -17.77 3973.29 -0.1005 1.0055

    own In Trp 639 179.77 5.43 -4.16 2798.34 -0.0260 0.2238

    fOur of Tyr 1595 180.02 6.69 40.20 6671.68 0.1345 0.3393

    gard toTable 3-List of all the protein structures obtained from PDB and solved by neutron diffraction technique [(The positions of hydrogen atoms arealso determined in these structures, which allowed us to calculate the improper dihedral angles 8 at the nitrogen atomic positions. The refinement

    and structural parameters are also given to highlight the relation between peptide torsion angles and improper torsion angles (8 )]

    PDB ID Protein name No. of Resolution Refinement Cases with Cases with Correlatioramino protocol between coacids to < 170 w>190 8 190 and 9

    lC57 Concanavalin A 237 2.4 A X-PLOR 0 0 0 0 0.99

    ICQ2 Sperm whale myoglobin 153 2.oA X-PLOR 0 0 4 6 -0.47

    IGKT Endothapepsin 329 2.1 A SHELXL 3 3 1 1 0.84

    II05 Hen egg-white lysozyme 129 2.oA X-PLOR 1 0 35 43 -0.35

    1LZN Hen egg-white lysozyme 129 1.7 A X-PLOR 7 5 2 1 0.93

    INTP Serine protease 223 1.8 A Constrained 13 5 1 2 0.86difference fourier

    2MB5 Carbonmonoxy myoglobin 153 1.8 A PROLSQ 0 0 0 0 0.7

    7209 3INS* Zn-insulin 21 2.2A PROLSQ 0 0 1 0 0.6230 0 1 0 3 0.09

    protein 21 0 0 1 0 0.56

    ar and co 30 0 0 1 0 0.58nature of 5RSA Ribonuclease A 124 2.oA PROLSQ 0 1 2 4 0.831800 and 6RSA Ribonuclease 124 2.oA PROLSQ 0 1 3 4 0.82

    equencies*This protein has four chains and their structural properties are given in separate lines.

    DASGUPTA et al.: NON-PLANAR PEPTIDE GROUPS IN PROTEIN CRYSTAL STRUCTURES 237

  • 238 INDIAN J. BIOCHEM. BlOPHYS., VOL 41, OCTOBER 2004

    A comparison of Tables 1 and 2 indicates thatGly and Pro residues have no preference for directionof non-planarity as their skewness are of oppositesense from the two measures, which is expected. Thelargest residue viz., Trp also has no preference,possibly due to its bulky size, which may lead toseveral unusual contacts with rest of the protein. Theside chain of His most probably forms severalimportant functional contacts, such as those withheme group, and may have unusual strains. It is notclear why Tyr prefers 0) >190°, but this may be due toboth polar and non-polar interactions of its side chainor some other yet unknown factor. There are,however, several other strains and steric factors in afolded protein, which may often dominate the localfactors.

    Analysis of the propensity values in Table 1indicates that 0) 190°. Thisconfirms the asymmetry demonstrated in Fig. 3.

    The fact that e and 0) are correlated is even bettershown in Fig. 6. This shows deviation of both theangles in few representative proteins. The pair-wisecorrelation coefficients for these angles in the case ofall protein structures are also listed in Table 3, whichindicate high degree of correlation for majority of thestructures. The correlation coefficient value of 0.28 is

  • 40

    ~50

    [,60

    proteinlJSE;

    globinsneutron~ctively.mctures,ar. The, where

    tainedihown.

    thesemailerout of

    have. This

    betterth the-wise

    caseofwhichof the.28 is

    DASGUPTA et al.: NON-PLANAR PEPTIDE GROUPS IN PROTEIN CRYSTAL STRUCTURES 239

    200 A200

    180

    1600 20 40 60 80 100 120 140

    1i:; 200a.

    (b)0!....a.E.....-0 180c~s,\lJa.

    ~ 1600.. 0 50 100 150 200 250

    200

    180

    160 L-~ __ -L__~ __L-~ __-L__~~o 20 40 60 80 100 120. 140 160

    Residue number

    Fig.6-Variation of peptide torsion angle co (in black) andimproper torsion angle (in green), describing pyramidal nature ofpeptide nitrogen atom are shown as functions of residue number forthree representative proteins solved from neutron diffraction data

    over 99.9% significant for about 1700 data points.The overall correlation between the two variables(taking all proteins) is also demonstrated in Fig. 7.

    ConclusionIt is seen from our results that the non-planarity

    of peptide units is a significant feature of proteins andcannot be ignored. Most of the amino acids at C-terminal side of a peptide tend to induce a biased non-planarity to the peptide. Notable exceptions are Glyand Pro, possibly indicating that these non-planaritybias arise from steric clash between Cfl atom and lonepair electrons at nitrogen. Earlier, we reRorfed similarnon-planarity of amino groups of DNA2 ,21,where wefound that this improves the strength of the bifurcatedcross-strand hydrogen bonds across the successivebase pairs, thereby contributing significantly to therigidity of DNA molecule for some specific basesequences. Similarly, for proteins, the question iswhether the peptide non-planarity can facilitate

    150

    100 -

    50 -

    ~ 0 ~~~~LY~.u~~~~~~~~cQ):J0-

    ~ 200 -.--------.-~--------.u,

    180 182 184 186 188Deviattofls- in w

    190 192

    50

    B

    150 -

    100 -

    o180 183 186 189 192 195 198 201 204

    Deviations in e

    Fig. 5-Frequency distribution of (A): peptide torsion angle to;and (B): variations in peptide improper dihedral angle e in theprotein structures obtained from neutron diffraction technique

    [The convention adopted in Fig. 3 is followed]

    205 I

    200

    195

    '" 190IIIenc185co

    c0

    "@ 180.9Q;

    175a.ea.S 170

    165

    160

    ..,

    • •

    •• 1'r •,.-...- ..••••••• t; I•• _'Ii ~

    • • ••I••.~. It'. I.,r. ,

    .. ~. I•••155 ----,--.-.------.--------.~------r____--- ----------155 165 175 185 195 205

    •• •

    ••

    Peptide torsion angle co

    Fig. 7-Correlation between the peptide torsion angles coandimproper torsion angles e

  • 240 INDIAN 1. BIOCHEM. BIOPHYS., VOL 41, OCTOBER 2004

    hydrogen bonding in DNA-protein recogrutionprocesses. The inadequate number of hydrogen bondsreported, so far (on the basis of planar peptide units),in the crystal structures of DNA-protein and DNA-drug complexes22-24 has led attempts to find outalternative mechanism for specific DNA-ligandrecognition, such as shape-complementarity, water-mediated interactions, etc.24-26. Consideration ofpyramidal character of peptide nitrogen bonds mayindicate importance of hydrogen bonding interactionas the dominant process in protein-DNA recognition.

    Acknowledgement

    We thank Ms Jhuma Das of the Dept. of Physics,University of Pune, India and Ms Ranjita GhoshMullick of the Dept. of Biochemistry, University ofCalcutta for computational assistance. One of us(RM) thanks the Council of Scientific and IndustrialResearh (CSIR), India for the grants and facilitiesunder the Emeritus Scientist Scheme.

    References

    Schultz G E & Schirmer R H (1979) Principles of ProteinStructure in Springer Advanced Text in Chemistry (Cantor CR, ed.) Springer Verlag, New York

    2 Ramachandran G N, Laksminarayan A V & Kolaskar A S,(1976) Biochim Biophys Acta, 303, 8-13

    3 Winkler F K & Dunitz 1 D (1975) Acta Cryst B 31, 268-2694 Winkler F K & Dunitz J D (1975) Acta Cryst B 31, 273-2755 Winkler F K & Dunitz J D (1975) Acta Cryst B 31,276-2786 Winkler F K & Dunitz J D (1975) Acta Cryst B 31, 278-281

    7 Winkler F K & Dunitz 1 D (1975) Acta Cryst B 31, 286-2888 Dunitz 1 D & Winkler F K (1975) Acta Cryst B 31, 251-2639 Watenpaugh K D, Sieker L C & Jensen L H (1979) J Mol

    Bioi, 131 509-53210 MacArthur M W & Thornton J M (1996) J Mol Bioi 264,

    1180-119511 Head-Gordon T, Head-Gordon M, Frisch M J, Brooks III C

    L & Pople J A (1991) JAm Chem Soc 113, 5989-599712 Edison A E (2001) Nature Struct Bioi 8, 201-20213 Berman H M, Westbrook J, Feng Z, Gilliland G, Bhat T M,

    Weissig H, Shinbyalov N & Bourne P E (2000) Nucl AcidsRes 28, 235-242

    14 Frishman D & Argos P (1995) Proteins 23, 566-57915 Chou P Y & Fasman G D (1974) Biochemistry 13,211-22216 Snedecor G W & Cochran W G (1989) Statistical Methods,

    8th edn., Iowa State University Press, Ames, Iowa17 Shamovsky I L, Ross G M & Riopelle R J (2000) J Phys

    Chem B 104,11296-1130718 Herzberg 0 & Moult J (1991) Proteins: Struct Funct Genet

    11,223-22919 Merritt E A, Kuhn P, Sarfaty S, Erbe J L, Holmes R K & Hol

    W G J (1998) J Mol Bioi 282, 1043-105920 Bhattacharyya D & Majumdar R (2001) Indian J Biochem

    Biophys, 38,16-1921 Bhattacharyya D, Kundu S, Thakur A R & Majumdar R

    (1999) J Biomol Struct Dynam 17,289-30022 Werner M H, Gronenborn A M & Clore G M (1996) Science

    271,778-78423 Tabernero L, Bella J & Aleman C (1996) Nucl Acids Res 24,

    3458-346624 Pjura P E, Grzeskowiak K & Dickerson R E (1987) J Mol

    Bioi 197, 257-27125 Wemmer D (1998) Nature Struct Bioi 5, 169-17126 Janin J (1999) Structure 7, R277-R279