MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics...

28
Structural Bioinformatics Lecture 1 Xavier Daura Institut de Biotecnologia i de Biomedicina Module 2: Core Bioinformatics MSc in Bioinformatics

Transcript of MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics...

Page 1: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

StructuralBioinformatics

Lecture1XavierDaura

Institut deBiotecnologia i deBiomedicina

Module2:CoreBioinformatics

MScinBioinformatics

Page 2: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 2StructuralBioinformatics.Lecture1.

• Smallmolecules• Lipids• Vitamins• Hormones• Neurotransmitters• Metabolites• …• combinationsandintersections

• Monomers,oligomersandpolymers• Aminoacids,oligopeptides (<20),polypeptides(<50),proteins• Monosaccharides,oligosaccharides(<10),polysaccharides• Nucleotides,oligonucleotides(<30,s-stranded),polynucleotides(s-stranded),

nucleicacids(DNA,RNA)• Isoprene,terpenes,polyterpenes (e.g.rubber)

Biomolecules

Page 3: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 3StructuralBioinformatics.Lecture1.

StructuralBiology

Concernedwiththemolecularstructureofbiologicalmacromolecules,especiallyproteinsandnucleicacids,howtheyacquirethestructurestheyhave,andhowalterationsintheirstructuresaffecttheirfunction.

Hammerheadribozyme B-DNAMyoglobin

Page 4: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 4StructuralBioinformatics.Lecture1.

Proteins:themostversatilemacromolecules

Page 5: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 5StructuralBioinformatics.Lecture1.

Proteins:themostversatilemacromolecules

Page 6: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Thereare20aminoacidscommoninproteins

Course2017-18 6StructuralBioinformatics.Lecture1.

Aminoacids

Page 7: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 7StructuralBioinformatics.Lecture1.

Aminoacids

Page 8: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Proteinsarelinearpolymersofaminoacidsconnectedbyamidebonds

Course2017-18 8StructuralBioinformatics.Lecture1.

Peptidebond

Page 9: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Thepropertiesofthepeptidebondhaveimportanteffectsonthestabilityandflexibilityofpolypeptidechains

Course2017-18 9StructuralBioinformatics.Lecture1.

Degreesoffreedomofthepolypeptidebackbone

Page 10: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Therearefourlevelsofproteinstructure

Thenumberofproteinfolds(tertiarystructures)islargebutlimited

Course2017-18 10StructuralBioinformatics.Lecture1.

Proteinstructurelevels

Page 11: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Stericconstraintsdictatethepossibletypesofsecondarystructure

Course2017-18 11StructuralBioinformatics.Lecture1.

Secondarystructureelements

Page 12: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 12StructuralBioinformatics.Lecture1.

Secondarystructureelements

Page 13: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 13StructuralBioinformatics.Lecture1.

Secondarystructureelements

Page 14: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

14

• Biologicallyrelevantmetalshaveahighenvironmentalavailability

• Manyalkali,alkaliearthandtransitionmetalsarenaturalbiometals:– Na,K,Mg,Ca– Mn,Fe,Co,Ni,Cu,Zn,...

• Theirinvolvementintheproteincanbe:– Folding– Functional

Metalions

Course2017-18 StructuralBioinformatics.Lecture1.

Page 15: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

15

• Inmanycases,livingorganismshaveundertakentheassimilationof organicorinorganicentitiestoincludespecificfunctionalityinproteins

Prostheticgroups

Course2017-18 StructuralBioinformatics.Lecture1.

Page 16: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Foldedproteinsarestabilizedmainlybyweaknoncovalentinteractions

Formostproteinsthedifferenceinfreeenergybetweenthefoldedandunfoldedstatesissmall,i.e.onlyabout10timestheaveragethermalenergyavailableatphysiologicalconditions

Course2017-18 16StructuralBioinformatics.Lecture1.

Proteinfoldingandstability

Page 17: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Thefoldedstructureofaproteinisdirectlydeterminedbyitsprimarystructure

Thebalanceofself-interactions,interactionswithwater(andothercomponentsifpresent)andentropydrivesproteinfoldingandstabilizesthefoldedstructure

Itisallaboutthermodynamicsandkinetics,asusual

Course2017-18 17StructuralBioinformatics.Lecture1.

Proteinfoldingandstability

Funnelmodelofproteinfolding

Dill&MacCallum.Science2012,338:1042

!ΔG = ΔH−TΔSHydrophobiccollapse

Page 18: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 18StructuralBioinformatics.Lecture1.

X-raydiffraction

• AnX-raystrikinganelectronproducessecondarysphericalwavesemanatingfromtheelectron(elasticscattering)

• Aregulararrayofscatterers producesaregulararrayofsphericalwaves• Althoughthesewavescanceloneanotheroutinmostdirectionsthroughdestructive

interference,theyaddconstructivelyinafewspecificdirections,determinedbyBragg'slaw:

• Thesespecificdirectionsappearasspotsonthediffractionpatterncalledreflections• X-raysareusedtoproducethediffractionpatternbecausetheirwavelengthλ istypically

thesameorderofmagnitude(1–100Å)asthespacingd betweenplanes

!!2d sinθ = nλ

Page 19: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 19StructuralBioinformatics.Lecture1.

Single-crystalX-raydiffraction

• AbeamofmonochromaticX-rayscollimatedtoasingledirectionstrikesasinglecrystal,producingscatteredbeams

• Thebeamsproduceadiffractionpatternofspots(reflections)onthedetector• Theintensitiesandanglesofthesebeamsarerecordedasthecrystalisgraduallyrotated• Multipledatasetsarecollected,witheachsetcoveringslightlymorethanhalfafull

rotationofthecrystalandtypicallycontainingtensofthousandsofreflections• Thesedataarecombinedcomputationallywithcomplementarychemicalinformationto

produceandrefineamodelofthearrangementofatomswithinthecrystal

Page 20: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 20StructuralBioinformatics.Lecture1.

• Thedatacollectedfromadiffractionexperimentisareciprocalspacerepresentationofthecrystallattice

• Thepositionofeachdiffractionspotisgovernedbythesizeandshapeoftheunitcell,andtheinherentsymmetrywithinthecrystal

• Theintensityofeachdiffractionspotisproportionaltothesquareofthestructurefactoramplitude

• Thestructurefactorisacomplexnumbercontaininginformationrelatingtoboththeamplitudeandphaseofawave

• Inordertoobtainaninterpretableelectrondensitymap,bothamplitudeandphasemustbeknown

• Thephasecannotbedirectlyrecordedduringadiffractionexperiment• Initialphasescanbeassignedby:

• Ab initiophasingordirectmethods:smallmolecules• Molecularreplacement:knownrelatedstructure• AnomalousX-rayscattering:introducinganomalouslydiffractingatoms(e.g.SeMet)• Multipleisomorphous replacement:introducingheavyatoms

Thephaseproblem

Page 21: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 21StructuralBioinformatics.Lecture1.

• Allisotopesthatcontainanoddnumberofprotonsand/orofneutronshaveanintrinsicmagneticmomentandangularmomentum(anon-zeronuclearspin),whichgivesrisetodifferentenergylevelsandresonancefrequenciesinamagneticfield

NMRinvolvestwosequentialsteps:• Thealignment(polarization)ofthemagneticnuclearspinsinanapplied,constant

magneticfieldH0

• Theperturbationofthisalignmentofthenuclearspinsbyemployinganelectro-magnetic,usuallyradiofrequencypulse.Therequiredperturbingfrequencyisdependentuponthestaticmagneticfield(H0)andthenucleiofobservation

• Electronicshieldingreducesthemagneticfieldatthenucleus(whichiswhatdeterminestheNMRfrequency).Asaresult,thefrequencyrequiredtoachieveresonanceisreduced:thisshiftintheNMRfrequencyiscalledchemicalshift

Nuclearmagneticresonance

Page 22: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

• Thesignalsareaconsequenceofrelaxationofthenucleitoequilibrium• Two-dimensionalNMRspectra(e.g.COSY,NOESY)provideinformationonthe

interactionsbetween1Hnucleithroughcovalentbonds(through-bondJcouplings)orthroughspace(nuclearOverhauser effect,NOE)

• Onlyatomswithin5Å ofeachothershowaNOE,anditsmagnitudevarieswiththedistancebetweenthem

• Thestructureoftheproteinmaybereconstructedfromasetofinter-proton(upper-bound)distancesinferredfromtheNOEintensities

Course2017-18 22StructuralBioinformatics.Lecture1.

NMRspectroscopy

Page 23: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Theresolutionofastructuregivesanideaofitsprecision.Theresolutionisdeterminedbytheamountofdatameasured:themoredatathehighertheresolution

• X-ray:adistanceinÅ indicatingthatanytwoatomsseparatedbymorethanaboutthisdistancewillappearasseparatemaximaintheelectrondensitycontourplot.1Å isatomicresolution

• NMR:doesnothaveaproperresolutionmeasure.Theatomic-positionroot-mean-squaredifferencebetweenthevariousmodelsgeneratedisgivenasanindicator

Course2017-18 23StructuralBioinformatics.Lecture1.

Precisionofanexperimentalstructure

Page 24: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

X-raycrystallography:• Theproteinmustbecrystallisable• Itdoesnotprovideinformationondynamics

NMRspectroscopy:• Hasaprotein-sizelimit:normally70kDa,150kDa withTROSY• Theproteinmustbesolublewithoutaggregatingatconcentrationsclosetothoseofaproteininacrystallattice

• LessprecisethanX-raycrystallography(lessdataandlessprecisedata)

Both:• Theexperimentalconditionsarefarfromphysiologicalconditions:thestructuremaybeaffected

• Theremightbedisorderedregionsforwhichthestructurecannotbeassessed

• Theexperimentaldatahastobemanipulatedbeforeitcanbetransformedintocoordinates.Thismanipulationincludesanumberofassumptions(especiallyinNMR):experimentalstructuresarealsomodels

Course2017-18 24StructuralBioinformatics.Lecture1.

Limitationsofexperimentalstructuredetermination

Page 25: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

• PolarresiduescanseetheirpKa substantiallyalteredwhenpassingfromawatermediumtoainnerproteinmedium

• Sincemanysimulationmodelshaveanexplicitatomicrepresentation,oneneedstohaveaprettygoodideaoftherealprotonationstate

• Wrongprotonationstatescanleadto:– Modifieddynamics– Wronghydrogen-bondingnetworkindockings– etc…

• Histidine isthemostimportantresiduetotakecareof• Asimpleneighbourhooddistanceanalysisisoftensufficientforaninitialguess.

ReasonablyaccuratepKa calculationswillbeinothercasesneeded.

Course2017-18 25StructuralBioinformatics.Lecture1.

Theprotonationproblem

!!pKa = −log10Ka

!ka =

A−⎡⎣ ⎤⎦ H+⎡⎣ ⎤⎦HA⎡⎣ ⎤⎦

Page 26: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 26StructuralBioinformatics.Lecture1.

pKas ofaminoacidsinwater

Page 27: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Thefiguresinthissectionbelongto"ProteinStructureandFunction",byPetsko andRinge,NewSciencePress,2004,andweredownloadedfrom:http://www.new-science-press.com/browse/protein/resources/#illustrations

Otherrecommendedbooks:

"IntroductiontoProteinStructure",byBranden andTooze,GarlandPublishing,2ndedition,1999

"Proteins:StructuresandMolecularProperties",byCreighton,WHFreeman&Co.,2ndedition,1993

"StructureandMechanisminProteinScience:AGuidetoEnzymeCatalysisandProteinFolding",byFersht,WHFreeman&Co.,1999

Course2017-18 27StructuralBioinformatics.Lecture1.

Bibliography

Page 28: MSc in Bioinformatics - UABbioinf.uab.es/xavier/Module2-StructBiol-Lecture1.pdfMSc in Bioinformatics Module 2: Core Bioinformatics Folded proteins are stabilized mainly by weak noncovalent

MScinBioinformatics Module2:CoreBioinformatics

Course2017-18 28StructuralBioinformatics.Lecture1.

3DproteinstructuresPDB,http://www.rcsb.org/pdb/CheckingstructuresPROCHECK,onlineathttp://www.ebi.ac.uk/pdbsum/ProteindatabaseUniProt,http://www.uniprot.org/StructuralclassificationofproteinsSCOP,http://scop.mrc-lmb.cam.ac.uk/scop/Proteinstructurecomparisonhttp://ekhidna2.biocenter.helsinki.fi/dali/DomainfamiliesPfam,http://pfam.xfam.org/SecondarystructurePSIPred,http://bioinf.cs.ucl.ac.uk/psipred/SmallcompoundsDrugBank,http://www.drugbank.ca/

Sitesfortheexercises