Structural Bioinformatics Analysis of Acid Alpha-Glucosidase
Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... ·...
Transcript of Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... ·...
![Page 1: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/1.jpg)
BIMM 143Structural Bioinformatics II
Lecture 12
Barry Grant
http://thegrantlab.org/bimm143
![Page 2: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/2.jpg)
Next Up:• Overview of structural bioinformatics • Motivations, goals and challenges
• Fundamentals of protein structure • Structure composition, form and forces
• Representing, interpreting & modeling protein structure • Visualizing and interpreting protein structures• Analyzing protein structures• Modeling energy as a function of structure • Drug discovery & Predicting functional dynamics
![Page 3: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/3.jpg)
Key concept: Potential functions describe a systems
energy as a function of its structure
Ener
gy
Structure/Conformation
![Page 4: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/4.jpg)
Two main approaches:(1). Physics-Based(2). Knowledge-Based
![Page 5: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/5.jpg)
Two main approaches:(1). Physics-Based(2). Knowledge-Based
![Page 6: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/6.jpg)
V(R) = Ebonded + Enon.bonded
For physics based potentials energy terms come from physical theory
![Page 7: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/7.jpg)
Sum of bonded and non-bonded atom-type and position based terms
V(R) = Ebonded + Enon.bonded
![Page 8: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/8.jpg)
V(R) = Ebonded + Enon.bonded
is itself a sum of three terms:Ebonded
![Page 9: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/9.jpg)
V(R) = Ebonded + Enon.bonded
is itself a sum of three terms:Ebonded
Ebond.stretch + Ebond.angle + Ebond.rotate
![Page 10: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/10.jpg)
V(R) = Ebonded + Enon.bonded
is itself a sum of three terms:Ebonded
Ebond.stretch + Ebond.angle + Ebond.rotate
Stretch
Angle
Rotate
![Page 11: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/11.jpg)
Bond Stretch
Bond Angle
Bond Rotate
Ebond.stretch
Ebond.angle
Ebond.rotate
![Page 12: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/12.jpg)
∑bonds
Kbsi (bi − bo)
∑angles
Kbai (θi − θo)
∑dihedrals
Kbri [1 − cos(niϕi − ϕo)
Bond Stretch
Bond Angle
Bond Rotate
![Page 13: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/13.jpg)
∑bonds
Kbsi (bi − bo)
∑angles
Kbai (θi − θo)
∑dihedrals
Kbri [1 − cos(niϕi − ϕo)
Bond Stretch
Bond Angle
Bond Rotate
![Page 14: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/14.jpg)
V(R) = Ebonded + Enon.bonded
is a sum of two terms:Enon.bonded
![Page 15: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/15.jpg)
V(R) = Ebonded + Enon.bonded
is a sum of two terms:Enon.bonded
Evan.der.Waals + Eelectrostatic
![Page 16: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/16.jpg)
V(R) = Ebonded + Enon.bonded
is a sum of two terms:Enon.bonded
Evan.der.Waals + Eelectrostatic
Stretch
Angle
Rotate
Non-bonded
![Page 17: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/17.jpg)
Evan.der.Waals + Eelectrostatic
Evan.der.Waals = ∑pairs.i.j
[ϵij(ro.ij
rij)12 − 2ϵij(
ro.ij
rij)6]
Eelectrostatic = ∑pairs.i.j
qiqjϵrij
Stretch
Angle
Rotate
Non-bonded
![Page 18: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/18.jpg)
V(R) = Ebond.stretch+Ebond.angle+Ebond.rotate+Evan.der.Waals+Eelectrostatic
}}
Ebonded
Enon.bonded
Total potential energyThe potential energy can be given as a sum of terms for: Bond stretching, Bond angles, Bond
rotations, van der Walls and Electrostatic interactions between atom pairs
![Page 19: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/19.jpg)
Now we can calculate the potential energy surface that fully describes the energy of a
molecular system as a function of its geometry En
ergy
(V)
Position (x)
Potential energy surface
![Page 20: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/20.jpg)
Now we can calculate the potential energy surface that fully describes the energy of a
molecular system as a function of its geometry
img044.jpg (400x300x24b jpeg)
Position (x)
Ener
gy (V
)
Potential energy surface
![Page 21: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/21.jpg)
Key concept: Now we can calculate the potential energy surface that fully describes the energy of a
molecular system as a function of its geometry
img044.jpg (400x300x24b jpeg)
Position (x)
Ener
gy (V
) • The forces are the gradients of the energy
F(x) = − dV/dx
![Page 22: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/22.jpg)
Moving Over The Energy Surface
•Energy Minimization drops into local minimum
•Molecular Dynamics uses thermal energy to move smoothly over surface
•Monte Carlo Moves are random. Accept with probability:
exp(−ΔV/dx)
img054.jpg (400x300x24b jpeg)
Position (x)En
ergy
(V)
![Page 23: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/23.jpg)
PHYSICS-ORIENTED APPROACHESWeaknesses
Fully physical detail becomes computationally intractableApproximations are unavoidable
(Quantum effects approximated classically, water may be treated crudely)Parameterization still required
StrengthsInterpretable, provides guides to designBroadly applicable, in principle at leastClear pathways to improving accuracy
StatusUseful, widely adopted but far from perfectMultiple groups working on fewer, better approxs
Force fields, quantumentropy, water effects
Moore’s law: hardware improving
![Page 24: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/24.jpg)
–Johnny Appleseed
Put Levit’s Slide here on Computer Power Increases!
![Page 25: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/25.jpg)
SIDE-NOTE: GPUS AND ANTON SUPERCOMPUTER
![Page 26: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/26.jpg)
SIDE-NOTE: GPUS AND ANTON SUPERCOMPUTER
![Page 27: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/27.jpg)
Two main approaches:(1). Physics-Based(2). Knowledge-Based
POTENTIAL FUNCTIONS DESCRIBE A SYSTEMS ENERGY AS A FUNCTION OF ITS STRUCTURE
![Page 28: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/28.jpg)
KNOWLEDGE-BASED DOCKING POTENTIALS
Histidine
Ligand carboxylate
Aromaticstacking
![Page 29: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/29.jpg)
Example: ligand carboxylate O to protein histidine NFind all protein-ligand structures in the PDB with a ligand carboxylate O
1. For each structure, histogram the distances from O to every histidine N2. Sum the histograms over all structures to obtain p(rO-N)3. Compute E(rO-N) from p(rO-N)
ENERGY DETERMINES PROBABILITY (STABILITY)
Boltzmann distribution
Ene
rgy
Pro
babi
lity
x
Boltzmann:
Inverse Boltzmann:
Basic idea: Use probability as a proxy for energy
![Page 30: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/30.jpg)
KNOWLEDGE-BASED POTENTIALSWeaknesses
Accuracy limited by availability of data
StrengthsRelatively easy to implementComputationally fast
StatusUseful, far from perfectMay be at point of diminishing returns
(not always clear how to make improvements)
![Page 31: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/31.jpg)
Computer Aided Drug Discovery
![Page 32: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/32.jpg)
Next Up:• Overview of structural bioinformatics • Motivations, goals and challenges
• Fundamentals of protein structure • Structure composition, form and forces
• Representing, interpreting & modeling protein structure • Visualizing and interpreting protein structures• Analyzing protein structures• Modeling energy as a function of structure • Drug discovery & Predicting functional dynamics
![Page 33: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/33.jpg)
THE TRADITIONAL EMPIRICAL PATH TO DRUG DISCOVERY
Compound library(commercial, in-house,
synthetic, natural)
High throughput screening (HTS)
Hit confirmation
Lead compounds(e.g., µM Kd)
Lead optimization(Medicinal chemistry)
Potent drug candidates(nM Kd)
Animal and clinical evaluation
![Page 34: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/34.jpg)
COMPUTER-AIDED LIGAND DESIGN
Aims to reduce number of compounds synthesized and assayed
Lower costs
Reduce chemical waste
Facilitate faster progress
![Page 35: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/35.jpg)
Two main approaches:(1). Receptor/Target-Based(2). Ligand/Drug-Based
![Page 36: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/36.jpg)
Two main approaches:(1). Receptor/Target-Based(2). Ligand/Drug-Based
![Page 37: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/37.jpg)
SCENARIO 1:RECEPTOR-BASED DRUG DISCOVERY
HIV Protease/KNI-272 complex
Structure of Targeted Protein Known: Structure-Based Drug Discovery
![Page 38: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/38.jpg)
PROTEIN-LIGAND DOCKING
VDW
Dihedral
Screened Coulombic+ -
Potential function Energy as function of structure
Docking softwareSearch for structure of lowest energy
Structure-Based Ligand Design
![Page 39: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/39.jpg)
STRUCTURE-BASED VIRTUAL SCREENING
Candidate ligands
Experimental assay
Compound database
3D structure of target(crystallography, NMR,
bioinformatics modeling)
Virtual screening (e.g., computational
docking)
Ligands
Ligand optimization Med chem,
crystallography, modeling
Drug candidates
![Page 40: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/40.jpg)
COMPOUND LIBRARIES
Commercial (in-house pharma) Government (NIH) Academia
![Page 41: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/41.jpg)
COMMON SIMPLIFICATIONS USED IN PHYSICS-BASED DOCKING
Quantum effects approximated classically
Protein often held rigid
Configurational entropy neglected
Influence of water treated crudely
![Page 42: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/42.jpg)
Do it Yourself!
Hand-on time!
You can use the classroom computers or your own laptops. If you are using your laptops then you will need
to install MGLTools
https://bioboot.github.io/bimm143_S19/lectures/#13
![Page 43: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/43.jpg)
Two main approaches:(1). Receptor/Target-Based(2). Ligand/Drug-Based
![Page 44: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/44.jpg)
e.g. MAP Kinase Inhibitors
Using knowledge of existing inhibitors to discover more
Scenario 2Structure of Targeted Protein Unknown:
Ligand-Based Drug Discovery
![Page 45: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/45.jpg)
Why Look for Another Ligand if You Already Have Some?
Experimental screening generated some ligands, but they don’t bind tightly enough
A company wants to work around another company’s chemical patents
An high-affinity ligand is toxic, is not well-absorbed, difficult to synthesize etc.
![Page 46: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/46.jpg)
LIGAND-BASED VIRTUAL SCREENING
Compound Library Known Ligands
Molecular similarityMachine-learning
Etc.
Candidate ligands
Assay
Actives
Optimization Med chem, crystallography,
modeling
Potent drug candidates
![Page 47: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/47.jpg)
CHEMICAL SIMILARITY LIGAND-BASED DRUG-DISCOVERY
Compounds(available/synthesizable)
Compare with known ligandsDifferent
Test experimentally
Similar
Don’t bother
![Page 48: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/48.jpg)
CHEMICAL FINGERPRINTSBINARY STRUCTURE KEYS
Molecule 1
Molecule 2
phen
yl
methyl
keton
eca
rboxyl
ate
amide
aldeh
yde
chlor
ine
fluori
ne
ethyl
naph
thyl
S-S bond
alcoh
ol …
![Page 49: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/49.jpg)
Molecule 1
Molecule 2
phen
yl
methyl
keton
eca
rboxyl
ate
amide
aldeh
yde
chlor
ine
fluori
ne
ethyl
naph
thyl
S-S bond
alcoh
ol …
CHEMICAL SIMILARITY FROM FINGERPRINTS
NI=2Intersection
NU=8Union
Tanimoto Similarity (or Jaccard Index), T
![Page 50: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/50.jpg)
+ 1
Bulky hydrophobe
Aromatic
5.0 ±0.3 Å 3.2 ±0.4 Å
2.8 ±0.3 Å
Pharmacophore ModelsΦάρμακο (drug) + Φορά (carry)
A 3-point pharmacophore
![Page 51: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/51.jpg)
Molecular DescriptorsMore abstract than chemical fingerprints
Physical descriptorsmolecular weightchargedipole momentnumber of H-bond donors/acceptorsnumber of rotatable bondshydrophobicity (log P and clogP)
Topologicalbranching indexmeasures of linearity vs interconnectedness
Etc. etc.
Rotatable bonds
![Page 52: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/52.jpg)
A High-Dimensional “Chemical Space”Each compound is a point in an n-dimensional space
Compounds with similar properties are near each other
Descriptor 1
Descriptor 2
Des
crip
tor 3
Point representing a compound in descriptor space
Apply multivariate statistics and machine learning for descriptor-selection. (e.g. partial least squares, PCA, support vector machines,
random forest, deep learning etc.)
![Page 53: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/53.jpg)
Proteins and Ligand are Flexible
+
Ligand
Protein
Complex
ΔGo
![Page 54: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/54.jpg)
Proteinase K
NMA (Normal Mode Analysis) is a bioinformatics method to predict the intrinsic dynamics of biomolecules
https://bioboot.github.io/bimm143_S19/lectures/#12
Do it Yourself!
![Page 55: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/55.jpg)
• Normal Mode Analysis (NMA) is a bioinformatics method that can predict the major motions of biomolecules.
NMA in Bio3D
pdb <- read.pdb("1hel") modes <- nma( pdb )m7 <- mktrj(modes, mode=7, file="mode_7.pdb")
view(m7, col=vec2color(rmsf(m7)))```
Then you can open the resulting mode_7.pdb file in VMD> Use "TUBE" representation and hit the play button...
![Page 56: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/56.jpg)
• If you want the 3D viewer in your R markdown you can install the development version of bio3d.view
• In your R console:
• To use in your R session:
> install.packages("devtools")> devtools::install_bitbucket("Grantlab/bio3d-view")> install.packages("rgl")
Bio3D view()
> library("bio3d.view")> pdb <- read.pdb("5p21")> view(pdb)> view(pdb, "overview", col="sse")> view(m7)
![Page 57: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/57.jpg)
• If you want the interactive 3D viewer in Rmd rendered to output: html_output document:
SideNote: view()
```{r}library(bio3d.view)library(rgl)```
```{r}modes <- nma( read.pdb("1hel") )m7 <- mktrj(modes, mode=7, file="mode_7.pdb")
view(m7, col=vec2color(rmsf(m7)))rglwidget(width=500, height=500)```
![Page 58: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/58.jpg)
Do it Yourself!
Hand-on time!
Focus on section 3 & 4 exploring NMA and PCA apps
https://bioboot.github.io/bimm143_S19/lectures/#13
![Page 59: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/59.jpg)
Reference SlidesMolecular Dynamics (MD) and Normal Mode Analysis
(NMA) Background and Cautionary Notes
[ Muddy Point Assessment ]
![Page 60: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/60.jpg)
PREDICTING FUNCTIONAL DYNAMICS
• Proteins are intrinsically flexible molecules with internal motions that are often intimately coupled to their biochemical function
– E.g. ligand and substrate binding, conformational activation, allosteric regulation, etc.
• Thus knowledge of dynamics can provide a deeper understanding of the mapping of structure to function
– Molecular dynamics (MD) and normal mode analysis (NMA) are two major methods for predicting and characterizing molecular motions and their properties
![Page 61: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/61.jpg)
McCammon, Gelin & Karplus, Nature (1977) [ See: https://www.youtube.com/watch?v=ui1ZysMFcKk ]
• Use force-field to find Potential energy between all atom pairs
• Move atoms to next state
• Repeat to generate trajectory
MOLECULAR DYNAMICS SIMULATION
![Page 62: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/62.jpg)
Divide time into discrete (~1fs) time steps (∆t)(for integrating equations of motion, see below)
t
![Page 63: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/63.jpg)
Divide time into discrete (~1fs) time steps (∆t)(for integrating equations of motion, see below)
t
![Page 64: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/64.jpg)
Divide time into discrete (~1fs) time steps (∆t)(for integrating equations of motion, see below)
At each time step calculate pair-wise atomic forces (F(t)) (by evaluating force-field gradient)
Nucleic motion described classically
Empirical force field
t
![Page 65: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/65.jpg)
Divide time into discrete (~1fs) time steps (∆t)(for integrating equations of motion, see below)
At each time step calculate pair-wise atomic forces (F(t)) (by evaluating force-field gradient)
Nucleic motion described classically
Empirical force field
Use the forces to calculate velocities and move atoms to new positions(by integrating numerically via the “leapfrog” scheme)
t
![Page 66: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/66.jpg)
BASIC ANATOMY OF A MD SIMULATIONDivide time into discrete (~1fs) time steps (∆t)(for integrating equations of motion, see below)
At each time step calculate pair-wise atomic forces (F(t)) (by evaluating force-field gradient)
Nucleic motion described classically
Empirical force field
Use the forces to calculate velocities and move atoms to new positions(by integrating numerically via the “leapfrog” scheme)
REPEAT, (iterate many, many times… 1ms = 1012 time steps)
t
![Page 67: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/67.jpg)
MD Prediction of Functional Motions “close”
“open”
Yao and Grant, Biophys J. (2013)
![Page 68: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/68.jpg)
• MD is still time-consuming for large systems• Elastic network model NMA (ENM-NMA) is an example
of a lower resolution approach that finishes in seconds even for large systems.
Atomistic
C. G.
• 1 bead / 1 amino acid
• Connected by springs
Coarse Grained
i
jrij
COARSE GRAINING: NORMAL MODE ANALYSIS (NMA)
![Page 69: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/69.jpg)
NMA models the protein as a network of elastic strings
Proteinase K
![Page 70: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/70.jpg)
Do it Yourself!
Hand-on time!
Focus on section 3 & 4 exploring NMA and PCA apps
https://bioboot.github.io/bimm143_S19/lectures/#13
![Page 71: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/71.jpg)
Ilan Samish et al. Bioinformatics 2015;31:146-150
![Page 72: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/72.jpg)
INFORMING SYSTEMS BIOLOGY?
Genomes
DNA & RNA sequence
DNA & RNA structure
Protein sequence
Protein families, motifs and domains
Protein structure
Protein interactions
Chemical entities
Pathways
Systems
Gene expression
Literature and ontologies
![Page 73: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/73.jpg)
• Structural bioinformatics is computer aided structural biology
• Described major motivations, goals and challenges of structural bioinformatics
• Reviewed the fundamentals of protein structure
• Explored how to visualize protein structure with VMD and use R to perform more advanced structural bioinformatics analysis!
• Introduced both physics and knowledge based modeling approaches for describing the structure, energetics and dynamics of proteins computationally
• Introduced both structure and ligand based bioinformatics approaches for drug discovery and design
SUMMARY
[ Muddy Point Assessment ]
![Page 74: Structural Bioinformatics II - GitHub Pages › bimm143_S19 › class-material › ... · 2019-06-03 · Structural Bioinformatics II ... img054.jpg (400x300x24b jpeg) Position (x)](https://reader035.fdocuments.in/reader035/viewer/2022081611/5f03a5397e708231d40a12a0/html5/thumbnails/74.jpg)
• A model is never perfect A model that is not quantitatively accurate in every respect does not preclude one from establishing results relevant to our understanding of biomolecules as long as the biophysics of the model are properly understood and explored.
• Calibration of parameters is an ongoing imperfect processQuestions and hypotheses should always be designed such that they do not depend crucially on the precise numbers used for the various parameters.
• A computational model is rarely universally right or wrongA model may be accurate in some regards, inaccurate in others. These subtleties can only be uncovered by comparing to all available experimental data.
CAUTIONARY NOTES