Recombinant ProteinProduction and Crystallography · 2018-06-05 · • Fast Protein Liquid...

73
www.imb.de Modern Techniques in Life Science Lecture Recombinant Protein Production and Crystallography June 4, 2018 Prof. Dr. Eva Wolf Dr. Martin Möckel

Transcript of Recombinant ProteinProduction and Crystallography · 2018-06-05 · • Fast Protein Liquid...

  • www.imb.de

    Modern Techniques in Life Science Lecture

    Recombinant Protein Production and

    Crystallography

    June 4, 2018

    Prof. Dr. Eva WolfDr. Martin Möckel

  • www.imb.de

    Part I – Recombinant Protein Production - Outline

    Martin Mö[email protected]

    1 - Rationale behind recombinant proteins – what do we need them for?

    2 - Protein expression systems / organisms• Overview• Escherichia coli• Insect cells

    3 - How to access recombinant proteins – cell lysis and clearing techniques

    4 - Isolation of recombinant proteins – chromatography techniques • Principle of column chromatography• Fast Protein Liquid Chromatography (FPLC)• Mainstream column-chromatographic methods

    5 - Quality control of purified proteins

    typicalworkflow

    for proteinproduction

    process

  • www.imb.de

    1 - Rationale behind recombinant proteins – what do we need them for?

    Martin Mö[email protected]

  • www.imb.de

    1- Rationale behind recombinant proteins

    Martin Mö[email protected]

    Basic research: recapitulate in vivo findings -> generating direct proof for actions/interactions of

    specific proteins („reconstitution“ of certain in vivo situations) study biophysical characteristics and structure (see 2nd part of the lecture) as target for antibody production laboratory routine (enzymatic reactions, e.g. for molecular cloning) etc.

    Pharmaceutical industry: vaccines insulin therapeutic antibodies many more

  • www.imb.de

    2 - Protein expression systems / organisms

    Martin Mö[email protected]

  • www.imb.de

    2 - Overview of different expression systems

    Martin Mö[email protected]

    Characteristics E. coli Yeast Insect cells Mammalian cells

    Cell growth (time/division) rapid (30 min) rapid (90 min) slow (18-24 h) slow (24 h)

    Complexity of growth medium minimum minimum complex complex

    Cost of growth medium low low high high

    Expression level high low - high low - high low - moderate

    Extracellular expression secretion to periplasmsecretion to

    mediumsecretion to

    mediumsecretion to

    medium

    Protein folding refolding often requiredrefolding may be required proper folding proper folding

    N-linked glycosylation none high mannose simple, no sialic acid complex

    O-linked glycosylation no yes yes yesPhosphorylation (no) yes yes yes

    Acetylation no yes yes yesAcylation no yes yes yes

    gamma-Carboxylation no no no yes

  • www.imb.de

    2 - Escherichia coli

    Martin Mö[email protected]

    Escherichia coli is the most common expression system among prokaryotes.

    Advantages: fast growth easy transgene introduction heavy overexpression / large yields cheap

    Disadvantages: size limit of recombinant proteins app. 100 kDa lack of eukaryotic folding machinery -> might lead

    to misfolded/ aggregated overexpressed eukaryoticproteins

    lack of eukaryotic posttranslational modificationsmight impact on functionality of recombinanteukaryotic proteins

  • www.imb.de

    2 - Escherichia coli - expression vectors

    Martin Mö[email protected]

    Transgenes are introduced andmaintained using plasmids that mediate antibiotic resistance and harbour theirown origin of replication.

    Transgene expression is driven by a strong promoter in most cases. Often, transgene expression is controled by a lac-operon-like system (induced by theaddition of a stable galactose-thioglycosid, IPTG (Isopropyl-β-D-thiogalatopyranosid)

    In order to retain the recombinantprotein, the transgene needs an affinitytag (see slides for affinity chromato-graphy later).

    vector map derived using SnapGene®

  • www.imb.de

    2 - Insect cells

    Martin Mö[email protected]

    Sf9 cells

    Advantages: allows for the production of “difficult” eukaryotic

    proteins and big protein complexes strong overexpression can be achieved most of the

    times require only a 28°C incubator, no need for CO2 (in

    contrast to most mammalian cell systems) can be cultivated in shaker flasks (cheaper than working

    with mammalian cells)

    Disadvantages: long doubling time (app. 24 h, E.coli: 20 min!) tedious preparations required before expression (several

    weeks) -> parallel, instead of sequential work flow isessential (start with multiple possible constructs)!

    expensive media more difficult to handle (danger of contaminations)

    Cells derived from the moth Spodoptera frugiperda are themost common system for eukaryotic protein production.

  • www.imb.de

    2 - Insect cells - induction of recombinant protein production

    Martin Mö[email protected]

    Sf9 cells

    The baculovirus system can be used to deliver transgenes into cultured insect cells using recombinant virus particles.

    Advantage: Baculoviruses can be used in a S1 laboratory environment (compared e.g. to lentiviral infection of mammalian cells that has to be performed in a S2 lab)

    Virion of AcMNPV/ baculovirus:

    infection

    natural occuring baculoviruses: infect subset of insects infected insects disintegrate and contaminate

    leaves/plants -> larvae that feed on contaminatedplants are infected

  • www.imb.de

    2 - Insect cells - baculovirus production

    Martin Mö[email protected]

    Baculoviruses can be produced by transfecting a huge plasmid, the „bacmid“, into insect cells. The recombinant bacmid is produced in E.coli. It contains all required genes to produce the

    virus and can be modified, so that it also contains the desired transgene that will be expressedin infected insect cells.

    After successfull transfection, virus particles are collected from the media and can beamplified in order to have enough virus to infect a sufficient amount of cells for proteinproduction.

    It takes two to three weeks from cloning unil the production recombinant proteins.

    picture from Invitrogens´ „Bac-to-Bac®Baculovirus Expression System“ manual

  • www.imb.de

    3 - How to access recombinant proteins – cell lysis and clearing techniques

    Martin Mö[email protected]

  • www.imb.de

    3 - Cell lysis techniques

    Martin Mö[email protected]

    intracellular protein

    requires cell lysisextracellular proteinsecretion-> can be retrieved fromthe media (or periplasmin the case of E.coli)

    transgeniccell

    separation of cells from media using centrifugation or filtration

    use of the supernatant forprotein purification,disposal of cells

    antibodies,many pharma-ceutical proteins

    many researchand biotechproteins

    use of the cell pellet forprotein purification, cells need to be lysed!

    extracellularprotein

    intracellularprotein

  • www.imb.de

    3 - Cell lysis techniques

    Martin Mö[email protected]

    Chemical lysis: enzymatic digestion of cell wall detergents

    Osmotic shock

    Freeze-thaw cycles

    Physical lysis: french press microfluidizer douncer homogenizer sonication

    Strong sonication can heat the sample, which mightlead to the unfolding and precipitation of proteins.

    Ultrasonifier

    Douncer:

  • www.imb.de

    4 - Isolation of recombinant proteins –chromatography techniques

    Martin Mö[email protected]

  • www.imb.de

    4 - Isolation of recombinant proteins

    Martin Mö[email protected]

    absorptive

    non-absorptive

    red: negatively-charged surfaceblue: positively-charged surface

  • www.imb.de

    4 - 1: Principle of liquid column chromatography

    Martin Mö[email protected]

    Total column volume = volume of mobile phase +

    volume of stationary phase

    Solid/stationaryphase

    Liquid/mobilephase

  • www.imb.de

    4 - 2: Fast Protein (Performance) Liquid Chromatography (FPLC)

    Martin Mö[email protected]

    An FPLC is excellent to streamline multi-step purifications and it allows moresophisticated purification/column setups.

    This makes purifications more reproducibleand reduces the hands-on time.

    Modern times protein purification is almostunimaginable without FPLC machines!

    columns can be exchanged

    proteinsabsorb at 280 nm

    chromatogram: Sironi et al., EMBO Journal, 2001, DOI 10.1093/emboj/20.22.6371

    pictures: GE-lifesciences and lifeserv.bgu.ac.il

  • www.imb.de

    4 - 3: column-chromatographic methods

    Martin Mö[email protected]

    affinity,(ion exchange)

    ion exchange,hydrophobic interaction

    gel filtration

    Multi-step purification setups are superior to single-step setups, as they allow for higherpurity (and often a more reproducible activity profile) of recombinant proteins:

  • www.imb.de

    4 - 3: column-chromatographic methods - affinity

    Martin Mö[email protected]

    Bind-elute principlewith specific bindingand elution conditions:

    Eluent competes withprotein for ligand binding:

    protein-concentratingeffect

    Affinity chromatography is a very selectivetechnique, therefore often used as a firststep to enrich recombinant proteins fromcell lysates. Most of the times, the proteinof interest needs to be equipped with an artificially introduced affinity tag.

    recombinant fusion protein

    high eluent concentration in bufferlow eluent concentration in buffer

  • www.imb.de

    4 - 3: column-chromatographic methods - affinity

    Martin Mö[email protected]

    Eluent

    There are various artificial affinity tags available, some examples are:

    recombinant fusion protein

    specific protease cleavage site allowsaffinity tag removal (while protein isbound to column or after elution)

  • www.imb.de

    4 - 3: column-chromatographic methods - ion exchange

    Martin Mö[email protected]

    Proteins are amphoteric:

    Ion exchange chromatography (IEX) separates molecules on the basis of differences in their net surface charge at a certain pH

    Bind-elute principle:

    binding at low ionic strength(and correct pH range) andelution using high ionicstrength (high salt) conditionsor adjusting the pH to theisoelectric point of the protein:

    high [salt]

    low [salt]

  • www.imb.de

    4 - 3: column-chromatographic methods - size exclusion

    Martin Mö[email protected]

    Size exclusion chromatography (SEC) separates molecules on the basis of theirsize (and shape). The matrix is inert (non-absorptive technique). Compatible with various buffers Ideal final polishing step Problem: dilution of the sample

    porous beads: small molecules enter thepores, while larger molecules do not/less:

    stationaryphase

    mobile phase

  • www.imb.de

    5 - Quality control of purified proteins

    Martin Mö[email protected]

  • www.imb.deMartin Mö[email protected]

    5 - Quality control of purified proteinsSDS-PAGE:

    staining of proteins with coomassie brilliantblue, molecular weight validation andidentification of impurities (other proteins)

    Analytical size exclusion chromatography:

    identification of proteinaggregates/oligomers

    protein complexes: (correct) stoichiometry

    Spectroscopy/spectrometry:

    Ultraviolet (UV) spectroscopy : identification of contaminating nucleic acids

    Infrared (IR) spectroscopy: secondary structure of protein

    Mass spectrometry: identity of correct protein sequence/length,

    post-translational modifications

  • www.imb.deMartin Mö[email protected]

    1 – Capture of His-tagged protein from E. coli lysate using a Ni-functionalized column (affinity):

    3 – Polish and homogeneity check using size exclusion:

    Exemplary purification flow

    2 – Intermediate ion exchange step/concentration:

  • MacromolecularX-ray crystallography

    Eva WolfStructural Chronobiology

    JGU and IMB Mainz

  • X-ray crystallography

    Why X-rays ?

    Why crystals and crystallography ?

    Abbe limit: resolution limited bywave length of used „light“ source→ use X-rays with l about 1 Å

    - Enhance X-ray diffraction signal by regular repetition(signal : noise)

    - Radiation damage

    Intensity Ihkl ~ number of unit cells

  • Dimensions of biological Structures

    Atoms Small Macromolecule Particle Organelle, Eucaryotic

    molecule Protein/DNA/RNA Bacteria Cells

    C-C- Ribose Domain Ribosome Mitochondria Erythrocyte

    bond (~ 4 Å) (2.5 - 5 nm) (~ 30 nm) Bacteria

    1.54 Å Virus (1-2 mm) (5-8 mm)

    (10-100 nm)

    0.1nm 1nm 10nm 100nm 1mm 10mM

    1 Å

  • Dimensions biological Structures - Suitable techniques for visualization

    Atoms Small Macromolecule Particle Organelle, Eucaryotic

    molecule Protein/DNA/RNA Bacteria Cells

    C-C- Ribose Domain Ribosome

    bond (~ 4 Å) (2.5 - 5 nm) (~ 30 nm)

    1.54 Å Virus (10-100 nm) (1-2 mm) (5-8 mm)

    0.1nm 1nm 10nm 100nm 1mm 10mM

    1 Å

    Light microscopy

    Resolution ~ l/2, > 250 nm

    Superresolution lightmicroscopy: ~ 10-20 nm

    Atomic resolution

    (~ 1 - 3 Å) :

    X-ray crystallographyl x-ray ca. 1 Å

    NMR

    Single particle EMl 100 keV e- : 0.037 Å

    Electron Microscopy

    Single particle

    10-20 Å negative stain

    (e.g. Uranylacetate)

    > 3 Å Cryo-EM

    (direct detector, motion

    correction, sample) Electron MicroscopyCell biology

  • X-ray crystallography

    NMR (nuclear magnetic resonance)

    Single particle EM

    Modelling

    Ab Initio vsHomologie

    Crystal, broad MW range, high resolution (typically up to1.5 Å)

    Solution, dynamics,MW typically < 30 kDa

    MW > 200 kDa

    resolution typically up to 3 Å

  • Resolution6 Å

    3.0 Å

    4.0 Å

    1.8 Å

    1.0 Å

    10 Å – cryo-EM

    Ring of

    6 subunits,

    hexamer

    0.5 Å

  • Macromolecular X-ray crystallography

    mg

    Cu Ka = 1.54 Å Variable l, usually around 1 Å

    FT

    F(hkl) = |F| * eia

    In vitro =„in glass“

    SDS-PAGE

    Rotating anode Synchrotron

    „PhaseProblem“

    X-ray diffraction pattern Electron density

  • Protein Expression (recombinant) and Purification

    Crystallization

    X-ray data collection

    Structure determination

    Phasing stategies, Electron density calculation

    Model building and Refinement

    Macromolecular X-ray crystallography

    Structure Validation and Evaluation (Geometry)

    mg of pure, homogeneous protein: Purity: SDS-PAGE > 95% pure.Non-aggregated, stable, folded,single conformational andoligomeric state: SEC, MALS/SLS, DLS, CD, SAXS

  • Crystallisation of Macromolecules

    Multiparameter search for „right“ conditions

    - pH

    - Precipitant : PEG (Polyethylenglykol), MPD (2-Methyl-2,4-Pentanediol), Ethanol, Isopropanol, Salt (e.g. AmSO4, LiSO4, NaCl, MgCl2, NaOAc)

    - Ionic strength

    - Additive (chemical helping crystallization)

    - Temperature (4°C, 20°C)

    - Protein construct (Fragment, Organism, Ligands)

    - Protein/Ligand: Concentration , Stoichiometry

    Screens to find good conditions.

    Crystallization robots and imaging systems.

  • Optimization of Protein Crystals

    Spherulites Clusters of needles

    Single needles („1D“)Plates („2D“)

    3D!

    < < <

    < <

    best

  • Crystallisation by Vapour Diffusion

    „Hanging drop“

    Crystallization plates:

    96 well 24 well

    Typical drop volumes:

    1µl + 1µl (24 well, manual)

    100 nl + 100 nl (96 well, robot)

    H2OReservoir:70 µl (96 well)0.5 - 1ml (24 well)

    „Sitting drop“

  • Crystallisation by Vapour diffusion: Phase diagram

    Hangingdrop

    H2O

    Drop: Reservoir + Protein

    Red arrows: concentration over time, developmentduring crystallisation. (minutes to weeks)

    Undersaturated

    Supersaturated

  • A typical screen result – 96 well

    Lots of precipitate in different conditions …..

    Hopefully at least one

    drop with a crystal …

  • If you don‘t get a crystal.. : Change theprotein construct

    Lots of precipitate in different conditions …..

  • Design crystallizing proteins by limited proteolysis

    212

    118

    66

    43

    29

    20

    14

    M Chymotrypsin GluC

    Identify flexible protein regionsthat are accessible and cleavedby proteases.

    Mass spec to determine domainboundaries of stable degradationproducts. Reclone, express, purify.

    Or change the organism, e.g. → →

  • Remove flexible parts→ better crystal packing

    Design crystallizing proteins by limited proteolysis –improve crystal packing

    Quite well packedalready

  • Protein Expression (recombinant) and Purification

    Crystallization

    X-ray data collection

    Phasing, Electron density calculation

    Model building and Refinement

    Macromolecular X-ray crystallography

    Structure Validation and Evaluation (Geometry)

  • General Setup

    X-ray source

    Optics:Mirrors,Monochromator Crystal

    Detector

  • Crystal freezing and Mounting

    Collect data at 100K (reduce radiation damage)

    - define cryogenic solution (e.g. add glycerol)

    - mount crystal in loops

    - freeze crystals in liquid nitrogen

    - nitrogen gas at data collection

    0°C = 273,15 K 100 K = -173.15°C

    Liquid nitrogen evaporates at 77 K (-196°C) at normal pressure.

  • X-ray sources in Macromolecular Crystallography

    Rotating Cu-Anode („home source“):

    characteristic monochromatic Cu-Ka-

    radiation (1.54Å)

    Detector: Image plate

    Rotate crystal at data collection

  • Synchrotron sources for macromolecular CrystallographyDESY-PETRA III

    (Hamburg)

  • Synchrotron: high brilliance & variable wave length

    → electrons travel in a ring accelerated perpendicular to magnetic field

    → tangentially emitted Synchrotron radiation

    → high brilliance (high intensity, low divergence), variable wave length

    → smaller crystals, larger unit cells (larger 3D-structures), better data quality, experimental phasing with anomalous signals, time resolved studies

  • Experimental setup at Synchrottron (ESRF)

    Detector:

    CCD

    (fast)

    Cryo stream, nitrogen gas

    crystal

  • Mount crystals manually: potentially safer, but takes longer and can be hard at 3 am at night

    Automatic sample changer – faster, not always reliable, but less work

  • Automatic sample changer at work – the scientist stays outside ..

    Pilatus: large area pixel detector

  • Reward: High resolution 3D–structures of

    multi-subunit macromolecular complexes

    Large/few unit cells, potentially small crystals → weak signal→ need high brilliance synchtotron radiation and sensitive detectors

    E.g.- Ribosome: Protein biosynthesis- Nucleosome: DNA-packaging, Chromatin- RNA-Polymerase II : Transcription

    Ribosome (1.6 MDa) Nukleosome (206 kDa) RNA-Polymerase II,

    ~ 500 kDa

  • Single particle Cryo-EM - recent developments

    Direct electron detector: electrons detected directly

    - more sensitive and much faster than CCD cameras→ record movies (many frames per second)

    New image-processing tools: correct images for beam-induced

    sample movements → movie processing and motion correction

    → Resolution up to 3 - 4 Å for larger complexes

    Advantage: don‘t need crystal

    Draw back: - very expensive equipement- sample preparation tedious- resolution not beyond 3 A

    (X-ray still higher resolution)- not very suitable for < 200 kDa

  • Examples for high resolution Cryo-EM structures

    Ion channel, ≈3 Å Hydrogenase, ≈ 3 Å

    Mitochondrial ribosome, ≈ 3 Å g-Secretase (170 kDa) ≈ 4.5 Å (Nature 2015: 3.4 Å)

  • Protein Expression (recombinant) and Purification

    Crystallization

    X-ray data collection

    Phasing, Electron density calculation

    Model building and Refinement

    Macromolecular X-ray crystallography

    Structure Validation and Evaluation (Geometry)

  • Steps to get a crystal structure – structure determination, model building

    mg

    Cu Ka = 1.54 Å Variable l, usually around 1 Å

    FFT

    F(hkl) = |F| * eia

    „Phase Problem“

  • Structure determination - The Phase Problem

    FFT

    F(hkl) = |F| * eia

    Amplitude Phase

    1. Phases of similar structure:

    Molecular Replacement

    2. Experimental phases:

    Heavy atoms or anomalous

    scatterers

    FFT

    Light and electrons:lens as physical

    Fouriertransformator

    → Microscope

    X-rays: no lens, calculate

    Fourietransformation (FFT)

  • Protein Expression (recombinant) and Purification

    Crystallization

    X-ray data collection

    Phasing, Electron density calculation

    Model building and Refinement

    Macromolecular X-ray crystallography

    Structure Validation and Evaluation (Geometry)

  • ???????

    Interpret Electron Density

  • Experimental density

    Skeleton of Protein

    (building program)

    Corrected Skeleton

    Ca-backbone based on corrected skeleton

    1

    2

    3

    4

    Interpret Electron density – Model building

  • Assign Sequence to electron density

    1. Known fold/structure (molecular replacement)

    → look for expected secondary structure elements

    (e.g. five-stranded antiparallel b-sheet, long a-Helix)

    → orientation known

    - Primary sequence alignments

    Experimental densityCa-Backbone of Protein

    Interpret Electron density – Model building

  • 2. Unknown Structure

    - Identify secondary structure elements (b-Sheet, a-Helix)

    - Look for Ligands/Cofactors (e.g. Nucleotide GDP)

    → define active centre

    → positive |Fobs| - |Fcalc| difference density

    GDP: 2.0 Å (Mg2+ yellow, H2O red)

  • Parallel or antiparallel ?

    parallel antiparallel

    Parallel and antiparallel ß-sheet

  • Antiparallel !

    parallel antiparallel

    Parallel and antiparallel ß-sheet

  • 2. Unknown Structure

    Look for amino acids with large side chains

    (Phe, Trp, Tyr)

    Se-MAD/SAD: Selenium positions (Se-Methionine),

    phasing by anomalous dispersion

    → positive anomalous difference density |F+| - |F-|

  • 2 Å

    Build Model

    Ribbon SurfaceLigand

    Refine

    Interpret Electron Density

  • Protein Expression (recombinant) and Purification

    Crystallization

    X-ray data collection

    Phasing, Electron density calculation

    Model building and Refinement

    Macromolecular X-ray crystallography

    Structure Validation and Evaluation (Geometry)

  • Evaluate Geometry of the Final Model

    Main chain torsion angles Ca – N (F, Phi) and Ca – C=O (Y, Psi)

    between peptide bonds → allowed values for secondary structures.

    Check residues with disallowed F,Y-angle combinations

    → Ramachandran Plot

    w=180°

    Ca

    Phi

    Psi

    R

  • 92.9% allowed (red)

    7.1 % additionally allowed(yellow)

    No residues disallowed.

    → good geometry

    Example:

  • Evaluate Structure : What can we learn ?

    Follow-up experiments ?

    IIGP1(Ghosh et al., 2004)

    PDB (Protein Data Bank), entry 1TQ4

    Fold / Topology:

    → Topology-Diagramm

    Similar structures in pdb ?

    → DALI-Server (www)

    Fold type: e.g. SCOP

    (Structural Classification of Proteins)

  • Analyse molecule surface for potential interaction sites withother molecules: electrosatic surface potential

    Example: Dimer of GTPase IIGP1 (Ghosh et al, 2004), surface charges

    Red: negative, blue: positive, white: hydrophobic.

    Positive

    Negative

  • Analyse Protein-Protein-

    Interactions of complexes

    IIGP1-Dimer: Interfaces I and II

    Structure based interface mutants

    Interface I (B): 800Å2 buried surface;

    helical interface

    Mutate: Leu44/Arg (steric),

    Lys48/Ala (hole),

    Lys48/Glu (charge reversal)

    Interface II (C):1200Å2 buried surface;

    GTPase Domains.

    Mutate: Met173/Ala (hole),

    Ser172/Arg (steric)

  • Analyse Protein-Ligand Interactions

    GDP GppNHp

    2,7Å

    Core Facilities Lecture Protein Production Crystallography slides MMFoliennummer 1Foliennummer 2Foliennummer 3Foliennummer 4Foliennummer 5Foliennummer 6Foliennummer 7Foliennummer 8Foliennummer 9Foliennummer 10Foliennummer 11Foliennummer 12Foliennummer 13Foliennummer 14Foliennummer 15Foliennummer 16Foliennummer 17Foliennummer 18Foliennummer 19Foliennummer 20Foliennummer 21Foliennummer 22Foliennummer 23Foliennummer 24Foliennummer 25Foliennummer 26

    imb-vl-master-ss2018-skript-ewolf