7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need...

26
7. 1D predictions

Transcript of 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need...

Page 1: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

7.1D predictions

Page 2: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Why do we need predictions?

Year 2017

Sequences 71 002 161

Structures 125 526

Page 3: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Types of predictions

Page 4: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

1D predictions

Secondary structure

Accessible surface

Signal sequences

Transmembrane regions

Coiled coils

http://ppopen.rostlab.org/

http://bioinf.cs.ucl.ac.uk/psipred/Pfam

Page 5: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Determination of secondary structure elements

It can be based on :Dihedral anglesHydrogen bondsGeometry

Automatic assignmentsDSSPSTRIDE

3 (alpha, beta, coil) or more categories ( pl. turn, other types of helices)

Don't agree completely

Page 6: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Predicting secondary structure elements from the sequence

Page 7: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Amino acids

Page 8: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Secondary structure elements

The various amino acids have different preferences for the secondary structure elements

Page 9: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Stages of secondary structure prediction methods

Page 10: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

PHDsec

Page 11: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Principles of prediction methods

I. Testing and training methodSeparate sets!!!

II. Evaluation

Per residue accuracyAmount predictedSegment overlap

Page 12: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions
Page 13: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Accuracy

Page 14: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Coiled Coils

Page 15: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Coiled Coil prediction

COILS : Ismert CC szakaszokhoz való hasonlóság (keratin, Myosin, troponin)

To exclude false positive change weights of hydrophobic residues (h)

PAIRCOIL: uses pair correlation of amino acid within heptad repeats

Prediction accuracy depends on length of coiled coil regions

lower accuracy for multiple coils

Page 16: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Membrane proteins

Important:

Energy productionTransportcell-cell connection

Drug targets

Page 17: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

TM proteins

Page 18: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Structure determination of TM proteins

TM proteins are no water soluble

They have to taken out from the membrane and solubilized

Detergents

Very few known structures (2%)

Information about the position of membrane is lost

(PDBTM , OPM)

Page 19: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Topology

Location of membrane spanning segments and their orientation relative to the membrane

Page 20: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Prediction of TM proteinsHydrophaty scales

Page 21: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Topology prediction

Omit cleaved segments

Topology prediction rules

Hydrophobicity (aa composition)Length distribution

Positive inside rules

More difficult cases: reentrant loop

Increasing accuracy

ML approaches (NN, HMM)

Multiple sequence alignments, profiles

Consensus methods

Experimental constraints

Page 22: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

HMMTOP

Page 23: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Signal sequences

Page 24: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Signal sequenceN-terminal signal sequence

Extracellular space, mitochondria, chloroplast

Depends on species and compartment

For example: secretory signal peptide usually 15-30 AA3 zones : Positive N-terminal, hydrophobic region, C-terminal polar with some charged

residues at the endFurther localozation singals and modes

Page 25: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

Prediction of localization

1. Based on sequenceCleavage site

PSSM,ML (NN, SVM, HMM)

LocalizationAA composition, other global features

2. Based on other information(eg. Expression level, phylogenetics, GO annotation

3. Specific domain, homology

Page 26: 7. 1D predictions - Eötvös Loránd University2017/01/01  · 7. 1D predictions Why do we need predictions? Year 2017 Sequences 71 002 161 Structures 125 526 Types of predictions

What happens if you submit a globular protein to transmembrane

predictor?

– In general, transmembrane topology prediction methods are not made to tell whether the protein sequence belongs to a globular or a transmembrane protein– Some methods can do this DAS http://mendel.imp.ac.at/sat/DAS/DAS.html

– Sometimes signal sequence prediction methods can help