Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb...

18
Eukaryotic Gene Prediction Rui Alves

Transcript of Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb...

Page 1: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Eukaryotic Gene Prediction

Rui Alves

Page 2: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

How are eukaryotic genes different?

DNA

RNA PolmRNA

RybProtein

Page 3: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

How are eukaryotic genes different?

DNA

RNA Pol

RybProtein

mRNA mRNA

SpliceosomemRNA mRNA

Correctly Identifying Splicing sites is not a trivial task

Page 4: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

How do we predict splicing sites?

• By Homology

• Ab initio– SS motifs– Codon usage– Exonic Splicing Enhancers– Intronic Splicing Enhancers– Exonic Splicing Silencers– Intronic Splicing Silencers

Page 5: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Homology Splice Site Prediction

Known spliced gene

Predicted spliced gene

Page 6: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Splice Site Motifs

Page 7: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Exonic Splicing Enhancers

Page 8: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Exonic Splicing Silencers

Genes & Development 18:1241-1250

Page 9: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Interaction between SE and SI

Page 10: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Rules for Splicing

• 3’ end likely target for repression

• Distance between SE and 3’ end < 100bp

• Splicing efficiency p(interaction SEC-3’ end)

Page 11: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Methods for splicing detection

Training set

of

know spliced

genes

Algorithm

Test set

of

know spliced

genes

Set

of

know spliced

genes

GA, NN, HMM

Bayesian

GA, NN, HMM

Bayes,METest set

Predictions

Page 12: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

A Genetic Algorithm Method

Motif DM1 … AMi … EM

DM1

AM

p(i)

EM

IM

Shuffle lines and columns k times and each time calculate the probability of a given

combination of motifs getting spliced

Select m best combinations and continue to evolve the algorithm until it predicts training

set

Page 13: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

A Neural Net Method

Weight Table for splice

elements

Hidden Nodes

Sequences

Predicted Splicing

Corrected Weight Table for splice

elements

Page 14: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Summary

• Eukaryotic genes have exons

• Biological rules combined with mathematical and statistical approaches can be used to predict the boundaries for the exons and to predict the splice variants

Page 15: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

How to find what genes a string of DNA contains

Rui Alves

Page 16: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Simple steps

• Go to a known gene prediction server (or google for one)

• Input sequence and wait for prediction

• Get prediction(s), either as cDNA or as a tranlated protein sequence and do homology searches to identify them in a known database (e.g. NCBI or SWISSPROT)

Page 17: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Simple steps a)

• Go to a known gene prediction server (or google for one)

• Input sequence and wait for prediction

• Get prediction(s), either as cDNA or as a translated protein sequence and do homology searches to identify them

Page 18: Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.

Paper PresentationThe human genome (Science) vs. The human

genome (Nature)

Nature : Pages 875 to 901

Science: Pages 1317-1337

Compare the differences in methods and results for the annotation

DO NOT SPEND TIME TALKING ABOUT THE SEQUENCING OR ASSEMBLY ITSELF

Do not go into the comparative genome analysis