The Ribosomal Protein Binding Site in Saccharomyces cerevisiae ...
"Common Characteristic of Sequences of Ribosomal Protein Gene Introns in Saccharomyces cerevisiae"
Michal Wolff
Advisors: Prof. Martin Kupiec (TAU), Dr. Ron Unger
Why? - There is something special about them
Number of intron-containing genes = 242
Number of ribosomal protein intron-containing genes = 105 (43%)
Number of ribosomal protein genes = 140
Ribosomal protein gene introns in yeast are larger: on average the ribosomal intron length is 405, while the non-ribosomal is 151
Why? - Intron size bias by gene class
[Figure: number of introns versus intron size (bp), 50 to 1050 bp; series: ribosomal gene introns, non-ribosomal gene introns]
What? - The sequence
Homology, consensus, the whole intron, areas in the intron, nucleotides.
What? - Secondary structure
Where? - Ares lab Yeast Intron Database
Gene/ORF Name: YMR230W
Description: Small ribosomal subunit protein S10B (rat S10).
Number of introns: 1
Ares Intron Name YMR230W_13_732413_733140_INTRON_53_462
Comments
Length Info (in nt) 410; to Branch base (lariat length) 371
Location relative to orf (nt) start 53; stop 462
Features GUAUGU UUACUAACAAGG AAUAG
Sequence
>YMR230W_13_732417_732466_PRE
GUUGAUGCCAAAGCAAGAAAGAAACAAAAUUCACCAAUACUUGUUUCAAG
>YMR230W_13_732413_733140_INTRON_53_462
GUAUGUUUGCAUUUUUUAGGUGAAAUAUGCAGUGAUAUGCUCCGAAAUGGAUAGCAAAGAUGAUAAAUGAAAAUACAAUUAAAUUGAACUUGAAUAUCAUUAAAAGUGGUAGAAAACCGAUUGUUUCAAAAAGAUCUUUGUGGACAAGUACGGCAGUGCAUGAAUCCGAAAGAAAAGUGCAACCUAAGCAGAAUUCCAUUAUUUCACGCGUCCAUUUUUAUAAUGUUUGAACUUUUGAGUCCUUAUAGAAAUGCUAAUAUUAUGAUCAAAGCGAUUAUAUCAUUUACUUUACAGUUUCCGGAGUUUCCGUAUUAUUGCAAAAGGACCACAAACAAUUGAAAUCAUGAUAUCGUAAGAAAUAUUAUUACUAACAAGGAAGAUCAUUGAAUUACGAUCGCAUAUCGAAAUAG
>YMR230W_13_732875_732924_POST
AAGGUGUUGUCGUCGCUAAGAAGGAUUUCAACCAAGCUAAGCACGAAGAA
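The landmarks listed under "Features" (5' splice site, branchpoint, 3' end) can be located in an intron sequence with a few lines of code. The sketch below is in Python, not the Perl scripts used in the project, and the function name `intron_landmarks` is hypothetical. It assumes the branchpoint motif is UACUAAC and that the branch base is the last A of that motif. Under that reading, the lariat length of 371 listed for the 410 nt YMR230W intron appears to be the 1-based position of the branch A.

```python
def intron_landmarks(intron, branch_motif="UACUAAC"):
    """Locate the 5' splice site, branch base and 3' end of one intron.

    Assumption: the branch base is the last A of the branchpoint motif,
    and the motif occurrence closest to the 3' end is the real one.
    """
    intron = intron.upper().replace("T", "U")
    start = intron.rfind(branch_motif)  # 0-based start of the motif
    if start == -1:
        return None  # no canonical branchpoint found
    # 1-based position of the branch A within the intron
    branch_pos = start + branch_motif.rfind("A") + 1
    return {
        "length": len(intron),
        "five_prime": intron[:6],
        "branch_pos": branch_pos,
        "branch_to_3prime": len(intron) - branch_pos,
        "three_prime": intron[-3:],
    }


# Toy intron (hypothetical, not taken from the database)
toy = "GUAUGU" + "ACGUACGUACGU" + "UACUAAC" + "AAGG" + "UUUUU" + "AAUAG"
info = intron_landmarks(toy)
print(info)
```

The `branch_to_3prime` value is the length of the branchpoint-3' area that the later result slides compare between the two gene classes.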
How? – Perl & Bioinformatics Tools
Perl - originally designed for text processing
EMBOSS – sequence alignment
mfold – secondary structure
BLAST – all against all
How? – Data Analysis
Microsoft Excel: charts, sort, graphs
χ² test - expected values taken from the non-ribosomal behavior
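A χ² statistic with the non-ribosomal introns as the expectation can be sketched as follows. This is a Python illustration, not the Excel workflow used in the project. The function name `chi_square` and the counts are hypothetical; only the idea of taking the expected values from the non-ribosomal behavior comes from the slide.

```python
def chi_square(observed_counts, expected_fractions):
    """Chi-square statistic: sum((O - E)^2 / E) over all categories.

    observed_counts: category -> count in the ribosomal introns
    expected_fractions: category -> fraction seen in the non-ribosomal introns
    """
    total = sum(observed_counts.values())
    stat = 0.0
    for category, observed in observed_counts.items():
        expected = expected_fractions[category] * total
        stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical counts, for illustration only
ribosomal_counts = {"A": 30, "C": 10, "G": 10, "U": 50}
non_ribosomal_fractions = {"A": 0.3, "C": 0.2, "G": 0.1, "U": 0.4}
stat = chi_square(ribosomal_counts, non_ribosomal_fractions)
print(round(stat, 3))
```

With four nucleotide categories there are 3 degrees of freedom, so the statistic would be compared against the χ² critical value for 3 degrees of freedom (7.815 at the 0.05 level).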
Results – upstream of the intron
[Figure: glutamine (CAA); probability versus nucleotides upstream of the intron (0 to 50); series: upstream ribosomal introns, upstream non-ribosomal introns]
Results – downstream of the intron
[Figure: lysine (AAG); probability versus nucleotides downstream of the intron (0 to 50); series: downstream ribosomal introns, downstream non-ribosomal introns]
Results – in the intron
Hexamers - The biggest difference (30-40%) is in the AU-rich sequences of the ribosomal introns.
Octamers - The ribosomal introns had more polyU octamers than the non-ribosomal ones.
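A hexamer or octamer comparison of this kind can be sketched as a k-mer frequency count over each group of introns. The Python below is an illustration under assumptions: the function names and toy sequences are hypothetical, and the normalization (k-mer count divided by all k-mer windows in the group) may differ from the one used in the project.

```python
from collections import Counter


def kmer_frequencies(seqs, k):
    """Relative frequency of every k-mer over all windows in a group."""
    counts = Counter()
    for seq in seqs:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def biggest_differences(group_a, group_b, k, top=5):
    """k-mers ranked by absolute frequency difference (group_a - group_b)."""
    freq_a = kmer_frequencies(group_a, k)
    freq_b = kmer_frequencies(group_b, k)
    diffs = {
        kmer: freq_a.get(kmer, 0.0) - freq_b.get(kmer, 0.0)
        for kmer in set(freq_a) | set(freq_b)
    }
    return sorted(diffs.items(), key=lambda kv: abs(kv[1]), reverse=True)[:top]


# Toy groups (hypothetical sequences)
ribosomal = ["UUUUUUUU"]
non_ribosomal = ["ACGUACGU"]
ranked = biggest_differences(ribosomal, non_ribosomal, k=6)
print(ranked[0])
```

Calling the same function with `k=8` gives the octamer comparison.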
Results - in the intron
In the region 10-20 nucleotides upstream of the 3’ end, the ribosomal introns have a high probability of containing a polyU sequence.
The area between the branchpoint and the 3’ splice site is around 30 bases long.
UUU, AUU, UUA
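The probability of finding a triplet such as UUU at a given distance from the 3' end can be estimated as the fraction of introns that carry it at that offset. The Python sketch below assumes this per-position definition; the function name `positional_probability`, the offset convention (offset 0 means the motif ends exactly at the 3' end) and the toy sequences are hypothetical.

```python
def positional_probability(introns, motif, window=50):
    """Fraction of introns with `motif` ending `offset` nt before the 3' end.

    Returns a list indexed by offset (0 .. window - 1). Introns too short
    to cover a given offset are skipped for that offset.
    """
    probs = []
    for offset in range(window):
        hits = 0
        usable = 0
        for seq in introns:
            end = len(seq) - offset
            start = end - len(motif)
            if start < 0:
                continue  # intron too short for this offset
            usable += 1
            if seq[start:end] == motif:
                hits += 1
        probs.append(hits / usable if usable else 0.0)
    return probs


# Toy introns (hypothetical sequences)
toy_introns = ["ACGUUUAG", "GGGACUAG"]
probs = positional_probability(toy_introns, "UUU", window=4)
print(probs)
```

Running it once per gene class and per triplet (UUU, AUU, UUA) yields the curves to compare.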
Results - in the intron
[Figure: UUU; probability versus area in intron; series: ribosomal - upstream 3' end, non-ribosomal - upstream 3' end]
Results – in the branchpoint-3’
Distribution of nucleotides in the branchpoint-3' end area (percentage in sequence)
                        A        C        G        U
ribosomal introns       29.80%   12.38%   8.67%    49.15%
non-ribosomal introns   30.53%   16.49%   13.96%   39.02%
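A nucleotide distribution like the one above can be computed by pooling the branchpoint-3' regions of each gene class and counting bases. The Python sketch below uses a hypothetical function name and toy sequences; whether the project pooled the sequences or averaged per-intron percentages is not stated on the slide, so pooling is an assumption.

```python
from collections import Counter


def composition(seqs, alphabet="ACGU"):
    """Percentage of each nucleotide over all pooled sequences."""
    counts = Counter()
    for seq in seqs:
        counts.update(seq)
    total = sum(counts[base] for base in alphabet)
    if total == 0:
        return {base: 0.0 for base in alphabet}
    return {base: 100.0 * counts[base] / total for base in alphabet}


# Toy branchpoint-3' regions (hypothetical sequences)
regions = ["AUUU", "UUCG"]
percentages = composition(regions)
print(percentages)
```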
Results – in the branchpoint-3’
Common subsequences: A-polyU, polyU, polyU-A
U subsequence with a mutation:
In the area – more in the ribosomal introns
In the whole intron – more in the non-ribosomal introns
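A search for U runs that tolerate one mutation can be sketched with a sliding window that counts non-U bases. The Python below is an illustration under assumptions: "with a mutation" is read here as at most one non-U base in a window of the given length, and the function names and toy sequences are hypothetical.

```python
def has_near_poly_u(seq, length, max_mismatches=1):
    """True if some window of `length` has at most `max_mismatches` non-U bases."""
    if length > len(seq):
        return False
    # Count non-U bases in the first window, then slide one base at a time
    mismatches = sum(1 for base in seq[:length] if base != "U")
    if mismatches <= max_mismatches:
        return True
    for i in range(length, len(seq)):
        mismatches += (seq[i] != "U") - (seq[i - length] != "U")
        if mismatches <= max_mismatches:
            return True
    return False


def fraction_with_near_poly_u(seqs, length, max_mismatches=1):
    """Fraction of sequences containing such a window."""
    if not seqs:
        return 0.0
    hits = sum(has_near_poly_u(s, length, max_mismatches) for s in seqs)
    return hits / len(seqs)


# Toy sequences (hypothetical)
toy = ["ACUUUAUUUG", "ACGACGACGA"]
print(fraction_with_near_poly_u(toy, 7))
```

Evaluating the fraction for each length from 3 to 21, once on whole introns and once on the branchpoint-3' area, gives four curves per comparison.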
Results – in the branchpoint-3’
[Figure: U sequences with one mutation in the intron and the branchpoint-3' site area; probability versus sequence length (bp), 3 to 21; series: ribosomal introns branchpoint-3' site, non-ribosomal introns branchpoint-3' site, ribosomal introns, non-ribosomal introns]
Conclusion
The branchpoint-3’ splice site area might be homologous (all-against-all BLAST).
The difference between ribosomal and non-ribosomal may be connected with a polyU sequence.
The ribosomal gene introns and the non-ribosomal gene introns are not similar.
Thanks for listening!