PREditor P redictive R NA Editor for Plant Mitochondrial Genes
description
Transcript of PREditor P redictive R NA Editor for Plant Mitochondrial Genes
![Page 1: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/1.jpg)
PREditorPredictive RNA Editor for Plant Mitochondrial Genes
Jeff Mower
![Page 2: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/2.jpg)
What is RNA Editing?– A process that alters the RNA sequence
– Nt insertion, deletion, or conversion
– Does not include RNA maturation processes
![Page 3: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/3.jpg)
RNA Editing in Plants– Occurs in mitochondria and chloroplasts
– C to U and U to C conversions
– Mechanism is not known
![Page 4: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/4.jpg)
RNA Editing in Plants– In seed plants (conifers, flowering plants, etc.)
• Widespread in mitochondrion• Rare in chloroplast• Predominantly C to U
– In non-seed plants (mosses, ferns, etc.)• Frequent in mitochondrion and chloroplast• Both C to U and U to C are common
![Page 5: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/5.jpg)
AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA
M R N S V G C Q *
UGGC
![Page 6: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/6.jpg)
AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA
M R N S V G C Q *
UGGC
AUGAUG CAAAAUCGU UCU UGCGGCGUA
M V M R N S V G C Q *
GUC
Creation of new start codon
AG UGA UGGC
![Page 7: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/7.jpg)
AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA
M R N S V G C Q *
UGGC
AUGAUG CAAAAUUGU UUU UGCGGCGUA
M V M C N F V G C Q *
GUC
Alteration ofprotein sequence
Creation of new start codon
AG UGA UGGC
![Page 8: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/8.jpg)
AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA
M R N S V G C Q *
UGGC
AUGAUG CAAAAUUGU UUU UGCGGUGUA
M V M C N F V G C Q *
GUC
Alteration ofprotein sequence
Creation of new start codon
AG UGA UGGC
No effect onprotein sequence
![Page 9: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/9.jpg)
AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA
M R N S V G C Q *
UGGC
AUG UGAUGGCAUG UAAAAUUGU UUU UGCGGUGUA
M V M C N F V G C *
GUC
Alteration ofprotein sequence
Creation of new stop codon
Creation of new start codon
AG
No effect onprotein sequence
![Page 10: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/10.jpg)
Identifying Edit Sites1. Determine experimentally
• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)
![Page 11: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/11.jpg)
Identifying Edit Sites1. Determine experimentally
• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)
2. Predict based on sequence context• Upstream and downstream regions are important• Unambiguous motifs have not been identified
![Page 12: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/12.jpg)
Identifying Edit Sites1. Determine experimentally
• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)
2. Predict based on sequence context• Upstream and downstream regions are important• Unambiguous motifs have not been identified
3. Predict based on protein conservation• Proteins are more conserved after editing • Editing tends to “correct” amino acid sequences
![Page 13: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/13.jpg)
PREditor Methodology – Aligned sequence database (ASD) Construction
• 363 DNA sequences RNA editing information is known Organisms do not perform RNA editing
• Proteins were translated using the editing information
• Homologous proteins were aligned 42 different alignments All known mt proteins are covered
![Page 14: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/14.jpg)
PREditor Methodology – Input Sequence Manipulation
• Accept a protein-coding DNA sequence as input• Translate input sequence• Align translation to homologous proteins in ASD
![Page 15: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/15.jpg)
PREditor Methodology – The Underlying Principle
• ASD sequences translated from edited RNA• Input sequence translated from unedited DNA• “Where can RNA editing in the input sequence increase
conservation to the ASD sequences?”
![Page 16: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/16.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
![Page 17: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/17.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
Input E (GAA) T (ACG) P (CCU) G (GGC)
![Page 18: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/18.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
Input E (GAA) T (ACG) P (CCU) G (GGC)
Unedited E (GAA) 0.33
Edited N/A N/A
![Page 19: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/19.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
Input E (GAA) T (ACG) P (CCU) G (GGC)
Unedited E (GAA) 0.33 T (ACG) 0.33
Edited N/A N/A M (AUG) 0.67
![Page 20: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/20.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
Input E (GAA) T (ACG) P (CCU) G (GGC)
Unedited E (GAA) 0.33 T (ACG) 0.33 P (CCU) 0.0
Edited N/A N/A M (AUG) 0.67 S (UCU) 0.0
L (CUU) 1.0
F (UUU) 0.0
![Page 21: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/21.jpg)
ASD1 A T L G
ASD2 E M L G
ASD3 A M L G
Input E (GAA) T (ACG) P (CCU) G (GGC)
Unedited E (GAA) 0.33 T (ACG) 0.33 P (CCU) 0.0 G (GGC) 1.0
Edited N/A N/A M (AUG) 0.67 S (UCU) 0.0 G (GGU) 1.0
L (CUU) 1.0
F (UUU) 0.0
![Page 22: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/22.jpg)
Performance Analysis – Remove one protein sequence from the database
– Use the unedited DNA sequence as input
– Calculate statistics • Accuracy = (TP + TN) / (TP + FP + TN + FN)• Sensitivity = TP / (TP + FN)• Specificity = TN / (TN + FP)
– Repeat for each sequence
![Page 23: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/23.jpg)
Performance Analysis– Total # C’s = 58,982
– True edited sites = 3,548 (6.0%)• TP = 2,922• FN = 626
– True non-edited sites = 55,434• TN = 54,829• FP = 605
![Page 24: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/24.jpg)
Performance Analysis – Sensitivity = 82.4%
• Proportion of true edited sites that were predicted correctly• Increases to 94.6% if you ignore missed silent edited sites
– Specificity = 98.9%• Proportion of true non-edited sites that were predicted correctly
– Accuracy = 97.9%• Proportion of all sites that were predicted correctly• Increases to 98.7% if you ignore missed silent edited sites
![Page 25: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/25.jpg)
Limitations– Cannot predict editing at silent sites
• 458 of 626 FN are at silent sites
![Page 26: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/26.jpg)
Limitations– Cannot predict editing at silent sites
• 458 of 626 FN are at silent sites
– Not a major problem in practice• Silent editing sites do not affect the protein sequence • Many silent sites are only occasionally edited• Only 13% of editing sites are silent (expect ~38%)
![Page 27: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/27.jpg)
Limitations– Aligned sequence database is skewed
Origin of
RNA editing
Angiosperms 266 (74%)
Gymnosperms 2 (1%)
Ferns 0
Horsetails 0
Hornworts 0
Mosses 0
Liverworts 32 (9%)
Charophytes 59 (16%)
Chlorophytes ―
![Page 28: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/28.jpg)
Limitations– The skewed database effect
ASD1 Z
ASD2 Z
ASD3 Z
ASD4 Z
ASD4 X
Input X or Z?
![Page 29: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/29.jpg)
Ongoing Work– Reducing the skewed database effect
• Weighted sequences and phylogenetics
ASD1 Z
ASD2 Z
ASD3 Z
ASD4 Z
ASD4 X
Input X!
![Page 30: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/30.jpg)
Ongoing Work – Making the online resource more appealing
– Making the online resource more user-friendly
![Page 31: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/31.jpg)
Future Directions – Increase diversity in the ASD
– Analyze sequence context using the ASD
– Apply methodology to editing in chloroplasts
– Apply methodology to U to C editing
![Page 32: PREditor P redictive R NA Editor for Plant Mitochondrial Genes](https://reader036.fdocuments.in/reader036/viewer/2022081603/56813d55550346895da715d3/html5/thumbnails/32.jpg)
Thanks – Jeff Palmer
– Sun Kim
– Danny Rice and other members of the Palmer lab