1 DNA + RNA Dan Graur. 2 The hereditary information of all living organisms, with the exception of...

Post on 02-Jan-2016

216 views 1 download

Tags:

Transcript of 1 DNA + RNA Dan Graur. 2 The hereditary information of all living organisms, with the exception of...

1

DNA + RNADNA + RNA

Dan GraurDan Graur

2

The hereditary information of all living organisms, with the exception of some viruses, is carried by deoxyribonucleic acid (DNA) molecules.

3

DNA is made of DNA is made of fourfour nucleotides nucleotides

adenine (A)adenine (A) cytosine (C)cytosine (C)guanine (G)guanine (G) thymine (T)thymine (T)

2 purines (R):2 purines (R): 2 pyrimidines (Y):2 pyrimidines (Y):

4

sugar (deoxyribose) phosphate

DNA is made of DNA is made of fourfour nucleotides nucleotides

adenine (A)adenine (A) cytosine (C)cytosine (C)guanine (G)guanine (G) thymine (T)thymine (T)

2 purines (R):2 purines (R): 2 pyrimidines (Y):2 pyrimidines (Y):

5two ringstwo rings one ringone ring

DNA is made of DNA is made of fourfour nucleotides nucleotides

2 purines (R):2 purines (R): 2 pyrimidines (Y):2 pyrimidines (Y):

AA CCGG TT

6

The nucleotides are covalently linked in tandem by asymmetrical 5’-3' phosphodiester bonds. The DNA is, thus, polarized.

7

Thus, the sequenceThus, the sequence

5’-GCAAT-5’-GCAAT-3’3’

is different from the is different from the sequencesequence

3’-GCAAT-3’-GCAAT-5’5’

8

Deoxyribonucleic acid (DNA) Deoxyribonucleic acid (DNA) consists of two complementary consists of two complementary strands twisted around each other strands twisted around each other to form a to form a ririgght-handed double ht-handed double helixhelix. .

The two chains are joined The two chains are joined throughout their lengths by throughout their lengths by hydrohydroggen bondsen bonds between pairs of between pairs of nucleotides. nucleotides.

9A purinepurine always pairs with a pyrimidinepyrimidine.

Strongbond

Weakbond

10

TheThe antiparaantipara

llelllel structustructure of re of

double-double-strandestranded DNAd DNA

11

5’ 3’

downstreamupstream

upstreamdownstream

5’3’

12

5’

5’3’

3’

heavy chain

light chain

A preponderance of purines

A preponderance of pyrimidines

13

5’

5’3’

3’

CTGGA

14

The length of a single-stranded nucleic acid is measured in number of nucleotides.

15

The length a double-stranded sequence is measured in base pairs (bp), thousands of base pairs (kilobases, Kb), millions of base pairs (megabases, Mb), or billions of base pairs (gigabases, Gb).

16

The entire complement of genetic material carried by an individual is called the genome.

Eukaryotic cells usually have more than one genome: nuclear1, mitochondrial2, & plastid3.Some eukaryotes have a single genome; some have4 or more genomes .

1122

33

17

GenomeGenome

Nongenic or Nongenic or intergenic DNAintergenic DNAGenic DNAGenic DNA

18

ReplicationReplicationDNA DNA

19

1958: Matthew Meselson & Franklin Stahl

20

21

The origin of replication is at the replication bubble, a local region where the two strands of the DNA helix have been separated from each other. Replication proceeds in both directions as two replication forks.

22

Bacteria have one origin of replication. A bacterial genome can replicate in ~40 minutes. In eukaryotes, many replication origins exist. They are spaced at intervals of up to 300,000 bp from one another. Replication in eukaryotic cells may take several hours.

23DNA replication occurs only in the 5'-DNA replication occurs only in the 5'-to-3' direction. to-3' direction.

24

Okazaki fragments Bacteria = 1,000 - 2,000 nucleotidesVertebrates = 100 - 200 nucleotides

The leading strand is replicated continuously. The lagging strand is replicated as Okazaki fragmentsOkazaki fragments.

25

When Wilhelm Johannsen coined the word “gene” in 1909, the term meant “a unit of heredity.” Its material basis was unimportant for its usefulness as a concept.

26

“Where the meaning of most four-letter words is all too clear, that of gene is not. The more expert scientists become in molecular genetics, the less easy it is to be sure about what, if anything, a gene actually is.”

Helen Pearson

27

“A gene is a sequence of DNA that is essential for a specific function.”

(1) Protein-codinProtein-codingg ggenesenes, which are transcribed into RNA and subsequently translated into proteins(2) RNA-specifRNA-specifyyiningg ggenesenes, which are only transcribed into RNA(3) Untranscribed Untranscribed ggenesenes.

28

(1) Protein-coding genesProtein-coding genes

(2) RNA-specifying genesRNA-specifying genes

Structural or Structural or ProductiveProductive

GenesGenes

29

TranscriptionTranscriptionDNA RNA

30

RNARNA differs differs from from DNADNA by by having having riboseribose instead of instead of deoxyribosedeoxyribose……

31

standard nucleotides

RNARNA differs differs from from DNADNA by using by using uraciluracil instead of instead of thyminethymine… …

32

DNADNA is is mostly mostly double-double-strandedstranded. .

RNARNA is is mostly mostly single-single-strandedstranded..

33

The 5’ to 3’ DNA is transcribed into 3’ to 5’ RNA

TranscriptionTranscription

antisense

sense

34

RNA1. Coding RNA = mRNA2. Non-coding RNA3. Non-functional RNA

(transcriptional noise)

35

Non-coding RNAs

36

RNA processingRNA processingpre-RNA mature-RNA

37

Examples of RNA Examples of RNA processing:processing:

1. Capping of 5' end. 1. Capping of 5' end. 2. Polyadenylation of 3' 2. Polyadenylation of 3' end.end.3. Postranscriptional 3. Postranscriptional modifications.modifications.4. Splicing.4. Splicing.5. RNA editing.5. RNA editing.

38

CappingCapping is the addition of a 7’-methylated guanine to the 5’ end via a 5’ to 5’ bond.

39

Polyadenylation of Polyadenylation of 3' end3' end

40Postranscriptional modifications

I: Inosine (needed for wobble).

Ψ: Pseudouridine (only in Eukarya and Archaea).

D: Dihydrouridine.

T: Thymine (unusual for RNA).

Y: Wybutosine (found just after the anticodon).

I: Inosine (needed for wobble).

Ψ: Pseudouridine (only in Eukarya and Archaea).

D: Dihydrouridine.

T: Thymine (unusual for RNA).

Y: Wybutosine (found just after the anticodon).

41

SplicingSplicing

42

Protein-coding genesProtein-coding genesThe number of introns varies greatly from gene to gene. The distribution of intron sizes in vertebrate genes is very broad. The distribution of exon sizes is much narrower with a peak at around 150 bp. The vast majority of protein-coding genes in vertebrates consist mostly of introns.

43

donor

acceptor

44

The human factor-IX geneThe human factor-IX gene

Total length of exons = 1,386 bp Total length of introns = 29,954 bp 5' untranslated region = 30 bp3' untranslated region = 1,389 bp

Only about 4%4% of the pre-mRNA sequence actually encode the protein.

Note the uneven distribution of exons.

45

Mean size = 144 bp

Mean size = 123 bp

Mean size = 165 bp

Homo sapiens Rattus rattus Arabidopsis thaliana

Mean size = 2364 bp

Mean size = 733 bp

Mean size = 171 bp

exons

introns

Intron and exon lengths

47

The nuclear divide: The nuclear divide:

48

49

RNA editing

SubstitutionInsertion/deletion•Mononucleotide insertion (C, G, U)•Mononucleotide deletion (U)•Dinucleotide insertion (GC, GU, CU, AU, AA)•Substitutions (UA, UG, UC, AG, CU, CA, GA)•Modifications to a nonstandard nucleotide (AI)

50

Substitution editing

51

Insertion/deletion editingWhen RNA editing is extensive, so that the resulting RNA bears little resemblance to the DNA sequence, the DNA template is called a cryptogenecryptogene.

52

Pseudogenes = Dead Gene:Pseudogenes = Dead Gene:

A pseudogene is a DNA segment A pseudogene is a DNA segment that is similar to a gene, but that is similar to a gene, but which contains defects that which contains defects that render it non-functional. render it non-functional.

Pseudogenes are marked by the Pseudogenes are marked by the prefix prefix followed by the name of followed by the name of the gene to which they are similar, the gene to which they are similar, e.g., e.g., -globin.-globin.

In computerized databanks, the In computerized databanks, the suffix P is used, e.g., CA5P for suffix P is used, e.g., CA5P for the the -carbonic anhydrase pseudogene -carbonic anhydrase pseudogene 5.5.

53

Most pseudogenes are not transcribed. A significant minority of pseudogenes is transcribed but not translated.A handful of pseudogenes is translated.

Pseudogenes are ubiquitous at all genomic locations and in all organisms, although some organisms and some locations tend to harbor more pseudogenes than others.

54

There are three main categories of pseudogenes distinguished by the molecular evolutionary processes which created them:

1. Unitary2. Unprocessed3. Processed

Each of these can be duplicated or unduplicated.

There are also mosaic pseudogenes.