1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains...

67
1 LINEs & & SINEs

Transcript of 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains...

Page 1: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

1

LINEs && SINEs

Page 2: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

2

The genome of most eukaryotes contains highly-repetitive highly-repetitive interspersed sequencesinterspersed sequences:

(1) short-interspersed repetitive elements (SINESINEs)

(2) long-interspersed repetitive elements (LINELINEs).

Page 3: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

3

~1/2 of the human genome consists of interspersed repetitive sequences.

Page 4: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

4

Page 5: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

5

Page 6: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

6

interspersed repeats

Original definitions:

Page 7: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

7

Current definitions:

LINEs = Active or degenerate descendants of transposable elements. SINEs = Non-autonomous transposable elements (lacking the ability to mediate their own transposition) and their degenerate descendents.

Page 8: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

8

Page 9: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

9

The reverse The reverse transcriptase has transcriptase has LINELINE specificityspecificity, i.e., a , i.e., a reverse transcriptase reverse transcriptase from one from one LINELINE will only will only recognize the 3’ end of recognize the 3’ end of that that LINELINE, and will be , and will be less efficient at less efficient at recognizing and reverse recognizing and reverse transcribing other transcribing other LINELINEs. s.

Page 10: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

10

SINESINEs are s are retrosequencesretrosequences that range in length that range in length from 75 to 500 bp. from 75 to 500 bp.

SINESINEs do not possess any s do not possess any reading framereading frame. .

Thus, their Thus, their retroposition retroposition mustmust be be aided by aided by other genetic other genetic elementselements. .

Page 11: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

11

SINESINE

7SL-RNA derived tRNA-derived

Primate Alu+

Rodent B1 All others

Page 12: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

12

AluAlu elements elements

• Length = ~300 bp• Repetitive: > 1,000,000 times in the human genome

• Constitute >10% of the human genome • Found mostly in intergenic regions and introns• Propagate in the genome through retroposition (RNA intermediates).

Page 13: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

13

Page 14: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

14

Evolution of Alu elements (I)

Page 15: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

15

Evolution of Alu elements (II)

FAMFAM

FLAMFLAM FRAMFRAM

Page 16: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

16

Page 17: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

17

Master-gene model for Alu proliferation in the genome

Master gene A

Replicatively incompetent progeny

Progeny undergoes multiple independent mutations

Mutation renders A non-functional & creates new master gene B

Mutation renders B non-functional & creates new master gene C

Page 18: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

18

Page 19: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

19

AluAlu elements can be elements can be divided into divided into subfamiliessubfamilies

The The subfamilies subfamilies are are distinguished distinguished by ~16 by ~16 diagnostic diagnostic positions.positions.

Page 20: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

20

diagnosticdiagnosticpositionspositions

Page 21: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

21

Alu elements are found only in primates.

All the millions of Alu elements have accumulated in a mere ~65 million years.

Page 22: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

22

Alu elements can be sorted into distinct families according to shared patterns of variation.

At any given point in time, only one or several Alu “master copies” are capable of transposing.

Page 23: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

23

Early in primate Early in primate evolution, evolution, AluAlu transposition rate transposition rate was approximately was approximately one new jump in one new jump in every live birthevery live birth. .

Today, it is about Today, it is about one new jump in one new jump in every 200 live every 200 live birthsbirths. .

Page 24: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

24

PV92, a human-specific Alu insertion on chromosome 16

There are two alleles: with (+) or without (–) the Alu transposable element. There are three genotypes (++, + –, – –). The + and – alleles can be separated by size using gel electrophoresis.

Page 25: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

25

tRNA derived SINEs

Page 26: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

26

a tRNA-derived SINE

twin twin SINESINE500 per haploid genome500 per haploid genome

Culex pipiens

Page 27: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

27

Norihiro Okada et al.

Page 28: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

28

Bombyx moriBombyx mori

Page 29: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

29

Page 30: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

30

3 predictions:

1. Where there’s a SINE, there’s a LINE!

2. Once a LINE partner becomes inactive, the SINE partner will lose its ability to retrotranspose.

3. A LINE partner should have a longer evolutionary history and, hence, a broader phylogenetic distribution than its SINE partner.

Page 31: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

31

Degrees of sequence divergence in Alu sequences and -globin pseudogenes

_______________________________________________________________

Percent divergence

_______________________________________Species pair Alu sequence pseudogene

_______________________________________________________________

Human vs. chimpanzee 2.2 ± 1.4 1.7

Human vs. orangutan 3.7 ± 1.9 3.1

_______________________________________________________________

Page 32: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

32

GENETIC AND EVOLUTIONARY EFFECTS OF TRANSPOSITION

1. Duplicative transposition increases genome size.

Lily Edible frog Sunflower

Page 33: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

33

2. Bacterial transposons often carry genes that confer antibiotic or other forms of resistance. Plasmids can carry such transposons from cell to cell, so that resistance can spread throughout a population or an ecosystem.

Page 34: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

34

Page 35: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

35

A figure used by Barbara McClintock at her Nobel Prize (1983) Lecture

Page 36: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

36

3. Gene expression may be altered by the presence of a transposable element.

a.a. An insertion may obliterate the reading frame (phenotypic effects).

b.b. A transposable element may contain regulatory elements (effects on transcription of nearby genes).

c.c. Transposable elements may contain splice sites (effects on RNA processing even if the element is in an intron).

Page 37: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

37

Page 38: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

38

Caused by the insertion of a transposable element.

Page 39: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

39Transposable elements in Antirrhinum majus

Page 40: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

40

cinnamic acidchalcone eriodictyol dihydroquercetin cyanidin

nivea recurrens

Page 41: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

41

cinnamic acidchalcone eriodictyol dihydroquercetin cyanidin

nivea incolorata pallida

Page 42: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

42

Page 43: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

43

rrugosusugosus

Page 44: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

44

Page 45: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

45

Page 46: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

46

4. Transposable elements promote gross genomic rearrangements

a.a. directly (moving a DNA sequence from one genomic location to another).

b.b. indirectly (as a result of transposition, two sequences become similar to one another so that unequal crossing-over between them is possible).

Page 47: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

47

Page 48: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

48

Page 49: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

49

An unequal crossing-over event facilitated by the presence of multiple Alu sequences in the introns flanking exon 5exon 5 of the low-density-lipoprotein-receptor gene, has given rise to a mutant gene lacking exon 5 (FH-626aFH-626a). Patients homozygous for this deletions have hypercholesterolemiahypercholesterolemia.

Page 50: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

50

5. Insertion of transposable elements into some members of a multigene family but not others will reduce the rate and limit the extent of gene conversion between the members of the family and, therefore, increase the rate of divergence between duplicate genes.

Page 51: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

51

6. Some transposable elements cause an increase in the rate of mutation in adjacent regions.

Page 52: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

52

7. The presence of a transposable element can turn an otherwise immobile piece of DNA into a mobile one.

IS IS

Page 53: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

53

8. Replicative transposable 8. Replicative transposable elements may be “motif donors” elements may be “motif donors” for for nucleosome positioningnucleosome positioning, , DNA DNA methylationmethylation, , transcription transcription enhancementenhancement, , transcription transcription silencingsilencing, , polyadenylationpolyadenylation, , retinoic-acid receptorsretinoic-acid receptors, , RNA RNA splicingsplicing, , RNA stabilizationRNA stabilization, , RNA transportRNA transport, & , & open-reading open-reading framesframes. .

Page 54: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

54

Hybrid dysgenesis in Drosophila is a syndrome of correlated abnormal genetic traits that is spontaneously induced in one type of hybrid between certain mutually interactive strains, but not in the reciprocal hybrid.

Margaret Kidwell

Page 55: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

55

(1) failure of the gonads to develop if exposed to temperatures above 27˚C, (2) recombination in males, (3) chromosomal breakages, (4) distortion of Mendelian transmission ratios, and (5) a high frequency of mutation.

Page 56: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

56

Hybriddysgenesis

In PP-carrying flies, there are 30-50 P elements per haploid genome. Many are inactive.

MM flies do not carry P elements.

PP MM

Page 57: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

57

A 2,907-bp P element in D. melanogaster. The element contains a single gene encoding a transposase (766 amino acids). The coding region consists of four exons interrupted by three introns. Production of the functional protein depends on the splicing of the third intron. The splicing of this intron is prevented by the binding of a multiprotein complex to exon 2. One component of this complex has a low abundance in germline cells, effectively limiting the production of functional transposase to these cells only. In somatic cells, a shorter and inactive polypeptide is encoded by the mRNA that retains the third intron.

Page 58: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

58

Hybrid dysgenesis

cytotypecytotype

Page 59: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

59

Speciation (cladogenesis) = the creation of two or more taxa from a parental taxon.

Page 60: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

60

Page 61: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

61

Why NOT hybrid dysgenesis?

Because of segregation distortion, P quickly spreads through the population.

P has the ability to move from individual to individual as an infectious agent.

Hybrid dysgenesis was only found in Drosophila mealnogaster and may not be a universal phenomenon.

Page 62: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

62

Richard GoldschmidtThe “hopeful monster” idea

“The change from species to species is not a change involving more and more additional atomistic changes, but a complete change of the primary pattern or reaction system into a new one, which afterwards may again produce intraspecific variation by micromutation.” (Richard Goldschmidt. 1940. The Material Basis of Evolution, pp. 205-206)

Page 63: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

63

Genetic resetting

Mass replicative transposition of elements containing elements containing regulatory sequencesregulatory sequences in one population may cause many genes to be subject to novel modes of regulation.

Page 64: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

64

A genetically reset population will become reproductively reproductively isolatedisolated from the population that retains the old form of gene regulation.

Page 65: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

65

The discovery that some Alu repeats contain retinoic-acid response elements, which may function as transcription factors, indicates that it is possible to alter the expression of numerous genes through the dispersion of transposable elements.

Page 66: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

66

Mechanical incompatibilityIn one population the transposable elements multiply to such an extent as to cause a significant increase in the size of the chromosomes.

A hybrid organism that inherits big chromosomes from one parent and small chromosomes from the other would experience difficulties in chromosome pairing during meiosis, and would be most probably be sterile.

Page 67: 1 & LINEs & SINEs. 2 highly-repetitive interspersed sequences The genome of most eukaryotes contains highly-repetitive interspersed sequences: SINE (1)

67

Thursday, April 26, 2012 Tuesday, May 8, 2012 final exam