PLANT RETROTRANSPOSONS - MTUforest.mtu.edu/faculty/joshi/publish/pdf/amar.pdf · P1: FNE/fgo/fok...
-
Upload
hoangkhanh -
Category
Documents
-
view
214 -
download
0
Transcript of PLANT RETROTRANSPOSONS - MTUforest.mtu.edu/faculty/joshi/publish/pdf/amar.pdf · P1: FNE/fgo/fok...
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?Annu. Rev. Genet. 1999. 33:479–532
Copyright c© 1999 by Annual Reviews. All rights reserved
PLANT RETROTRANSPOSONS
Amar Kumar and Jeffrey L. BennetzenScottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, Scotland andDepartment of Biological Sciences, Purdue University, West Lafayette, Indiana47907-1392; e-mail: [email protected]
Key Words DNA markers, evolution, genome organization, mutation, regulation,retroelements
■ Abstract Retrotransposons are mobile genetic elements that transpose throughreverse transcription of an RNA intermediate. Retrotransposons are ubiquitous in plantsand play a major role in plant gene and genome evolution. In many cases, retrotrans-posons comprise over 50% of nuclear DNA content, a situation that can arise in just afew million years. Plant retrotransposons are structurally and functionally similar to theretrotransposons and retroviruses that are found in other eukaryotic organisms. How-ever, there are important differences in the genomic organization of retrotransposonsin plants compared to some other eukaryotes, including their often-high copy num-bers, their extensively heterogeneous populations, and their chromosomal dispersionpatterns. Recent studies are providing valuable insights into the mechanisms involvedin regulating the expression and transposition of retrotransposons. This review de-scribes the structure, genomic organization, expression, regulation, and evolution ofretrotransposons, and discusses both their contributions to plant genome evolution andtheir use as genetic tools in plant biology.
CONTENTS
Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 480Types, Structures, and Distributions of Retrotransposons in Plants. . . . . . . . . 481
LTR Retrotransposons of the Ty1-copiaGroup . . . . . . . . . . . . . . . . . . . . . . . . . . 482LTR Retrotransposons of the Ty3-gypsyGroup . . . . . . . . . . . . . . . . . . . . . . . . . . 489LINEs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 489SINEs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 490
Retrotransposon Dispersal and Organization in The Plant Genome. . . . . . . . 490Chromosomal Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 490Retrotransposon Abundance in Plants Relative to Other Eukaryotes. . . . . . . . . . . 493Local Structures. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 495Insertion Site Preferences. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 496In Organellar Genomes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 497
Regulation of Expression and Transposition. . . . . . . . . . . . . . . . . . . . . . . . . . 497Transcriptional Regulation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 497
0066-4197/99/1215-0479$08.00 479
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?480 KUMAR ■ BENNETZEN
Activation by Biotic and Abiotic Stresses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 498Regulatory Sequences. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 499Epigenetic and Posttranscriptional Regulations. . . . . . . . . . . . . . . . . . . . . . . . . 500Retrotransposon Regulation as a Form of Host Defense. . . . . . . . . . . . . . . . . . . . 502
Plant Retrotransposon Origins and Evolution. . . . . . . . . . . . . . . . . . . . . . . . . 503The Frequency and Timing of Retrotransposition During Evolution. . . . . . . . . . . 503Heterogeneity. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 505Internal Mutation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 506Retrotransposons and Retroviruses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 507Horizontal Transmission?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 508
Contributions of Retrotransposons to the Evolution of Plant Genesand Genomes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510Gene Mutation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510Genome Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 511Genome Rearrangement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 516Roles for Retrotransposons?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517
Retrotransposons as Genetic Tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 518Molecular Markers for Phylogenetic, Biodiversity, and Genetic
Linkage Analyses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 518Gene Tagging and Functional Analysis of Genes. . . . . . . . . . . . . . . . . . . . . . . . . 519Evaluation of Somaclonal Variations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 519
Conclusions and Future Prospects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 520
INTRODUCTION
The mobile genetic elements called retrotransposons are ubiquitous in the plantkingdom. They are present in high copy numbers in most plants, making themmajor constituents of plant genomes. Retrotransposons move to new chromosomallocations via an RNA intermediate that is converted into extrachromosomal DNAby the encoded reverse transcriptase/RNaseH enzymes prior to reinsertion intothe genome (13, 16). This replicative mode of transposition can rapidly increasethe copy numbers of elements and can thereby greatly increase plant genome size(93, 156). The DNA transposable elements (for example,Ac, Tam1, andEn/Spm)transpose by an excision/repair mechanism and usually do not greatly increaseplant genome size (97). Retrotransposons, like DNA transposable elements, cangenerate mutations by inserting within or near genes. Moreover, retrotransposon-induced mutations are relatively stable; because they transpose via replication, thesequence at the insertion site is retained.
Many of the properties of transposable elements suggest that they are parasiticor selfish DNAs (34, 134). Because they are expressed in the genome and subjectto many of the same rules of inheritance as are the genes of their genomic “host,”a retrotransposon can be co-opted for use in a biological process that benefitsthe organism. For instance, the preferential insertion of some retrotransposons inDrosophilaat telomeric locations has removed the need for a telomerase function inthese insects (137). However, discussion of plant/transposable element interactions
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 481
are often most informative from a plant/parasite perspective, and we often employthis approach of looking at the genome as an ecosystem in which different DNAcomponents vie for influence and survival (7, 34, 134).
Because retrotransposons have the potential to dramatically alter gene func-tion and host genome structure, it is not surprising that their transpositional ac-tivities are regulated both by retrotransposon- and host-encoded factors, possiblyto avoid deleterious effects on host and retrotransposon survival. The intimaterelationship between retrotransposons and their plant hosts has existed for manymillions of years. We are just beginning to understand how retrotransposons andtheir hosts’ genomes have co-evolved mechanisms to regulate transposition, in-sertion specificities, and mutational outcomes in order to optimize each other’ssurvival. Here, we review the current status of research on plant retrotransposonswith respect to their structure, genomic organization, expression, regulation, trans-positional activities, and evolution. We also discuss their contributions to plantgenome evolution and their applications as tools for phylogenetic studies, bio-diversity assessments, and gene tagging. Since the first publication on a plantretrotransposon in 1984 (165), many publications on this subject have appeared,and it is not possible to refer to all of them in this review. For additional back-ground, we suggest any of several previous reviews on plant retrotransposons(5, 48, 54, 55, 95, 97, 169, 202).
TYPES, STRUCTURES, AND DISTRIBUTIONSOF RETROTRANSPOSONS IN PLANTS
Retrotransposons are the most abundant and widespread class of eukaryotic trans-posable element, consisting of the long terminal repeat (LTR) and the non-LTRretrotransposons (Figure 1) (5, 54, 93, 97). LTR retrotransposons are further sub-classified into the Ty1-copia and the Ty3-gypsygroups that differ from eachother in both their degree of sequence similarity (207) and the order of en-coded gene products (Figure 1). Ty1-copiaretrotransposons are present through-out the plant kingdom, in species ranging from single-cell algae to bryophytes,gymnosperms, and angiosperms (44, 195). Ty3-gypsyretrotransposons also havebeen found to be widely distributed in the plant kingdom, including both gym-nosperms and angiosperms (90, 96, 131, 164, 176, 181). Furthermore, both thesegroups of retrotransposons are commonly found in high copy number (up to afew million copies per haploid nucleus) in plants with large genomes (Table 1)(5, 95, 97, 101, 147, 158). The non-LTR retrotransposons, LINEs (long interspersedrepetitive elements) and SINEs (short interspersed repetitive elements) can alsobe found in high copy numbers (up to 250,000) in the plant species studied to date(5, 97, 101, 169). Like the LTR retrotransposons, LINEs are present throughoutthe plant kingdom (129, 130). SINEs have been found in several angiosperms(32, 126, 130, 142, 210) and may be ubiquitous in plants. Table 1 shows a list ofsome well-characterized retrotransposons in plants.
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?482 KUMAR ■ BENNETZEN
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 483
LTR Retrotransposons of the Ty1-copia Group
The LTR retrotransposons have direct long terminal repeats (LTRs) that can rangefrom a few 100 bp to over 5 kb in size (Figure 1; Table 1). The LTRs do notencode any known proteins, but they do contain the promoters and terminatorsassociated with the transcription of LTR retrotransposons. LTRs terminate in shortinverted repeats, usually 5′—TG–3′ and 5′—CA–3′. The LTR retrotransposonsencode a number of proteins, specified by three major genes calledgag, pol, andint. These genes and proteins are all specified by a single mRNA molecule thathas the structure 5′—R–U5—PBS–coding region–PPT–U3—R–3′, where R, U5,PBS, PPT, and U3 stand for repeated RNA, unique 5′ RNA, primer binding site,polypurine tract, and unique 3′ RNA, respectively (Figure 1). Hence, transcriptioninitiates at the 5′ end of R in the 5′ LTR and terminates in the 3′ end of R inthe 3′ LTR. As this result suggests, the 3′ LTR also contains a promoter thatcan lead to readthrough transcription of sequences downstream from the insertedelement. The proteins encoded bygag, pol, andint are synthesized as a polyproteinthat is cleaved into functional peptides by apol-encoded protease (PR). Thegaggene encodes proteins involved in maturation and packaging of retrotransposonRNA and proteins into a form suitable for integration into the genome. Thepolgene encodes reverse transcriptase and RNase H activities that are required forreplication/transposition of the retrotransposon, whereasint encodes the integrasethat allows the DNA form of the retrotransposon to insert at a new chromosomallocation. In some cases,gag, pol, and int proteins are encoded within a singletranslational reading frame, but in other cases, two or more reading frames arepresent. This leads to a requirement for a frameshift or translational reinitiation to
←−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−Figure 1 General structures of the Ty1-copia, Ty3-gypsy, LINE, and SINE retrotrans-posons. The LTR retrotransposons have long terminal repeats in direct orientation ateach end. Within the LTRs are U3, R, and U5 regions that contain signals for initiationand termination of transcription. The transcript (thin arrow) is shown below each block;it initiates at the 5′ end of R within the 5′ LTR and terminates at the 3′ end of R withinthe 3′ LTR. The genes within the retrotransposons encode capsid-like proteins (CP),endonuclease (EN), integrase (INT), protease (PR), reverse transcriptase (RT), andRNAase-H. Other sequences featured are PBS (primer binding sites), PPT (polypurinetracts), NA (nucleic acid binding moiety), IR (inverted terminal repeats), DR (flankingtarget direct repeat), 5′ UTR (5′ untranslated region), 3′ UTR (3′ untranslated region),and Pol III A and B-promoter recognition sites for RNA polymerase III. These figuresare not drawn to scale, as the LTR retrotransposons range from a few kb up to 15 kbin size. LINEs usually range in size from less than 1 kb to maybe 8 kb, while SINEsare normally 100 bp to 300 bp in size. The envelope (env) gene-like sequence in theposition of ORF 3, where a functionalenvgene is present in the animal retroviruses,has been found in both Ty1-copiaand Ty3-gypsygroups (not shown). The function oftheenvgene-like sequence in plant retrotransposons is unknown.
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?T
AB
LE
1Pr
oper
ties
ofso
me
plan
tret
rotr
ansp
oson
s
Size
Pla
nt&
Tota
lLT
Rge
nom
eR
NA
Ret
rotr
ansp
oson
kbbp
Cop
No.
size
(1C
)L
ocat
ion
tran
scri
ptC
omm
ents
Ref
eren
ce
LTR
retr
otra
nspo
sons
Ty1-
copi
agr
oup
BA
RE
-112
.118
29>
5000
0B
arle
yD
ispe
rsed
Lea
ves,
callu
sA
ctiv
ein
the
rece
nt17
2(
487
0 M
bp)
past
;the
mos
tabu
n-da
ntkn
own
fam
ilyin
the
barl
eyge
nom
e
Bs1
3.2
302
1–5
Mai
ze—
—L
acks
RT
sequ
ence
s;76
( 2
290 0
Mbp
)ha
sac
quir
edco
n-se
rved
tran
smem
bran
edo
mai
nsof
ace
llula
rpl
asm
am
embr
ane
prot
onA
TPa
sege
ne
Opi
e-1
8.7
1256
>30
000
Mai
zeD
ispe
rsed
Roo
ts,l
eave
sA
ctiv
ew
ithin
the
last
158
inte
rgen
icta
ssel
s2
to6
mill
ion
year
s;pr
efer
entia
llyin
sert
sin
toth
eLT
Rs
ofot
her
retr
otra
nspo
sons
PRE
M-2
9.5
1307
>10
000
Mai
zeD
ispe
rsed
Mic
rosp
ores
Act
ive
inth
ere
cent
past
;18
4in
sert
ions
atth
e5′
flank
ing
regi
ons
ofth
epo
lgal
gene
s,co
nsti-
tute
sab
out5
%of
the
mai
zege
nom
e
∼ ∼
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?SI
RE
-111
.010
00>
200
Soyb
ean
—Se
edlin
gs,l
eaf
Act
ive
inth
ere
cent
past
;10
0(1
115
Mbp
)tis
sues
cont
ains
enve
lope
-lik
ese
quen
ces
Tnt
15.
361
0>
100
Toba
cco
Dis
pers
edPr
otop
last
,A
ctiv
ely
tran
spos
ing;
56(
229
0
Mbp
)eu
chro
mat
inro
ots,
wou
ndin
g,is
olat
edas
anin
sert
ion
path
ogen
atta
cks
inth
eni
age
nein
tissu
e-cu
lture
cells
;tra
nspo
ses
intr
ansg
enic
tom
ato
and
Ara
bido
psis
ToS1
74.
113
81–
5R
ice
Dis
pers
edPr
otop
last
,T
rans
pose
sin
tissu
ecu
l-66
( 4
20M
bp)
tissu
ecu
lture
ture
;mos
tlyin
sert
sin
orne
arco
ding
regi
ons.
Tto
15.
357
4>
30To
bacc
oD
ispe
rsed
Prot
opla
st,t
issu
eT
rans
pose
sdu
ring
tissu
e64
cultu
re,v
iral
,cu
lture
;tra
nspo
ses
inw
ound
ing,
path
o-tr
ansg
enic
rice
and
gen
atta
cks
Ara
bido
psis
.
Ty3-
gyps
ygr
oup
Ath
ila
10.5
1539
30A
.tha
lian
aPa
race
ntro
mer
ic—
Ath
ila
fam
ilyre
pres
ents
144
( 1
30 M
bp)
regi
ons
0.3%
ofth
eA
rabi
dops
isge
nom
e
Ath
ila-
1-1
∼12.
013
2473
0A
.tha
lian
a—
—C
onta
ins
enve
lope
-lik
e20
6se
quen
ces
Cer
eba
∼10.
0—
Bar
ley
Cen
trom
eric
—E
xclu
sive
lycl
uste
red
in15
0ce
ntro
mer
icre
gion
Cin
ful-
18.
658
6M
aize
—L
eave
sC
onta
ins
enve
lope
-lik
e15
6se
quen
ces
Cyc
lops
-2∼1
2.5
1504
1000
Pea
Dis
pers
ed—
Con
tain
sen
velo
pe-l
ike
24(
440
0 M
bp)
sequ
ence
s
cont
inue
d(
)
∼∼ ∼∼ ∼
1500
2000
0
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?G
rand
e-1
13.5
600
>13
00M
aize
Dis
pers
ed—
Con
tain
sa
larg
e3′
regi
on11
3of
mor
eth
an7
kbth
atap
pare
ntly
harb
ors
anac
quir
edce
llula
rge
neth
atpr
oduc
esa
900-
bptr
ansc
ript
inal
ltis
sues
exam
ined
IFG
5.9
333
1000
0P
inus
radi
ata
——
Inac
tive,
the
IFG
retr
o-90
( 4
0000
Mbp
)tr
ansp
oson
repr
esen
ts0.
25–0
.5%
ofth
eP
inus
geno
me
RIR
E3
∼10.
523
1610
Ric
e—
—In
activ
e;a
fam
ilyof
96R
IRE
3el
emen
tsar
efo
und
inal
lpla
nts
test
ed
Zeo
n-1
7.3
648
>12
50M
aize
—E
ndos
perm
Lac
ksR
Tse
quen
ces,
72bu
tpro
duce
sga
gpr
otei
nde
tect
edby
antib
odie
s
Non
-LT
Rre
trot
rans
poso
nsL
INE
s
Cin
41–
6.8
—50
–100
Mai
ze—
—A
fam
ilyof
hete
ro-
163
gene
ousl
ytr
unca
ted
5′-e
nds
TA
BL
E1
Size
Pla
nt&
Tota
lLT
Rge
nom
eR
NA
Ret
rotr
ansp
oson
kbbp
Cop
No.
size
(1C
)L
ocat
ion
tran
scri
ptC
omm
ents
Ref
eren
ce
c
onti
nued
)
(
∼
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?D
el2
4.5
—25
0000
Lil
ium
Dis
pers
ed—
Acc
ount
sfo
r4%
ofth
e10
1(2
9000
Mbp
)L
iliu
mge
nom
e
Tal1
6.1
—1–
6A
.tha
lian
aD
ispe
rsed
—A
fam
ilyof
non-
LTR
205
elem
ents
;17
addi
tiona
lre
vers
etr
ansc
ript
ase
sequ
ence
s(T
a12-
28)
wer
eid
entifi
ed;T
a17
islo
cate
din
the
mito
-ch
ondr
ialg
enom
e
Zep
p0.
987–
8943
—13
0C
hlor
ella
Telo
mer
ic—
Clu
ster
edin
the
telo
mer
ic62
vulg
aris
regi
on
SIN
Es
p-SI
NE
10.
125
—>
100
Ric
e—
—U
sefu
las
DN
Am
arke
rs74
S1B
n0.
170
—50
0B
rass
ica
napu
sD
ispe
rsed
Shoo
t,ro
ots,
Ele
men
tsar
etw
ofol
d32
( 6
30M
bp)
callu
sm
ore
met
hyla
ted
than
the
aver
age
met
hyla
tion
leve
lof
B.n
apus
nucl
ear
DN
A
TS
0.11
1—
>50
,000
Toba
cco
Dis
pers
ed—
TS-
hom
olog
ous
sequ
ence
s21
0ar
epr
esen
tin
the
intr
ons
and
flank
ing
regi
ons
ofm
any
gene
sin
the
Sola
nace
ae
Ts
0.20
6—
>10
00B
ellp
eppe
rD
ispe
rsed
—T
rans
pose
dw
ithin
the
149
( 3
420
Mbp
)PA
P2
gene
∼ ∼
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?488 KUMAR ■ BENNETZEN
make appropriate peptides downstream of the change in frame, and thereby canlead to a lower level of these downstream proteins relative to those upstream.
LTR retrotransposons undergo intracellular replication by a complex processthat first requires synthesis of a retrotransposon mRNA molecule (16). This mRNAspecifies the proteins needed for replication, and also serves as the template forreplication. The primer binding site on the mRNA molecule is complementary to acellular RNA, usually the 3′ end of a host tRNA. In vivo hybridization between theretrotransposon mRNA and the appropriate tRNA provides a short double-strandedRNA region with a free 3′ hydroxyl from the tRNA. Reverse transcriptase can usethis 3′ end as a primer, then synthesizing a DNA complement to the R and U5portions of the 5′ LTR. Reverse transcriptase cannot initially synthesize any addi-tional DNA, however, because it has come to the end of the template, the 5′ endof the retrotransposon mRNA. However, the RNase H molecule encoded by theretrotransposon specifically digests the RNA in any RNA:DNA hybrid, thus free-ing up a single-stranded DNA with homology to the R sequence that is also foundat the 3′ end of retrotransposon mRNAs. Hybridization between these sequencesleads to a circular structure that allows a continuation of the reverse transcrip-tion until a single-stranded DNA complementary to all of the element-internalsequences is synthesized to generate a single-stranded DNA circle. Second-strandDNA synthesis requires the action of RNase H and a polypurine tract (PPT) thatis just 5′ to the 3′ LTR. Once the double-stranded linear DNA molecule is synthe-sized, it can then be incorporated into the target genome by the action of integrase,which appears to cut both donor and target molecules. The DNA is apparentlycut with nicks that are staggered by 3 to 5 bp (a size that is consistent for anygiven integrase), thereby creating a flanking target direct repeat that is 3 to 5 bpin size.
In plants, only a few groups have attempted to study the replication cycle ofLTR retrotransposons. Initially, it was shown that tobaccoTnt1 transposes via anRNA-intermediate in the genome ofArabidopsis thaliana(108). Later, this groupdemonstrated that the replication cycle of the element involves generation of extra-chromosomal double-stranded DNA linear intermediates ofTnt1(43). They havealso shown that the primer binding site (PBS) and the PPT of theTnt1 used asprimers for the initiation of minus- and plus-strand DNAs are separated by twonucleotides from the 5′ and 3′ LTRs, respectively, giving rise to extrachromosomallinear intermediates that have two more base pairs at each end than the integratedcopies, as was observed for retroviruses and the yeast Ty3 retrotransposons (43).Prior to integration into the host genome, the two terminal nucleotides at the 3′
end of these linear intermediates are removed, as was seen with yeast Ty3 retro-transposons and retroviruses. Extrachromosomal circular DNA intermediates ofanother tobaccoTto1retrotransposon have also been identified and characterized(68). Interestingly, analyses on generated models for the secondary and tertiarystructure of the integrase (INT) core domain ofBARE-1retrotransposons, com-prising 80 clones from 28Hordeumaccessions, have revealed that they are similarto structures of human immunodeficiency virus and avian sarcoma virus INs and
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 489
thatHordeumretrotransposon INT has been under purifying selection to maintaina structure typical of retroviral INTs (175). However, further research is neededto determine whether the replication cycle of LTR retrotransposons is the same inplants as in other eukaryotes.
LTR Retrotransposons of the Ty3-gypsy Group
Ty3-gypsyretrotransposons encode the same functions as do Ty1-copiaretrotrans-posons, and they have a very similar genomic organization (Figure 1; Table 1).A significant difference is the order ofint andpol genes within the retrotranspo-son. Comparisons of reverse transcriptase sequences indicate that the Ty3-gypsyand Ty1-copia retrotransposons do form different lineages, so this difference ingene order makes good cladistic sense (207). By sequence and cladistic criteria,it appears that animal retroviruses are derived from Ty3-gypsyretrotransposons,presumably by the acquisition of an envelope (env) function that allowed themto be packaged into viral particles that permit intercellular infectivity (180, 207).When present,envis usually the last protein coding sequence upstream of the 3′
LTR. In some cases, it can be unclear whether a Ty3-gypsy-like element is actuallya retrotransposon or a retrovirus (5, 94). Based on present knowledge, the mechan-ics of intracellular element transcription, replication, and integration are the samefor Ty1-copiaand Ty3-gypsyretrotransposons, and for retroviruses.
LINEs
The LINE elements are simpler than LTR retrotransposons, but still encode manyof the same proteins (Figure 1; Table 1). LINEs havegag and pol genes, butlack an identified integrase. Thegag loci of LINEs usually specify a protein withendonucleolytic activity (EN), and this may be involved in integration of DNAversions of LINEs into chromosomal DNA (26). Alternatively, or additionally,LINEs may integrate into chromosomal DNA via the action of a low-frequencyDNA repair activity (123). By sequence diversity criteria, LINE elements appearto be the oldest class of eukaryotic retrotransposons, and cladistic studies suggestthat the first LTR retrotransposons may have arisen by acquisition of LTRs by aLINE (207).
In plants, little is known about the mechanisms of LINE transcription or inte-gration. Most of the LINEs that have been seen in plants are 5′ truncated, as areabout 95% of human L1 elements (160), suggesting similar transcription, replica-tion, and integration processes. In humans, active L1 elements are transcribed afterrecognition of an internal promoter, perhaps involving participation of both RNApolymerase II and RNA polymerase III (98). The bicistronic L1 mRNA is thentranslated, producinggagandpol proteins. Apparently, the endonuclease protein(EN) encoded bypol then cuts the target site, and homology at this target siteserves to prime reverse transcription of the L1 element bypol (107), subsequentlyleading to integration, presumably via nonhomologous end joining (123).
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?490 KUMAR ■ BENNETZEN
SINEs
The small retrotransposons called SINEs are very different from the other classesof retrotransposons in that they do not encode anytrans-acting transposition func-tions (Figure 1; Table 1). All known SINEs are derived from RNA polymerase IIIproducts (e.g. tRNAs) that appear to have evolved an ability to be efficiently repli-cated and integrated by proteins encoded by LINEs and/or LTR retrotransposons(15, 160). In this regard, SINEs are most like intronless pseudogenes, the mRNAmolecules that are occasionally reverse-transcribed and then inserted into eukary-otic genomes (37, 106). However, unlike the mRNAs that are RNA polymerase IIproducts, genes transcribed by RNA polymerase III usually contain their promotersequences within the RNA-coding region. Hence, if an RNA polymerase III prod-uct is rarely (perhaps inadvertently) utilized as a template to make DNA and thenintegrates into a genome, it will have a promoter that allows essentially normaltranscription. The simplest interpretation of natural selection says that evolutionof such sequences will lead to the origin of SINEs with better and better affinityfor the reverse transcription and integration processes (34, 134). Although it is notclear how SINEs replicate or integrate, the similarity in the varying sizes of theirflanking target DNAs, the shared presence of integrated polyA tails, and their fre-quent correlative abundance with LINEs and pseudogenes suggest that they mayuse LINE-specified functions (15, 39, 133, 179).
RETROTRANSPOSON DISPERSAL AND ORGANIZATIONIN THE PLANT GENOME
Major efforts have been made recently to study the genomic and physical organiza-tion of retrotransposons in gymnosperms and angiosperms. Their replicative modeof transposition and amplification has made retrotransposons highly successful ininvading almost all parts of plant genomes, where they can account for over 50%of the nuclear genome (90, 101, 124, 128, 141, 142, 156, 158, 210).
Chromosomal Locations
In situ hybridization data on metaphase chromosomes and prophase nuclei haverevealed that Ty1-copia retrotransposon sequences are dispersed throughout theeuchromatin, sometimes evenly and sometimes unevenly, depending on the plantspecies and the particular element employed (Figure 2) (17, 61, 124, 140–142).However, many elements are present in lower amounts or absent in certain regions(e.g. centromeres, interstitial and terminal heterochromatic regions, and ribosomalDNA sites) (Figure 2, Figure 3a,b) (17, 61, 124, 140, 141). There are exceptions tothis general observation, though, including elements that are preferentially abun-dant at the terminal heterochromatic regions ofAllium cepa(Figure 2) (142) orthe paracentromeric heterochromatic region ofA. thalianaandCicer arietinum(Figure 2) (17, 61). For example,Athila, which consists of LTRs flanking two
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 491
Ty1-
copi
a
in A
llium
Ty1-
copi
a is
no
t det
ecte
d in
Vic
ia, b
arle
y,
oat,
Pin
us
Ty1-
copi
a
in A
. ind
ora
Ty1-
copi
a &
Ty
3-gy
psy
in m
aize
Ty1-
copi
a &
Ty
3-gy
psy
LIN
Es
in A
rabi
dops
is,
cere
als,
Cic
er
Ty3-
gyps
y in
cer
eals
Not
de
tect
ed
yet
Kno
b D
NA
Nuc
leol
us
orga
nize
r re
gion
Inte
rstit
ial
hete
roch
rom
atic
re
gion
s
Term
inal
he
tero
chro
mat
ic
Cen
trom
ere
Par
acen
trom
eric
reg
ions
Ty1-
copi
a
is n
ot
dete
cted
in
rye
Ty1-
copi
a Ty
3-gy
psy
LIN
Es
Gen
eS
INE
sD
iffer
ent d
ensi
ty li
ne
indi
cate
s di
ffere
nt
retr
otra
nspo
sons
Fig
ure
2D
istr
ibut
ion
ofre
trot
rans
poso
nsin
am
odel
plan
tchr
omos
ome.
Inm
any
case
s,Ty
1-co
pia,
Ty3-
gyps
y,L
INE
and
SIN
Ere
trot
rans
poso
nsar
edi
sper
sed
wid
ely
thro
ugho
utpl
antc
hrom
osom
es.H
owev
er,e
ach
retr
otra
nspo
son
grou
pal
soha
sfa
mili
esth
atar
ecl
uste
red
orde
ficie
ntin
spec
ific
regi
ons
ofch
rom
osom
es.F
orex
ampl
e,so
me
Ty3-
gyps
yre
trot
rans
poso
nsar
ecl
uste
red
inth
ece
ntro
mer
icre
gion
sin
mai
ze,r
ye,a
ndw
heat
,whe
reas
som
eTy
1-co
pia
retr
otra
nspo
sons
are
pres
enti
nth
ete
rmin
alhe
tero
chro
mat
icre
gion
ofA
lliu
mce
pach
rom
osom
esor
abse
ntfr
omth
esa
me
regi
onin
rye
(Sec
ale
cere
ale)
.R
etro
tran
spos
onse
quen
ces
have
not
been
foun
din
the
nucl
eolu
sor
gani
zer
regi
ons
inpl
ants
.M
any
retr
otra
nspo
sons
are
pref
eren
tially
foun
din
euch
rom
atic
regi
ons
whe
rem
ost
gene
sar
elo
cate
d.Se
quen
cean
alys
isof
larg
ege
nom
iccl
ones
inm
aize
has
reve
aled
that
thes
eeu
chro
mat
icre
trot
rans
poso
nsar
em
ostly
loca
ted
inin
terg
enic
regi
ons.
P1: FDO
December 17, 1999 15:5 Annual Reviews AR095-02
?Figure 3 Physical locations of the Ty1-copia (a, b), Ty3-gypsy(c), LINE(d, e), and SINE (f, g) retroelements along metaphase chromosomes ofplants have been shown using in situ hybridization.
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?492 KUMAR ■ BENNETZEN
non-retrotransposon open reading frames (ORFs), is concentrated in the paracen-tromeric heterochromatin ofArabidopsis, where it is associated with the major180-bp centromeric satellite DNA (144).
Recently, several groups have identified retrotransposon sequences in the cen-tromeric regions of several cereal species (Figure 2) (1, 121, 150). Probes withsequence similarity to Ty3-gypsywere isolated from a BAC clone of a sorghumcentromere. The probes hybridized exclusively to centromeric regions of sorghumchromosomes and those of allGramineaetested (121). A barley homologue of thissorghum Ty3-gypsyintegrase sequence was used to clone a barley retrotransposon(cereba) that was subsequently found to be almost exclusively localized withinthe centromeric regions of several cereal species, including barley, wheat and rye(Figure 3c) (150).
The presence of Ty3-gypsyretrotransposon sequences in the centromeric re-gions of all investigated species ofGramineaeis intriguing. This suggests thatthese retrotransposons either are ancient insertions that amplified before the di-vergence of theGramineaeor have independently arrived and/or amplified, plusbecome preferentially located in centromeric regions, in all of these species. Thehigh degree of conservation ofcerebaretrotransposon sequences within the cen-tromeric regions of theGramineaespecies indicates a possible association withcentromeric function. Earlier studies have shown that someDrosophilaretrotrans-posons are stable components of the telomeric and centromeric heterochromatin(146). However, it remains to be determined whether plant retrotransposon se-quences have any direct roles in centromere and/or telomere functions.
Recently, individual full-size copies ofGrande, Zeon-1, PREM-2, RE-10, andRE-15 retrotransposons have been found to interrupt tandemly arranged 180-bpunits of knob DNA associated with cytologically detectable heterochromatic com-ponents of pachytene maize chromosomes (1). Knob DNA in maize is associatedwith several genetic effects, including segregation distortion in the female game-tophyte and late flowering time. Previously, genetic effects associated with knobheterochromatin have been attributed only to the 180-bp repeats. However, the oc-casional retrotransposon insertions could also be involved in attenuation or controlof the multiple genetic effects associated with knob DNA.
LINEs and SINEs, the non-LTR retrotransposons, also have been studied fortheir chromosomal distributions using in situ hybridization. Elements of both typesshow dispersed chromosomal patterns of distribution, but some elements show dis-tinct clustering in specific regions of chromosomes. For example, some LINEs insugar beets are preferentially clustered in the subtelomeric regions of most chromo-some arms (Figure 3d,e) (91). Like the Ty1-copia retrotransposons, these LINEsexhibited almost no hybridization signal in the region of the tandem 18S-5.8S-25SrDNA gene repeats (Figure 3e) (91, 161). In situ hybridization analyses with theSINE element S1Bn of Brassica napusyielded a dispersed pattern of chromoso-mal localization, but also showed hybridizational signal in the centromeric regionsand a tendency to co-localize with rDNA sites (Figure 3f, g) (53). Although S1Bnelements are methylated at a level twice that of the average methylation level of
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 493
theB. napusnuclear DNA, their chromosomal localization was not predominantlyassociated with some extensively methylated regions of chromosomes, like theheterochromatin associated with centromeres and telomeres (53).
From many in situ and genomic sequencing analyses, it initially appeared thatthe percentage of retrotransposons in euchromatic regions as compared to hete-rochromatic regions was generally high. However, these studies were biased byboth the common selection of retrotransposons originally found near genes asthe in situ hybridization probe (38), and by the fact that genomic sequencing ef-forts were focused on genic regions (11, 182). Recent data from the sequencing ofArabidopsiscentromeric regions suggest that retrotransposons are highly enrichednear centromeres, and often arranged as nested series like those seen in intergenicregions of the more complex maize genome (158; R Martienssen, R McCombie& R Wilson, personal communication; http://www.cshl.org/protarab).
Database searches have shown that retrotransposon sequences are present ei-ther within, or close to, many genes. These retrotransposon legacy sequences areoften fragmentary and apparently ancient (19, 202, 203). However, analysis of thedistribution of retrotransposons within a contiguous 240-kb stretch of the maizegenome revealed that most retrotransposons are located between genes (158). Itmay be that many retrotransposons have evolved to transpose primarily into rela-tively inactive regions of plant chromosomes, such as intergenic spacers and otherrepetitive sequences, to avoid mutating genes at a high frequency (158). Otherwise,the host cell would accumulate a lethal level of mutations. In this way, retrotrans-posons could proliferate as dispersed sequences without being deleterious to thehost genome. By minimizing the generally negative (e.g. mutational) effects ofindividual retrotransposons, these elements could actively increase host genomesize.
Retrotransposon Abundance in Plants Relativeto Other Eukaryotes
Genomic sequencing of the 80–90% of theArabidopsisgenome that does not in-clude centromeric heterochromatin, telomeres, or satellite repeats has revealed arelatively low density and simple arrangement of repetitive DNAs. In a contiguousregion of approximately 1.9 Mb on chromosome 4 ofA. thaliana(genome sizeof ca 130 Mbp) (11) a retrotransposon was found about every 130 kb, suggestingthat about 1000 of these elements will be found in the genic regions of the entiregenome. These would comprise about 4% of theArabidopsisgenome, with themore concentrated retrotransposons in the centromeric regions accounting foranother 4–6%. The 4-Mbp genome of the yeastSaccharomyces cerevisiaecon-tains 331 retrotransposon sequences that constitute about 377 kb, or 3.1%, of theyeast genome. These elements belong to five families of LTR retrotransposons;217 Ty1, 34 Ty2, 41 Ty3, 32 Ty4, and 7 Ty5 (87). The non-LTR retrotrans-posons, LINEs and SINEs, appear to be absent from theS. cerevisiaegenome.In contrast, the filamentous fungusMagnaporthe grisea(with a genome size of
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?494 KUMAR ■ BENNETZEN
about 40 Mbp) contains both LTR and non-LTR retrotransposons at copy num-bers varying from a few to 100 (84). InCaenorhabditis elegans, which has agenome size of about 100 Mbp, most of the retroelements appear to be non-LTRretrotransposons, including 59Rte-1family members, of which 9 are full-length(3298 bp) (209). A few Ty3-gypsygroup retrotransposons have been identifiedin this nematode, but it appears that the Ty1-copia group is either absent orrare. Furthermore, there have been no reports of either retrotransposon activity orretrotransposon-induced mutations inC. elegans. Most of the identified retrotrans-posons inC. eleganswere found to be defective (E Berezikov, personal communi-cation). InDrosophila melanogaster, which has a genome size of about 165 Mbp,both LTR-and non-LTR retrotransposons are present. Unlike yeast,C. elegans,andArabidopsis, Drosophilaalso contains active retroviruses (170). To date, thereare at least 23 families of LTR retrotransposons and retroviruses, and at least11 families of non-LTR retrotransposons, that have been identified inDrosophila(http://fly.ebi.ac.uk:7081/transposons/lk/melanogaster-transposon.html). Together,these retrotransposons constitute 5–10% of theDrosophilagenome. In stark con-trast toC. elegans, many of theDrosophilaretrotransposons are active and fre-quently generate insertional mutations.
Interestingly, SINEs are either absent or rare in the genomes ofArabidopsis,C. elegans, andDrosophila. These are likely to be exceptional cases, because closerelatives of these species do contain SINEs. For example,Brassica napus, whichbelongs to the sameCruciferaefamily as doesArabidopsis, contains numerousSINEs (32). It is not clear why these small genome eukaryotes (Arabidopsis, about130 Mbp;C. elegans, about 100 Mbp;Drosophila, about 165 Mbp) are deficientin SINEs. One possibility is that these organisms may have an effective process forremoval of retrotransposons and pseudogenes from their genomes. Alternatively,the relatively low activity of LINEs in these species (exceptDrosophila) mayhave led to a very low frequency of new SINE insertions (39). Further research isrequired to resolve this enigma.
Genome size variation is correlated both with the total mass of retrotrans-posons that are present and with the number of different retrotransposon families.As mentioned above, the 4 MbpS. cerevisiaegenome contains five different LTRretrotransposon families that account for about 3% of the genome (87). InAra-bidopsis, both LTR- and non-LTR retrotransposons have been identified, includingTy1-copia, Ty3-gypsy, and LINE elements, with 1–100 families of each group that,in total, probably account for 4–10% of total nuclear DNA, whereas in maize, thereare thousands of different families of LTR and non-LTR retrotransposons that to-gether account for 70–85% of the nuclear genome (156, 158).
In most mammalian genomes, including the human genome, LINEs and SINEs(e.g. L1 and ALU elements) appear to be the major mobile sequences, representingabout 35% of the total DNA, with up to 100,000 copies of each element type(167). Interestingly, although retroviruses and their remnants, including retroviralelements with deletedenvgene, (i.e. Ty3-gypsy-like elements), are widespread inmammals, LTR retrotransposons of the Ty1-copiagroups appear to be absent or
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 495
rare. In contrast, LINEs, SINEs, Ty3-gypsy, and Ty1-copiaretroelements appear tobe abundant in plants. All in all, plant genomes contain an exceptionally abundantand diverse set of retrotransposons compared to most other eukaryotic genomes,with the exception that functional retroviruses are either absent or present in verysmall numbers.
Local Structures
In most cases, plant retrotransposons have been studied initially as single insertions(12, 20, 56, 69, 70, 72, 76, 78, 91, 101, 128, 131, 155, 187, 188) or as PCR ampli-fied fragments of a population of elements (45, 46, 82, 83, 92, 141, 142, 172, 194).Hence, these studies did not provide much information regarding the local orga-nization of elements within a small chromosomal region. Similarly, standard insitu hybridization experiments only reveal gross general patterns. Recent genomicsequencing studies have begun to provide information on the local organization ofretrotransposons and their interspersion profiles relative to genes.
The first studies of the interspersion patterns of genes with repetitive DNAsin plants were initiated over 25 years ago, utilizing genomic DNA renaturationanalyses (44). It was not known then that most of the repetitive sequences underinvestigation would turn out to be retroelements, primarily LTR retrotransposons.However, these Cot analyses routinely indicated that most of the repetitive DNAswere interspersed with low copy-number, presumably gene-containing, sequences(44, 59). These studies had the disadvantage that they primarily provided resultsthat averaged the actual individual situations across an entire genome.
Genomic sequence analyses have largely confirmed the predictions from Cotstudies. The interspersed repetitive DNAs, mostly retrotransposons and a class ofsmall DNA transposable elements called miniature inverted repeat transposableelements (MITEs) (203), are intermixed with each other and with genes in domainsthat are usually 100 kb or less in size. Many transposable elements, commonlyincluding retrotransposon fragments (203), are actually found within functioninggenes. In small genome plants, these retroelements have usually been observed assingle elements or solo LTRs (11, 24). This simple pattern has also been observedin some gene-rich regions of large plant genomes, like those of maize and barley(105, 136). However, the largest segment of a complex plant genome sequenced todate, an approximate 225-kb region around maizeadh1, has shown a very differ-ent pattern of retroelement arrangement (158, 182). In this region, LTR retrotrans-posons were very abundant, and were found mainly as elements inserted withineach other. The insertions showed some other biases, including a fivefold prefer-ence for insertion into the LTRs rather than the internal retrotransposon domainsand the fact that 17 of 22 were in the same potential transcriptional orientation(158, 182). These nested LTR retrotransposons were found in six blocks, with oneof these blocks possibly interrupting one of the nine putative genes identified inthe area. In contrast, the three LINEs detected in this region were all found withinputative introns of genes (182). Although further genomic sequencing studies will
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?496 KUMAR ■ BENNETZEN
be needed to determine whether this organizational pattern is common in plants,or an exceptional property of the maize genome, it is interesting that the first se-quencing of DNA nearArabidopsiscentromeres has also uncovered a gene-poorregion that is rich in nested retrotransposons (R Martienssen, R McCombie &R Wilson, personal communication; http://www. cshl.org/protarab).
Insertion Site Preferences
All mobile DNAs exhibit some level of insertion site preference. Among theDNA transposable elements, the degree of specificity can vary tremendously.Even the most random insertion mutagens, like Tn5 ofE. coli, P elements ofD. melanogaster, or Mutator of maize, will insert into some genes at frequenciesover tenfold higher than into other genes. Many of these elements exhibit a pref-erence for insertion into genic regions of a genome, and into specific locationswithin or near a gene (e.g. promoters) (10, 86). Moreover, some elements, likeAcof maize, prefer to insert at sites linked to the location of the starting element (35).
Insertion specificities for retroviruses and retrotransposons have been reportedin many different eukaryotes (27, 99), but this question has not been extensivelyinvestigated in plants. In some cases, the preferential integration sites are locatedin actively transcribed chromosomal regions, particularly in or near promoters. Inother cases, preferential integration sites are located in repeated heterochromaticregions. In yeast, most or all retrotransposons show target specificity. For in-stance, the Ty1, Ty2, and Ty3 elements integrate preferentially upstream of genestranscribed by RNA polymerase III, including tRNA, 5S, and U6 genes, whileTy5 primarily inserts in silenced regions of the yeast genome, including silencedmating-type cassettes and telomeric regions (50, 211).
It is not entirely clear how and why retrotransposons target specific regions orsequences of the host genome. Selection of the insertion site may be made by anintegration complex, consisting of retrotransposon-encoded integrase, a reversetranscribed cDNA copy of the element, and various host-encoded factors (27, 99).The most precise information regarding preferential insertions has come fromresearch on yeast retrotransposons. For example, the Ty3 integration complex wasfound to have an affinity with transcription factors that interact with all tRNAgenes (88). Similarly, there is preferential insertion into silent chromatin by Ty5.This specificity can be abolished by a single amino acid change in the integraseprotein (50) or by mutations in nuclear proteins that specifically interact withsilent/heterochromatic regions of the yeast genome (211).
In plants, there is evidence that some retrotransposons show preferential inser-tion into some genomic regions. For example, many Ty1-copia retrotransposonsare found preferentially within euchromatic regions in those plant species studied todate (51, 69, 70). In contrast, some of the cereal Ty3-gypsygroup retrotransposonsaccumulate primarily within heterochromatic regions, such as near centromeresand the knob DNA in maize. Also, Zepp, a LINE-like retrotransposon inChlorella,which accumulates in the telomeric region, integrates preferentially into itself and
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 497
another Zepp sequence (62). It is not clear whether the biased accumulation inany of these cases is due to preferential insertion, or perhaps to a less active pro-cess for removal from these regions and/or less fitness selection acting againstretrotransposons in these regions.
When specificities are seen, some may be shared by many different retrotrans-poson families. For instance, theGrande, Zeon-1, and RE-15 retrotransposonseach were found inserted at the same site in the 180-bp repeat sequences of knobDNA (1). Both Ty1-copia and Ty3-gypsyretrotransposons have been found inintergenic locations within euchromatic regions of the maize genome (158). Incontrast, the low-copy-number Ty1-copiaretrotransposonsTos17of rice andTpv2of common bean,Phaseolus vulgaris, are mostly targeted near and within genes(51, 70). Therefore, it appears that each retrotransposon has evolved its own patternof insertion within a plant’s genome.
In Organellar Genomes
The completed sequences of both chloroplast and mitochondrial genomes fromplants (109, 185) have allowed investigation of whether these additional cellulargenomes contain any retrotransposons or retrotransposon legacies. There have notbeen any reports of retrotransposons or retrotransposon fragments within the smalland gene-dense chloroplast genome of plants. In contrast, mitochondrial genomesappear to contain many fragments of Ty1-copia, Ty3-gypsy, and LINE-like retro-transposons. No intact elements have been seen, and there is no evidence of mu-tations caused by retrotransposon insertion. About 4% or more of theArabidopsismitochondrial genome consists of these retrotransposon fragments, through 9 ormore independent acquisition events (185). It is possible that these fragments ac-cumulated within the relatively large and repeat-rich plant mitochondrial genomesas a passive outcome of DNA acquisition from the nuclear genome (14, 162).
REGULATION OF EXPRESSION AND TRANSPOSITION
The expression of retrotransposons in plants and in other eukaryotic organismsis regulated, thereby regulating their transposition frequency in the host genome.The evolution of control mechanisms for the transcription and transposition ofretrotransposons in the host genome may be crucial to minimize their possibledeleterious effects on the host. Thus, it is not surprising that plant retrotransposonsare transcriptionally silent in most plant tissues during development.
Transcriptional Regulation
Because retrotransposons cannot transpose without the presence of an RNA tem-plate available for reverse transcription, the simplest way to control their activitywould be via regulation of transcriptional initiation. Many retrotransposons showunique patterns of developmental and/or environmental regulation. For instance,
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?498 KUMAR ■ BENNETZEN
transcripts of tobaccoTnt1, barleyBARE-1, and maize PREM-2 have been detectedprimarily in roots, leaves, or young microspores, respectively (148, 172, 173, 184).Likewise, inS. cerevisiaeandDrosophila, the expression of retrotransposons isinduced by hormonal and developmental factors (16, 33).
There are only a few reports on the expression and regulation of non-LTR retro-transposons in plants. For example, the SIBn family of SINEs has been shown tobe expressed in shoots, roots, and callus tissues (31). Moreover, transcriptionalanalysis of SIBn has revealed that the major SIBn RNAs result from cotranscriptionderived from an element present in a POL II transcriptional unit. Only rarely dotissue-specific SIBn transcripts result from the expression of a small number of ge-nomic elements by POL III (31). It appears either that the SIBn POL III promoteris highly repressed inB. napuscells or that transcriptional signals external to theSINE element are necessary in vivo to obtain transcription. Our knowledge of theregulation of LINE expression in plants is limited. In the mouse genome, it has beendemonstrated that LINE-1 expression is under the control of a tightly regulated tem-poral and spatial program of events during development and differentiation (183).
A correlation between transcription and transposition of retrotransposons hasbeen demonstrated for the tobaccoTto1and riceTos17retrotransposons (64, 70).For example, transposition ofTto1andTos17was concomitant with an increase inthe levels of their RNAs, suggesting that transposition of these retrotransposonsis regulated mainly at the transcriptional level. For example, a tenfold increase inthe copy numbers ofTto1 elements was observed in cultured cells, whereTto1transcripts are very abundant (64). A similar pattern of increase in copy number forthe riceTos17elements during tissue culture conditions also has been demonstrated(70). However, this is not the situation for several other retrotransposons. Forexample, barleyBARE-1(an active element in the recent past) is highly transcribedin leaves, but its transposition has not been observed (172). This reminds us that,although transcription is a prerequisite for transposition of an element, other stepsare also required and may be regulated.
Activation by Biotic and Abiotic Stresses
Many of the plant retrotransposons studied to date are transcriptionally activatedby various biotic and abiotic stress factors (55). The expression of the tobaccoTnt1 and Tto1 retrotransposons is greatly increased by several abiotic stresses,including protoplast isolation, cell culture, wounding, methyl jasmonate, CuCl2,and salicylic acid (64, 120, 125, 147, 148, 177, 178). Similarly, various biotic stressfactors such as fungal extracts fromTrichoderma viride(148, 178) or inoculationwith various viral, bacterial, or fungal pathogens (119, 147) have been shownto activate transcription of these retrotransposons. In contrast toTnt1 andTto1,transcription ofTos17is induced only by tissue culture. TobaccoTto5was isolatedas a salicylic acid-inducible retrotransposons by the RT-PCR method (177). Atpresent, it is not known whether abiotic stresses such as cold and heat treatment canalso induce transcription in these retrotransposons. Thus, these results indicate that
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 499
retrotransposons carrycis-acting sequences that control their expression patternsin the host.
Regulatory Sequences
Transcription of LTR retrotransposons is controlled bycis-acting elements in the5′ LTR and adjacent untranslated regions. The structure and function of promotersfromTnt1A,Tto1, andBARE-1have been extensively studied using LTR-GUS tran-scriptional fusions in transformed plants. Transient and stable expression assays ofLTR-GUS fusions containing various lengths of 5′ LTR regulatory sequences haveshown that the majorcis-acting sequences controllingTnt1AandTto1expressionare located within the U3 region of the 5′ LTR (147, 148, 177, 178). In the case ofTnt1, the U3 region contains severalcis-acting sequences, including a short palin-dromic sequence named BI, and a segment of 31 bp tandemly repeated in three orfour copies, named BII. Both BI and BII behave as transcriptional activators. TheBII box binds specific protein factors in tobacco protoplasts (22), and has beenshown to be involved in the transcriptional activation ofTnt1by several biotic andabiotic factors (55, 189).
A similar study on the LTR promoter ofTto1using LTR-GUS constructs hasshown that the 5′ LTR containscis-acting sequences involved in the induction ofTto1expression by abiotic and biotic stress factors (177, 178). Deletion analysison the LTR promoter has shown that enhancer-like sequences upstream from theTATA box are required for a high level of expression in the callus. Recently, bygain-of-function analysis, a 13-bp repeated motif (TGGTAGGTGAGAT) has beenidentified as a positivecis-acting regulatory sequence in the U3 region. This motifis involved in expression in protoplasts and callus, as well as in wound- and methyljasmonate–induced expression in leaves (178). Furthermore, gene expression me-diated by the 13-bp motif was found to be associated with genes that conditionplant defense against microbial attack (178). For example, the complementarysequence of the 13-bp motif contains the box L sequence (also called AC-I) andthe core sequence of the H-box elements, which are highly conserved among thepromoters of phenylpropanoid biosynthesis genes (178).
To gain further insight into the regulation ofTto1 and defense-related genes,cDNAs encoding four different proteins that bind to the 13-bp motif have beenisolated (178). One protein (LBM1, LTR binding MYB) is identical to the pre-viously reported MYB-1, which is induced by virus infection. LBM1 contains aDNA binding domain conserved among the MYB class of transcription factors.Accumulation of LBM1 mRNA was induced rapidly and transiently by cuttingleaf tissues and by elicitors. LBM1 cantrans-activate reporter gene expressiondependent on its binding to the 13-bp motif. A 10-bp sequence (TTGGTAGGTG)containing a conserved motif between box L and the H-box has been shown tobe essential for the binding of LBM1 protein to the 13-bp motif. LBM1 also ac-tivated transcription from the promoter of the phenylalanine ammonia lyase gene(Pv-PAL2) in tobacco protoplasts through binding to the box L and the H-box-like
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?500 KUMAR ■ BENNETZEN
motif (box P). These results suggest that the LBM1 protein may be involved inthe stress response ofTto1and defense-related genes (178; H Hirochika, personalcommunication).
The activation of the expression of tobacco retrotransposonsTnt1 and Tto1by pathogen-derived factors that are involved in the activation of plant defenseresponses is intriguing. Moreover, thecis-acting regulatory motifs of both retro-transposons share similarities with the H-box motif involved in the activation ofseveral plant defense genes (55). The finding that the 13-bp motif contains theL/Ac-I/H-box motif, conserved among promoters of the phenylpropanoid biosyn-thetic genes, raises the question of whether an ancientTto1-like retrotransposoncontributed to acquisition ofcis-regulatory sequences in promoters of some de-fense genes (178). Alternatively,Tto1 may have acquired this sequence from adefense gene, or they could have evolved independently in defense genes andretrotransposons by convergent selection. Takeda and coworkers have analyzedthe promoter of the defense-activated asparagusAoPR1gene (200) and have foundthat it contains an H-box-like motif and several other regions homologous to thecomplementary sequence of theTto1LTR promoter. Interestingly, theAoPR1pro-moter can be activated in a similar fashion to theTto1LTR promoter by abioticand biotic stresses, including tissue culture, wounding, pathogen infection, andsalicylic acid. However, the biological significance of this, as well the origin ofthese regulatory sequences in retrotransposons and host genes, is unclear at presentand warrants further investigation.
LTR-GUS construct analyses have shown that theTnt1andTto1LTR promotersare active in heterologous systems, including transgenicArabidopsis, tomato, orrice and that the expression patterns are similar to those seen in the natural host,tobacco. Moreover, activation of theTnt1andTto1elements by abiotic and bioticstresses is maintained in heterologous hosts (69, 125). However, theTnt1promoter,with expression confined to roots in the homologous tobacco system, was alsoexpressed in the flowers of transgenicArabidopsisand tomato (125), and wasfound to be induced by treatment with auxins inArabidopsis(139). These resultssuggest that, beyond the conservedcis-acting regulators involved in abiotic andbiotic stress induction of these retrotransposons, there are also some host-specificregulators of the expression of theTnt1promoter.
Epigenetic and Posttranscriptional Regulations
Host cells, and possibly the retrotransposons themselves, have evolved mecha-nisms to minimize the negative effects of element retention and transposition. It hasbeen proposed that DNA methylation evolved as a means of containing the spreadof transposable elements and viruses in host genomes (208). This defense modelhas been supported recently by analysis of an interspecific mammalian hybridbetweenMacropus eugeniiandWallabia bicolor in which genome-wide under-methylation of the hybrid genome and a high level of amplification of retroelementsare correlated (132).
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 501
In plants, analyses of repetitive DNAs have routinely shown that they are cy-tosine methylated in the sequences 5′-CG-3′ and 5′-CNG-3′. These methylatedDNAs include those flanking standard plant genes in species like maize (9), andthese methylated sequences have now been shown to be mostly LTR retrotrans-posons (158). Cytosine methylation has been associated with genetic inactivityand a heterochromatic state in many higher eukaryotes, including plants where denovo methylation was first detected during the inactivation of DNA transposableelements of theMutator andAc/Ds families (10, 97). This cytosine methylationhas been associated with both transcriptional inactivity and a higher rate of C to Ttransitions, giving rise to enhanced transcriptional and mutational silencing (157).It is not clear, though, despite many years of study in many species, whether themethylation itself establishes the inactive state of a retrotransposon, or whether itis a secondary (possibly maintenance) outcome of inactivation initially caused bya change in chromatin structure.
The most recent evidence on the role of DNA methylation in the regulation ofretrotransposition comes from a study onTto1by the Hirochika group in Japan (67).It was observed that after the initial active retrotransposition in theArabidopsisgenome,Tto1 became silent. However, the silencing ofTto1 was only observedin those transgenicArabidopsislines whereTto1copy number was increased byactive transposition during culture-mediated transformation process. Thus, thissilencing appears to mimic repeat-induced gene silencing (115). This silencingcorrelated with a reduced level of theTto1 transcript and with methylation of theinactivatedTto1. To investigate the causal relationship between DNA methyla-tion and silencing ofTto1, the hypomethylation mutation,ddm1, was introducedinto the line carrying silencedTto1 copies through conventional crossing. In ahomozygousddm1background, the silencedTto1 became hypomethylated andits transcription was activated. This result suggests that DNA methylation may beresponsible for silencingTto1 in wild-typeArabidopsis(67).
Interestingly, in theDrosophilaandS. cerevisiaegenomes, which lack cytosinemethylation of DNA in general, an inactive chromatin state can be established andmaintained without the need for DNA methylation (36, 58). An even more drasticcontrast is shown by the exceptional case of the chordateCione intestinalis, wheretransposable elements are generally unmethylated and most genes are methylated(166). Hence, DNA methylation is not obligatory for genetic silencing in all eu-karyotes, although a standard structure for inactive chromatin with deacetylatedhistones may be universal (154).
It is not clear how an epigenetic silenced state is either initiated or maintainedfor retrotransposons. In many cases, epigenetic inactivation of an element may oc-cur because of its insertion in or near an already heterochomatic block of DNA, asis the likely case for the insertion of the most abundant maize retrotransposons intomethylated, intergenic blocks (9, 158, 182). Silencing in this case could occur by aprocess analogous to position effect variegation inDrosophila, where movement ofnormally euchromatic genes into a heterochromatic region leads to a progressive ra-diation of the silenced heterochromatic state into the translocated gene(s) (196). In
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?502 KUMAR ■ BENNETZEN
other cases, retrotransposons may be epigenetically modified and/or posttranscrip-tionally suppressed by a process similar to homology-based gene silencing, firstdetected in plants but now also identified in some animal systems (75, 135). Someforms of paramutation (71), the coordinate inactivation of a transposable elementfamily, and homology-based gene silencing all involve correlated DNA methyla-tion and transcriptional inactivation of repeated sequences scattered throughout aplant genome (112, 115). How homologous sequences widely dispersed through agenome are found and inactivated is not clear, nor is it completely clear why repeatslike transposable elements, transgenes, and their chromosomal homologues aretargeted for silencing while normal gene families largely escape this inactivation.Numerous studies are under way in this area (60), and the insights obtained will betremendously valuable to understanding the regulation of plant retrotransposons.
In the yeastS. cerevisiae, several studies have shown that the transpositionof Ty1 elements is controlled at posttranscriptional steps (99). A recent work il-lustrates the concept that conserved cellular functions, such as those involved indifferentiation and DNA repair/recombination, have been adapted by the host forpreventing rampant proliferation of retrotransposons (reviewed in 29). Host genesthat encode Fus3p (a MAP kinase involved in the mating pheromone signal trans-duction pathway) and TFIIH (an RNA polymerase II general transcription factorinvolved in initiation and promoter clearance, and in transcription-coupled repair)have been shown to regulate a posttranscriptional step in retroviral replication inyeast. It appears that Fus3p inhibits the accumulation of Ty1 proteins and thatTFIIH promotes degradation of Ty1 cDNAs. It is not known whether similar hostgene-mediated or other types of posttranscriptional control mechanisms exist forthe regulation of retrotransposition in plants.
Retrotransposon Regulation as a Form of Host Defense
Some retrotransposons might have beneficial effects on a plant, through mutationsthat provide new regulatory properties to a gene (202) or perhaps by contributingto DNA repair (123) or centromere function (121, 150). However, many of theexpression, mutation, and insertion properties of these elements suggest that theireffects are being minimized. With any highly adapted host/parasite interaction,the parasite will contribute as little as possible to decreased host fitness. The hostshould also evolve such minimization/defense processes, and several seem to beacting on plant retrotransposons.
Most of the known retrotransposons appear to be defective in their ability toencode all necessary transposition functions, owing to insertions, deletions, andother mutations (45, 46, 72, 76, 114, 141, 186, 194, 203). The epigenetic regulationof plant retrotransposons associated with DNA methylation and presumed hete-rochromatization may also be involved in keeping retrotransposon transcriptionat a low level (208). In maize, for example, the retrotransposons that make up50–80% of the DNA contribute only 10% or less of the mRNA in most tissuesunder most circumstances (B Bowen, personal communication). Moreover, thisDNA methylation is associated with a two- to threefold higher transition mutation
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 503
rate in these elements, thus causing them to decay to a nonfunctioning form morerapidly than other sequences.
Retrotransposons can inactivate other retrotransposons by inserting within thema structure often seen in the intergenic regions of the maize genome (158). In acompetition between retrotransposons for use of host resources, this inactivation ofone retrotransposon by insertion of another would be an ingenious approach (95).Moreover, the host might benefit from such an insertion pattern, because theseelements would not be generating mutations as often, and thus natural selectioncould act on host factors that regulate this insertion specificity.
PLANT RETROTRANSPOSON ORIGINS AND EVOLUTION
Very little is known about the timing or particular steps in the origin of retro-transposons overall, or about their arrival within a particular kingdom or species.Sequence comparisons of conserved gene products encoded by LINEs and LTRretrotransposons suggest that the first retrotransposons were LINE elements, andthat LTR retrotransposons evolved from them by the acquisition of terminal directrepeats (207). InDrosophila, tandem insertion by theHeT-Aelement suggests howthis process could simply create two flanking LTRs (30, 40). Retroviruses appearto have evolved in animals from a subclass of Ty3-gypsyretrotransposons by ac-quisition of an envelope function (180, 207). Some believe that retrotransposonsoriginated at the beginning of the transition from a putative RNA-based cellulargenome to the DNA-based genome now shared by all living cells (52).
The ubiquity of plant retrotransposons and their extreme sequence heterogeneitysuggests that they were present in the first plants (46, 95, 129, 176, 194). Alterna-tively, retrotransposons may have originated after the creation of the first eukary-otes, and have reached their current wide dispersal by a combination of verticaland horizontal transmission. Nonetheless, they have clearly been highly successfulin proliferating within all higher eukaryotic genomes. Their contributions to plantgenome evolution are many and significant (5, 48, 55, 93, 95, 97, 202). As with anyother DNA sequence present in a plant genome, selective pressures can act to uti-lize that retrotransposon sequence for the benefit of the entire organism, if anysuch benefit can be acquired by chance mutational alterations. Hence, we expectthat retrotransposons or retrotransposon components will occasionally be found tocontribute important factors to plant fitness, as has been seen with the contributionsof these elements to the promoters of many “wild-type” plant genes (203).
The Frequency and Timing of RetrotranspositionDuring Evolution
Although a great deal of information exists regarding the genomic organizationof retrotransposons, little is known concerning the frequency and possible period-icity of element transposition during plant evolution. These questions have beenmore deeply investigated with the DNA transposable elements.Mutatorof maize,
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?504 KUMAR ■ BENNETZEN
for instance, commonly transposes more than once per plant generation, therebyincreasing copy numbers several fold and making many mutations in a single grow-ing season (10, 97). Since the survival and evolution of retrotransposons and thelarge-scale structure of plant genomes are greatly influenced by the transpositionalactivity of retrotransposons, it is important to understand this aspect of retrotrans-poson behavior. Recently, element-anchored PCR analyses have been used to studythe insertional polymorphism of the Ty1-copia group retrotransposons in plants(41, 57, 143, 201). Those studies have shown thatBARE-1insertions, for example,are highly polymorphic in barley, wheat, rye, and oat, suggesting that the trans-positional activity of this retrotransposon has persisted for millions of years inseveral cereal species (57, 201). Recently, a similar study has shown that a fewretrotransposons have been transpositionally active in the recent past in severalPisumspecies (41, 143).
The transpositional activity of plant retrotransposons varies considerably onthe evolutionary time-scale, with some retrotransposons having been active in thedistant past whereas others have been active more recently. A Ty1-copiaelement,R9of rye [about 50,000 copies per genome (140)], is similar toBARE-1of barley[about 70,000 copies per genome (172)]. Gel blot hybridization analyses indicatethat a close relative of theR9subgroup is present in high copy numbers in barley,oat, and wheat. Although Northern analysis on RNAs from seedlings shows thatthe R9/BARE-1subgroup is transcribed in all these cereal plants, the amount oftranscript varies from high in barley, to moderate in wheat and rye, to extremely lowin oat (140). It will be interesting to determine how these retrotransposons havebecome different in transcriptional activity between these closely related cerealhosts, or whether differences in the host transcriptional machinery account for thevariable expression.
A different approach has been used to demonstrate that most retrotransposonsin the maize genome have transposed within the last 2–6 million years (157). Thisstudy was based on calculating the rate of mutations in the LTR sequences ofseveral retrotransposons in theadh1region of maize. As pointed out earlier, thetwo LTRs of a retrotransposon are synthesized from a single template during theirreplication cycle. Thus, the 5′ LTR is usually identical to the 3′ LTR at the time ofa retrotransposon’s insertion into the genome. Therefore, the date of insertion canbe estimated from sequence divergence between the two LTRs, assuming a fairlyconstant rate of nucleotide substitution since the insertion date.
Some retrotransposon families are essentially specific to one genome in plantsof a hybrid origin. For example, in Triticale, an allopolyploid with two wheatgenomes and one rye genome, a rye-origin retrotransposon probe hybridizesmuch more strongly to the rye-origin chromosomes than to those of the wheatgenomes (140). Likewise, in hexaploid oat,Avena sativa(2n = 6x = 42, withA, C, and D genomes), a retrotransposon probe isolated from a diploid A genomespecies hybridizes only weakly to C-genome chromosomes (83). This indicates thatretrotransposition of these elements has been relatively infrequent since the poly-ploid was created. Otherwise the retrotransposons would now be homogeneously
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 505
distributed across all genomes within the same nucleus. Alternatively, some retro-transposons may have evolved to a degree that they are compatible only withinsertion sites within one particular genome, as appears to be the case for retro-transposon amplification in an interspecific wallaby hybrid (132).
Heterogeneity
Fortuitous discovery by genomic sequencing, identification of retrotransposonsas the causal agents in various mutations, and the directed acquisition of retro-transposons by PCR techniques have all yielded information indicating there isa great variety of retrotransposons both within a single plant and between plantspecies. Both LTR and non-LTR retrotransposons, including the Ty1-copiagroup,the Ty3-gypsygroup, LINE, and SINE, are usually present in a plant genome(Table 1) (5, 55, 95, 97). Among these retrotransposons, the Ty1-copiaretrotrans-posons have been studied in most detail, and this has revealed that they are presentas highly heterogeneous populations in all higher plant genomes. For example,when 31 subcloned fragments of the reverse transcriptase genes of the Ty1-copiagroup retrotransposons from potato were sequenced, each was found to be differ-ent, with predicted amino acid similarities between individual fragments varyingfrom 5% to 75% (45). Such extreme heterogeneity has subsequently been observedamongst the reverse transcriptase genes of Ty1-copiagroup retrotransposons fromseveral monocotyledonous and dicotyledonous plant species (45, 46, 89, 114, 128,140–142, 186). BothD. melanogasterand S. cerevisiaepossess a small num-ber of homogeneous elements, with less than 5% divergence within the families(87, 127). These results indicated that plant Ty1-copiagroup retrotransposons aremore heterogeneous than those in many animals and lower eukaryotes.
There could be several reasons for differences in the degree of heterogeneityof Ty1-copiagroup retrotransposons between plants and other eukaryotes. Retro-transposons proliferate by a replicative mode of transposition, with each replicationcycle capable of generating defective elements. The mutation rate during a sin-gle transposition cycle of yeast Ty1 retrotransposons has been estimated to be2.5× 10−5 (49). Additionally, retrotransposons are capable of generating largepopulations in relatively short evolutionary times. In yeast, it has been found thathigh homogeneity in both sequence and size variation of Ty1 elements is due to ahigh level of genomic turnover of retrotransposons in the genome (80, 87). Thus,the majority of retrotransposons in the yeast genome are derived from transposedevents over a relatively recent evolutionary time. In contrast, a plant genome cantolerate accumulation of large amounts of DNA and does not appear to removethese sequences rapidly. It is possible, therefore, that early plants possessed onlya few active populations of retrotransposons that subsequently proliferated intolarge populations in the host genomes. The mixed populations of active and de-fective retrotransposons then accumulated mutations over time, and eventuallygave rise to highly heterogeneous populations. If, as appears likely (45, 46, 114,187, 191), the majority of plant retrotransposons have been transmitted vertically
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?506 KUMAR ■ BENNETZEN
into subsequent generations, with horizontal transfer between species being veryrare, then heterogeneous lineages should predominate. Another feature, whichmay contribute to the high level of heterogeneity in plant retrotransposons, is thedistinctive strategy for determination of germline in plants. Unlike animals, plantsdo not have a sequestered germline; they have meristematic cells that are capableof differentiating into somatic tissues or reproductive organs throughout the lifecycle. This means that somatic tissues, with mutations accumulated in them duringplant growth, can give rise to germ cells, as long as mutations are not lethal. In ani-mals, because the germ cells are selected early in embryogenesis, it may be that theanimal germ cells are better protected against mutation. All of these phenomenaare likely to be responsible for the current heterogeneity of plant retrotransposons.
Heterogeneity generated in the 5′ LTR sequences and adjacent leader sequencescan have effects on the evolution of retrotransposons. Indeed, extensive sequencevariation has been found among the LTRs and adjacent leader sequences ofcopiaretrotransposons inD. melanogasterpopulations (28). However, the functional sig-nificance of the sequence variation has not yet been demonstrated. In the humanretrovirus-like element, RTVL-H, it has been shown that the LTRs are functionallydiverse and can promote the expression of a linked gene (42). In plants, the tobaccoTnt1 retrotransposons members have been extensively studied for sequence het-erogeneity of both coding domains and U3 regulatory sequences from a number ofNicotianaspecies (23, 190). This study has revealed that there are three differentsubfamilies ofTnt1 elements,Tnt1A, Tnt1B, andTnt1C, that differ completelyin their U3 regions but share conserved flanking regions (190). U3 divergencebetween the three subfamilies is found in the region that contains the regulatorysequences of an activeTnt1-94retrotransposon. On the basis of these data, the au-thors have proposed that this high sequence variability could allow these elementsto evolve regulatory mechanisms in order to optimize their coexistence with theirhost genome. LINEs, SINEs, and Ty3-gypsyelements are also found as heteroge-neous populations in plant genomes, but detailed investigations of their possibleorigins and evolution have not been performed yet.
Internal Mutation
In most plants, the majority of the retrotransposons in the genome have structuressuggesting that they are defective. Many members of a retrotransposon familydiffer by large internal insertions or deletions (72, 111). In some cases, as with the5′ ends of LINE elements, truncated insertions suggest that the internal variationwas generated during transposition. The relatively low replication accuracy ofreverse transcriptase will also increase the rate of internal mutation.
In many cases, defective retrotransposons will continue to persist and evenamplify within a genome. By sequence criteria, many LTR retrotransposons appearto be lacking in some essential component for transpositon, such as the absenceof apol coding potential inBs1(76). The absence of complete coding capacity forone of these elements does not necessarily mean that it will not transpose.Bs1itselfwas first detected as a de novo insertion mutagen, indicating that it had utilized
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 507
POL and any other possible missing functions from some other retrotransposon(76, 78). Similarly, SINEs do not appear to encode any proteins, therefore requiringtrans-encoded factors for every enzymatic or packaging step in their transpositionprocess (133). It is not clear how broad the specificity is for an enzyme encodedby one retrotransposon to act on another retrotransposon of a different family.The similar structure of the target repeats flanking LINEs, SINEs, and intronlesspseudogenes suggests that they use the same mechanism for integration into thegenome, presumably the endonuclease encoded by LINEs (39), although they mayalso be inserted as repair templates at sites of double-stranded DNA breaks (123).The observation that LINEs, SINEs, and intronless pseudogenes all seem to becommon in some species (e.g. humans), while all are relatively rare in other species(e.g. maize) concurs with a LINE-endonuclease model. LTR retrotransposons mayshow some preference for use of their own integrase, but it is unlikely that thisbias is absolute. However, the knowledge that retrotransposons do not need to befully intact in order to transpose does not indicate how or why they are so variablein sequence and so often defective.
A second class of mutation seen for several LTR retrotransposons has beenan apparent deletion of some standard retrotransposon genes, often accompaniedby the presence of additional internal sequences (18, 72, 76, 77, 111, 182, 191).In many cases, these defective retrotransposons have been identified in multiplecopies or as de novo insertions, indicating that they can retrotranspose by the useof trans-acting factors. One class of internal mutation is observed in the case ofsolo LTRs, where an apparent unequal exchange has led to the loss of one LTRand all internal sequences (25, 158, 165, 191). It is not known whether other lossesof internal sequences might be due to ectopic recombination or through a standardDNA breakage/repair mechanism. Sequence acquisitions may occur by ectopicgene conversions like those proposed for theMutator elements of maize (10), orby recombination between an mRNA-derived template and the retrotransposonwithin the replication/integration complex (16).
Regardless of their origin(s), internal variation within retrotransposons canprovide the raw material for either new retrotransposon properties or new capabil-ities acquired by the host genome. Such changes could explain sudden bursts inretrotransposon activity, perhaps by random mutation, to create a retrotransposonpromoter with a new tissue specificity (e.g. in “germline” cells) and/or a higherexpression level. The relative inaccuracies of reverse transcriptases provides anindication that internal variation is programmed into the system, and many host“defense” activities (e.g. cytosine methylation) also will increase variability. Thesealtered elements should be selected for their ability to amplify and persist withina genome, while natural selection acting on the host will utilize this variabilitywhenever possible.
Retrotransposons and Retroviruses
Although LTR retrotransposons are strikingly similar to retroviruses in both ge-nomic organization and their coding sequences, they lack sequences encoding the
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?508 KUMAR ■ BENNETZEN
envelope (ENV) protein (Figure 1). Thus, the retrotransposon replication cycle isrestricted to the cell of the host genome. Theenvgene is required by retrovirusesfor cell-to-cell transfer, thereby allowing infectivity. Until recently, retroviruseswere thought to be restricted to vertebrates. The first conclusive evidence for thepresence of a retrovirus in an invertebrate,Drosophila melanogaster, was reportedin 1994. A LTR retrotransposon of the Ty3-gypsygroup (mdg-4) was shown topossess a functionalenvgene, and has now been reclassified as a retrovirus orendogenous retrovirus (170). As pointed out earlier, it is believed that retroviruseshave evolved from the Ty3-gypsygroup retrotransposons by acquiring theenvgene or vice versa (94, 177, 204). In plants, Ty3-gypsyretrotransposons have beenfound with additional sequences in the region usually occupied byenv, but it is notknown whether they have anyenv-like gene function (24, 72, 76, 206). Recently,a Ty1-copiagroup retrotransposon has been found in soybean that has novelenv-like sequences in the usualenvregion of the retrotransposon (100). This suggeststhat the acquisition ofenv-like and/orenv-localized sequences in Ty3-gypsyandTy1-copiagroups has occurred independently or occurred once in one element andthen was transferred to the others by recombination (94). Recombination betweenretrotransposons appears to be a more common occurrence than was consideredpreviously (79).
The presence of putativeenv-like sequences in plant LTR retrotransposonsis intriguing. In animals, theenvgene is involved in transferring virus particlesfrom one cell to another through the plasma membrane by a receptor-mediatedendocytosis. In plants, a scenario similar to that seen in animals is unlikely to occurbecause of the plant cell wall (94). Enveloped viruses might move from cell to cell inplants via the intercellular channels called plasmodesmata. Virus-encoded proteinsthat modify the molecular size exclusion limit of plasmodesmata (21) usually assistthe movement of plant viruses through plasmodesmata. Such transfer proteins arenot known to be encoded by retrotransposons. Perhaps some of theenv-localizedfunctions that appear to be encoded by some LTR retrotransposons in plants couldperform such a transfer function.
It is possible that the putativeenv-like genes in plants have evolved along dif-ferent lines, leading to ENV-like proteins with either a different function or nocurrent specific role in plants. Future work on this subject should reveal the pre-cise function, if any, of the putativeenv-like gene in plants. In animals, retrovirusesare responsible for some of most deadly diseases known, but such pathogenicityof retroviruses is not known in plants. It may be important to determine what it isabout the biology of plants that has restricted (or even eliminated) the pathogenic-ity of retroviruses in the absence of an identified humoral or cellular immunesystem.
Horizontal Transmission?
Phylogenetic analyses based on sequences of the reverse transcriptase and inte-grase genes from both Ty1-copiaand Ty3-gypsyretrotransposons in plants have
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 509
provided strong evidence, with some exceptions, for the vertical transmission ofretrotransposons (45, 46, 114, 187, 190). Simple inheritance thus appears to be themost frequent mode of transmission. In contrast, the horizontal transmission ofplant retrotransposons has not been demonstrated. This is not surprising, becausemost or all plant retrotransposons lack the envelope (env) gene that is required byretroviruses for cell-to-cell transmission.
Nonetheless,env-like gene sequences have been found in some plant retrotrans-posons. One possible explanation for this result would be that these sequences arelegacies ofenv genes that functioned in another species, perhaps in an insect(170). If an insect retrovirus, for instance, were transferred to a plant cell duringthe feeding process, it is possible that it would prove to be a competent intracel-lular retrotransposon. The success of some human-assisted horizontal transfers ofretrotransposons from one plant species to another indicates that the host factorsneeded for retrotransposition are often conserved in higher plants (69, 108, 139),so natural horizontal transfers should have an equal chance of success. A possibleorigin from insect retroviruses is suggested for many plant retrotransposons be-cause of their transcriptional activation by stress (55). This activation makes sensefor a virus, because damage-associated stress (e.g. insect feeding) suggests that avector is available for transfer to the next possible host.
Similarly, it is possible that retrotransposons could be passively hijacked, in rareevents, by true retroviruses or by some other invading plant pathogen, pest, or en-doparasite (7, 94). Given that eukaryotic cells, including those of plant protoplasts,have the capacity to take up foreign nucleic acids into their genomes, perhaps thechief limitation to horizontal transmissions will be the small number of cells thatcontribute to the next generation (i.e. the germline). However, horizontal transfersof retrotransposons that occur as rarely as once every few million years can betremendously important, particularly if the acquired element has the potential toamplify up to thousands of copies per nucleus (141, 156). Conclusive evidencefor a recent horizontal transfer ofcopia retrotransposon betweenDrosophilamelanogasterandD. willistoni has been demonstrated (80a). Although, the modeof this horizontal transfer is unknown since thecopiaelement lacks the envelopegene and therefore thought to be not infectious. The circumstantial evidence sug-gests that parasitic mites may be involved in vectoring DNA betweenDrosophilaspecies (80a).
One likely mode for the horizontal transfer of retrotransposons from one plantspecies into another would be by wide crosses. Such crosses can lead to at leasttransient loss of epigenetic control (i.e. suppression) of transposable elements(86, 132). Because many plant species use relatively nonspecific gametic inter-change processes (e.g. wind-assisted or insect-assisted pollination), usually unpro-ductive crosses between different species may be particularly common in plants.Rarely, such a wide cross may give rise to an allopolyploid (102), thereby bringingthe retrotransposon populations from two species into the same resultant species.More commonly in chance matings between two different species, the fertiliza-tion is not productive and/or embryogenesis aborts. In some exceptional cases,
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?510 KUMAR ■ BENNETZEN
however, the two genomes of very different species (e.g. maize and oats) may co-exist in the same nucleus for a few plant generations (2). Eventually, one parentalset of chromosomes may be eliminated, but the transfer of a retrotransposon fromone genome to another may occur in the interim.
CONTRIBUTIONS OF RETROTRANSPOSONS TO THEEVOLUTION OF PLANT GENES AND GENOMES
Gene Mutation
Retrotransposons are involved in generating mutations through insertions near orwithin genes. Both LTR and non-LTR retrotransposons have generated mutationsin plants: some examples for the Ty1-copia, Ty3-gypsy, LINE, and SINE retro-transposons are listed in Table 2. Many retrotransposon insertions within and inclose proximity to genes affect their expression in a negative fashion by decreasingor abolishing transcription of the gene or by detrimental alterations in transcriptprocessing and/or stability. In other cases, however, insertion of retrotransposonsequences within or near a gene has more complex effects on gene expression, in-cluding alterations of temporal and spatial patterns of transcription or the structureof the resultant protein (48, 97, 202).
The splicing of retrotransposon sequences can sometimes restore proper synthe-sis of a protein (187) or a truncated protein (60, 110). Insertion of a retrotransposoncan also result in gross changes in the prematuration of a transcript. Intron removalby splicing, using a cryptic splice site or sites within the element, will sometimesyield a protein product that differs from the wild type, usually having reducedfunctionality. For example, one of three retrotransposon-mutated maizewaxyalle-les,wxG, has an altered pattern of tissue-specific expression, with thirtyfold moreenzymatic activity in pollen than in the endosperm (110). It was demonstrated thatthe increase in Waxy enzymatic activity in pollen as compared to the endospermwas due to tissue-specific differences in RNA processing (i.e. there is approx-imately 30-fold more correctly spliced RNA in pollen than in the endosperm).Therefore, thewxGallele provides a good example of a retrotransposon insertionthat has resulted in tissue-specific alternative splicing.
Many plant genes currently considered wild type contain legacies of previ-ous retrotransposon insertion (Table 2) (202, 203). For example, over 30 genesfrom both dicotyledenous and monocotyledenous plants were found to have an-cient, degenerate retrotransposon insertions in 5′ or 3′ flanking regions, basedon computer-assisted database searches using Ty1-copia group retrotransposonsas query sequences (202, 203). This is probably a major underestimate, partlybecause elements that have greatly diverged will not be recognized. As databasesgrow and search programs become more sophisticated, most (perhaps all) plantgenes will likely be found to contain some transposable element legacy withintheir regulatory and/or structural components.
Three of four genes in the pearbcSfamily have the same retrotransposon se-quence in the promoter region, suggesting that the insertion predated the gene
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 511
duplication events. The combined expression level of three element-associatedrbcSgenes is 30–50% lower than that of the gene that does not contain the retro-transposon insertion, strongly suggesting that the 829-bp insertion sequence har-bors a negative regulator of transcription (203). Thus, it appears that retrotrans-poson insertions have been important contributors to the establishment of novelpatterns of transcription for a wide range of plant genes, over 100 reported sofar. The presence of remnants of retrotransposon sequences within and in closeproximity to genes suggests that these sequences have evolved and have been fixedunder selective pressures for regulatory roles (117).
Compared to the relatively intact retrotransposons found in recent insertion mu-tations or in the intergenic regions between maize genes (56, 65, 76, 158), manyof the retrotransposon sequences found within or close to wild-type genes appearto be truncated or rearranged (202, 203). Older retrotransposon-induced mutationsin maize often contain defective elements (151, 187, 193, 203). The degeneratestructure of these elements is probably due to a combination of their age and theprocess of natural selection. Once inserted near or within a gene, the retrotrans-poson sequences would be selected for a possible positive effect on that gene’sfunction.
Interestingly, the retrotransposons that are found to cause insertional mutationsin plants are usually present in the host genome at relatively low copy numbers.In maize, for example, theBs1retrotransposon is found in one to five copies (78),Hopscotchin two to six copies (203), andMagellanin four to eight copies (151).Similarly, the mutationalTnt1andTto1 families in tobacco andTos17family inrice are also present in relatively low copy numbers [about 1–100 (56, 64, 65, 70)].On the other hand, the abundant maize retrotransposons such asOpie, Grande,Ji, andHuck (each with copy numbers of 10,000 or more) have not been asso-ciated with any characterized maize mutations. Thus, it has been postulated thatthere might be a cause-and-effect relationship between retrotransposon familycopy number and the propensity to insert into genes (158). Very abundant fam-ilies of retrotransposons in maize andBARE-1(about 50,000 copies) in barleyappear to display a target site preference for intergenic regions and for previouslytransposed retrotransposons (158, 174). Alternatively, less fitness selection actingagainst retrotransposons in these regions may be responsible for such distributionpatterns of these high copy retrotransposons. The target site preference and activeselection hypotheses are not mutually exclusive.
Genome Size
Retrotransposons play a major role in determining the size of plant genomes (93).For example, small genomes in plants likeArabidopsis(1C equal to about 130Mbp) might be the consequence of a lack of retrotransposon proliferation (205).On the other hand, large genomes in plants like faba bean and maize (1C equal toabout 128,000 Mbp and 3200 Mbp, respectively) might be the result of success-ful colonization and amplification of retrotransposons (141, 158). Indeed, severaldifferent families of retrotransposons have amplified and attained copy numbers
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?512 KUMAR ■ BENNETZEN
TA
BL
E2
Exa
mpl
esof
the
cons
eque
nces
ofre
trot
rans
poso
nin
sert
ion
inor
near
plan
tgen
es
Gen
eR
etro
tran
spos
onP
lant
Com
men
tsR
efer
ence
Age
neex
pres
sed
Tna
1N
icot
iana
Inse
rtio
nw
ithin
age
neth
atis
155
inre
spon
seto
polli
natio
n(G
ypsy
)al
ata
expr
esse
din
the
styl
ein
resp
onse
topo
llina
tion
orto
uch;
chim
eric
tran
scri
pt;t
his
gene
isno
tess
entia
lfo
rfe
rtili
tyan
dis
loca
ted
abou
t10
kbup
stre
amof
the
S 6-R
Nas
ege
ne
Alc
ohol
dehy
drog
enas
eB
s1M
aize
Inse
rtio
nin
toex
on9
ofth
eA
dh1
gene
;76
,78
(Adh
1-S5
446)
(Cop
ia)
inac
tivat
ion
ofth
ege
ne;m
utan
tph
enot
ype:
ally
lalc
ohol
-res
ista
ntpo
llen
Ant
hocy
anin
Cin
4(L
INE
)M
aize
Inse
rtio
nw
ithin
codi
ngre
gion
;alte
red
162
pigm
entg
ene
stru
ctur
eof
the
A1
gene
;no
mut
ant
(A1)
phen
otyp
e
Caf
feic
acid
B5
Mai
zeIn
sert
ion
near
the
junc
tion
ofth
e3′
192
O-m
ethy
ltran
sfer
ase
(Cop
ia?)
codi
ngre
gion
ofth
eC
OM
Tge
ne(C
OM
T)
intr
on;a
ltere
dtr
ansc
ript
sbo
thin
size
and
amou
nt;e
xhib
itsre
ddis
hbr
own
mut
antp
igm
enta
tion
ofth
ele
afm
idri
b
Cat
alas
ep-
SIN
E-1
Ric
eIn
sert
ion
betw
een
two
exon
s,ex
on-2
and
74ex
on-3
,of
Cat
Age
ne;p
SIN
E-1
elem
ent
beha
ves
asin
tron
inC
atA
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 513
Dm
3C
opia
-lik
eL
ettu
ceIn
sert
ion
ofin
ters
pers
edco
pia-
like
118
Gen
ere
gion
retr
otra
nspo
son
sequ
ence
sin
the
Dm
3ge
nere
gion
that
issu
rrou
nded
bya
fam
ilyof
resi
stan
cege
neca
ndid
ates
(RG
C2)
;ret
rotr
ansp
oson
-med
iate
dre
arra
ngem
ents
Glu
telin
Wis
2-1A
Whe
atIn
sert
ion
with
inco
ding
regi
onof
the
124
(Glu
-1)
(Cop
ia)
Glu
1ge
ne;n
otr
ansc
ript
Her
oge
ne(G
ypsy
)To
mat
oIn
sert
ion
ofin
ters
pers
edgy
psy-
like
—re
gion
retr
otra
nspo
son
sequ
ence
sin
the
regi
onof
Her
oge
nefa
mily
Nitr
ate
redu
ctas
eT
nt1
Toba
cco
Inse
rtio
nsin
toex
on2
&3;
alte
red
56(N
R)
(Cop
ia)
tran
scri
pts
ofth
eN
Rge
ne;N
Rde
ficie
ntm
utan
ts
Nitr
ate
redu
ctas
eT
np2
Nic
otia
naIn
sert
ion
into
exon
;alte
red
tran
scri
pts
55(N
R)
(Cop
ia)
plum
bagi
nifo
lia
ofth
eN
Rge
ne;N
Rde
ficie
ntm
utan
ts
Nod
ule-
spec
ific
gene
Tm
s1M
edic
ago
Inse
rtio
nin
toin
tron
9w
hich
may
188
(Nm
s-25
)(C
opia
)sa
tiva
bere
spon
sibl
efo
rdo
wns
trea
mde
letio
ns;
retr
otra
nspo
son-
med
iate
dre
arra
ngem
ents
Ory
zasa
tiva
Tos1
7R
ice
Inse
rtio
nin
toex
on4;
notr
ansc
ript
;15
9ho
meo
box
(OSH
15)
(Cop
ia)
defe
cts
inin
tern
ode
elon
gatio
n
(con
tinu
ed )
PAP
2T
SB
ellp
eppe
rIn
sert
ion
into
the
first
intr
onof
the
PAP
214
9(S
INE
)ge
ne;s
plic
ing
defe
ctre
sulti
ngin
inco
mpl
etel
ysp
liced
tran
scri
pts;
nom
utan
tphe
noty
pe
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?514 KUMAR ■ BENNETZEN
TA
BL
E2
Gen
eR
etro
tran
spos
onP
lant
Com
men
tsR
efer
ence
Poly
gala
ctur
anas
eP
RE
M-2
Mai
zeIn
sert
ion
atth
eid
entic
al5′
flank
ing
184
(Pol
gal)
(Cop
ia)
posi
tion
ofth
ree
Polg
alge
nes
ofa
mul
tigen
efa
mily
mem
bers
;no
know
nef
fect
son
gene
expr
essi
on
Star
chph
osph
oryl
ase
Tst
1Po
tato
Inse
rtio
nof
the
elem
enti
nth
e20
(Cop
ia)
oppo
site
orie
ntat
ion
toth
atof
the
gene
with
inin
tron
5;un
impa
ired
tran
scri
pt;n
om
utan
tphe
noty
pe
Self
-inc
ompa
tibili
tyM
elm
oth
B.o
lera
cea
Inse
rtio
nw
ithin
tran
scri
bed
regi
ons;
138
(Slo
cus)
(Cop
ia)
notr
ansc
ript
;thi
sin
sert
ion
mut
atio
nan
alys
isre
veal
edth
atfu
nctio
nalS
LA
gene
isno
treq
uire
dfo
rth
eS1
resp
onse
inB
rass
ica
Soyb
ean
fung
alT
gmr
Soyb
ean
Inse
rtio
nin
the
flank
ing
regi
onof
the
12re
sist
ance
(Cop
ia)
Rps
1-k
gene
;no
know
nm
utan
t(R
ps1-
K)
phen
otyp
es
(con
tinu
ed )
Wax
y-M
(wx-
M)
Mag
ella
nM
aize
Inse
rtio
nw
ithin
wx
exon
3ju
st2
bp15
1lo
cus
(Gyp
sy)
dow
nstr
eam
ofth
ein
tron
2/ex
on3
splic
eju
nctio
n;w
axy
mut
ant
phen
otyp
e.
a
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 515
Wx
Ston
orSt
onor
Mai
zeIn
sert
ion
with
inw
xex
on11
0lo
cus
(Cop
ia)
ofth
ein
tron
5/ex
on6
splic
eju
nctio
n;nu
llfo
xw
xex
pres
sion
inpo
llen;
reta
ins
norm
alex
pres
sion
inth
een
dosp
erm
Xa2
1R
etro
fitR
ice
Inse
rtio
nco
ding
regi
onof
171
(are
cept
or-l
ike
(Cop
ia)
Xa2
1fa
mily
mem
ber
Dle
ads
toki
nase
)a
trun
cate
dpr
otei
n w
ith
Xa2
1ETr
unca
tor
Ric
eIn
sert
ion
with
inco
ding
regi
onof
171,
197
(Cop
ia)
Xa2
1fa
mily
mem
ber
Ele
ads
toa
trun
cate
dpr
otei
n w
ith n
o ac
tivity
;ge
nera
tion
ofa
rece
ptor
-lik
eO
RF
stru
ctur
ally
sim
ilar
toth
eto
mat
ofu
ngal
gene
Cf-
9
27kD
aze
inZ
eon-
1M
aize
Inse
rtio
nin
tosi
te1.
1kb
upst
ream
of72
(mai
zest
orag
e(G
ypsy
)th
eA
TG
stan
dard
27kD
aze
inge
ne;
prot
ein)
mito
ticre
arra
ngem
ent;
redu
ced
leve
lsof
stor
age
prot
ein
inm
atur
ese
ed
dow
nstr
eam
Din
to th
e
res
ista
nce
activ
ity; t
hepr
edic
ted
enc
odes
a n
ew c
lass
of
LR
R r
esis
tanc
e ge
ne p
rodu
cts,
lack
tran
smem
bran
e an
d a
kina
sedo
mai
n.
pred
icte
d
and
a ki
nase
sim
ilar
to th
e to
mat
o ge
ne P
TO
.
prot
ein
dise
ase
ing
a
part
ial
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?516 KUMAR ■ BENNETZEN
greater than 10,000 in the maize genome. The total copy number of all types ofmaize retrotransposons is estimated to be approximately 300,000, thereby compris-ing about 50–80% of the maize nuclear genome (156). An estimated divergencetime between maize and sorghum is about 15–20 million years ago, and bothhave 10 chromosome pairs, but the maize nuclear genome is 3–4 times longer thanthe sorghum genome (3). Molecular hybridization and sequence analysis data haverevealed that these two grass genomes are very similar in the content and order oftheir genes (6, 73, 122). However, the nearly two dozen retrotransposons that makeup over 70% of the region surrounding the maizeadh1gene are all absent fromthe orthologousadh region in sorghum (4, 156) and most are not detected in thesorghum genome by gel blot hybridization (9). Hence, these results indicate thatthe increase in maize genome size was largely due to an extensive proliferationof retrotransposons. This result also suggests that these elements inserted afterthe divergence of maize and sorghum, a suggestion that was confirmed by datingexperiments that indicated these retrotransposons had all arrived and/or amplifiedin the maize genome within the last 2 to 6 million years (157).
It is possible that analogous patterns of retrotransposon distribution and orga-nization will be found in the genomes of other plant and animal species that havehigh retrotransposon copy numbers. Genomic sequencing in large genome eukary-otes continues to be limited primarily to gene-rich regions, so we do not know thearrangement of DNAs in the largest genomic component of any large genome.However, preliminary sequence analysis of the DNA adjacent toArabidopsiscen-tromeres has uncovered regions rich in LTR retrotransposons, arranged in a nestedorganization like that observed in maize (R Martienssen, R McCombie & R Wilson,personal communication, http://www. cshl.org/protarab).
Although retrotransposons have played a major role in the expansion of plantgenomes (8, 93), it is also likely that many retrotransposon sequences have beeneliminated by various mechanisms, such as unequal recombination between theLTRs of an LTR retrotransposon (to generate solo LTRs) (25, 158, 165, 191) orother types of deletion (145). Nonetheless, the dispersed organization of retro-transposons makes it unlikely that known genomic elimination processes suchas large deletions can effectively remove a significant amount of the genomicsequences within a relatively short evolutionary time period without having dele-terious affects on the survival of the host by concurrent elimination of the genesthat are interspersed with retrotransposons (8, 95).
Genome Rearrangement
Retrotransposons can rearrange genomes with or without transpositional compe-tence. At least one retrotransposon,Bs1, has acquired a portion of another gene, andthen amplified it by transposition throughout the maize genome (18, 77, 78). Thisis highly significant, as it shows how genes or parts of genes might be amplifiedand dispersed in genomes. Similarly, the rare action of retrotransposon reversetranscription and integration functions on normal cellular mRNAs has created
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 517
intronless pseudogenes in plants (37, 106). These amplified gene sequences canserve as the raw material for the evolution of new genes.
Because of their high copy numbers and dispersal throughout plant genomes,retrotransposons could serve as sites of unequal or ectopic recombination. Thelegacies of such unequal recombinations between the LTRs of a single retro-transposon have been noted many times in plants by the existence of solo LTRs(25, 158, 165, 191). Even in methylated, presumably heterochromatic regionswhere these elements often accumulate, rare solo LTR are observed (158). Ectopicrecombination between retrotransposons of the same family that are in direct ori-entation on the same chromosome can cause reciprocal duplications and deletions,whereas unequal recombinations between elements on the same chromosome thatare in opposite orientation will cause a chromosomal inversion of the sequencesbetween the two retrotransposons. Ectopic recombination between two elementson different chromosomes can cause reciprocal translocations. These types ofretrotransposon-mediated genome rearrangement have been observed in yeast andDrosophilaas products of unequal recombination (104, 204). In plants, however,the structural and mechanistic foundations of such chromosomal rearrangementshave not yet been studied in great numbers, so we do not yet know how frequentlythey can be attributed to retrotransposon involvement.
Roles for Retrotransposons?
In most eukaryotic organisms, centromeres, telomeres, and other heterochromaticregions of chromosomes contain large numbers of retrotransposons, making itpossible that retrotransposons have acquired some structural and/or functionalroles in these heterochromatic regions in plants. Indeed, a role ofHeT-A andTART retrotransposons in buffering the shortening of chromosome ends has beenclearly demonstrated inDrosophila(137).
The exceptional conservation of a particular family of retrotransposons in allgrass centromeric regions suggests that these elements are involved in centromerefunction (121, 150). Similarly, the presence of retrotransposon fragments withinthe regulatory regions of genes indicates that they have evolved to a form that isinvolved in specific gene regulation. Because LTR retrotransposons, for instance,carry a promoter within each LTR, it is not surprising that their insertions shouldsometimes cause new patterns of gene expression or at least have the potential toevolve into new patterns of gene expression (48, 202). It was partly this capacity ofmobile DNAs to serve as controlling elements that caused McClintock to believethat their major role in evolution was to rearrange genomes under stresses so severethat only a rare individual with a totally changed genome might survive (116).Although this model does not appear to make evolutionary sense, it is certain thatthe presence of transposable elements within a genome will allow them to be usedby their hosts in whatever way might be beneficial, even if the elements themselvesare present primarily for selfish or parasitic reasons (34, 134). Future studies willindicate how plant genomes have evolved to utilize retrotransposon sequences.
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?518 KUMAR ■ BENNETZEN
RETROTRANSPOSONS AS GENETIC TOOLS
Molecular Markers for Phylogenetic, Biodiversity,and Genetic Linkage Analyses
There are several advantages in using retrotransposon sequences as molecularmarkers. They are ubiquitous, present in high copy numbers as highly heteroge-neous populations, are widely dispersed on chromosomes, and show insertionalpolymorphism both within and between species in plants (95). Furthermore, activeretrotransposons will produce new insertions in the genome, leading to polymor-phism. The new insertions may then be detected and used to temporally orderinsertion events in a lineage, helping to establish phylogenies. These genetic prop-erties have recently allowed retrotransposons to be exploited as DNA markersto study biodiversity in maize, pea, and barley (41, 81, 152) and to generate ge-netic linkage maps in barley and pea (41, 95, 198, 199, 202). Furthermore, becausemany retrotransposons are widely distributed within the euchromatin domains ofchromosomes, it should be possible to generate markers linked to agronomicallyimportant genes. Indeed, retrotransposonTnd-1 has provided a marker that islinked to black root resistance in tobacco (85).
One of the most useful retrotransposon-based marker systems in revealing largenumbers of highly polymorphic markers is sequence-specific amplification poly-morphism (SSAP). SSAP is a multiplex amplified fragment length polymorphisms(AFLP)-like technique that displays individual retrotransposon insertion as bandson a sequencing gel. Moreover, SSAP-based markers appear to be better for esti-mating phylogenetic relationship in plants compared with the conventional AFLP-based makers, because one of the primers is based on specific retrotransposonsequences (41). Furthermore, a multi-retrotransposon approach has been recentlyused to estimate phylogenetic relationships within species and between speciesin legume (143) and cereal (57) plants. This approach is particularly informativebecause each element has a unique transpositional history. For example, retroele-ments that have transposed in the recent past should be extremely polymorphicwithin a species, and these can be used for genetic linkage and intraspecific geneticdiversity studies. Some that were active only a million years ago or more shouldbe useful for determining relationships between species or even between genera.
In the past, a major hurdle in using the retrotransposon-based SSAP method wasthe limited numbers of LTR sequences available. The recent development of a newtechnique for rapid isolation of plant Ty1-copia retrotransposon LTR sequencesfor molecular studies (143) has made the application of a retrotransposon-basedSSAP method highly accessible to all plant species. The SSAP method should alsobe applicable for determining the integration patterns of specific elements in thehost genome by subcloning, sequencing, and analyzing SSAP bands.
A PCR-based approach to detect individual retrotransposon insertions has beendeveloped using primers derived from the element and its flanking DNA (47, 51).This method produces a dominant marker system, where the different allelic states
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 519
(i.e. presence and absence of the retrotransposon insertion) at a locus can be re-vealed. A major advantage of this retrotransposon-based insertion polymorphism(RBIP) method is that it could be fully automated for high throughput markeranalysis (47). Sequence analysis of clones from an enriched barley microsatellitelibrary has revealed that retrotransposons sequences are intimately associated withmicrosatellites (153). This property has been exploited to develop a molecularmarker method, REMAP (Retrotransposon-Microsatellite Amplified Polymorp-phism), that can be used to distinguish between barley varieties and to producefingerprint patterns for species across a genus (81).
Non-LTR retrotransposons also have been used as molecular markers to studygenetic diversity at the species and genus levels (63, 126). For example, ricep-SINE1elements have been used to demonstrate thatpSINE-1-r2transposed tothewx locus after the divergence ofOryza rufipogonandO. sativafrom theOryzaancestor that carried the AA genome. In contrast,pSINE-1-r1transposed at anearlier time to another position in thewx locus before the divergence of variousrice species (63). Also,BrassicaSINE elements have been used to study geneticdiversity inCruciferaespecies and to determine the extent to which introgressionhas occurred between cultivated and wild species inBrassica(103; JM Deragon,unpublished data).
Gene Tagging and Functional Analysis of Genes
Some retrotransposons have features that make them ideal genetic tools for genetagging in plants (65). These include (a) retrotransposon-mediated insertion muta-tions that are stable because they transpose by a replicative mode, (b) transpositiontarget sites that are unlinked to the site of the original copy, making it relativelyeasy to generate a large collection of random insertions for saturation mutagenesis,(c) transposition that can be regulated by abiotic and biotic stress conditions,(d) high mutagenicity due to preferential transposition into genes, (e) low copynumbers, thereby facilitating the identification of the retrotransposon insertion re-sponsible for a specific mutation, and (f ) endogenous retrotransposons that areubiquitous in plants and may be active in many species. In rice, the endogenousTos17retrotransposon meets all the conditions mentioned above for its use in tag-ging genes.Tos17is highly active under tissue culture conditions and preferentiallyinserts into or near genes (65). It has been used to generate a large collection ofrandom insertions for saturation mutagenesis and has been used to clone a numberof genes in rice (65; H Hirochika, unpublished data). Additionally,Tos17has beenused in a reverse genetic approach to identify null mutant lines for theOSH15gene(Oryza sativahomeobox; aknotted-type homeobox gene), thereby demonstratingthatOSH15has a role in rice internode development (159).
Evaluation of Somaclonal Variations
Several techniques for genetic manipulation and vegetative propagation of plantsare dependent upon in vitro plant regeneration systems. However, in vitro
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?520 KUMAR ■ BENNETZEN
manipulation processes are prone to somaclonal variations, which may be highlyundesirable (70). Somaclonal variation is the collective term given to the highfrequency of mutations encountered among plants regenerated from cell and tis-sue cultures under in vitro conditions. There is now good evidence to show thata significant percentage of mutations among in vitro manipulated plants are dueto the induced activation of retrotransposons during cell and tissue culture condi-tions. For example, over 10% of regenerated rice plants possessed retrotransposon-induced mutations (70; H Hirochika, unpublished data). The retrotransposon-based molecular markers described earlier provide an excellent tool for evaluatingretrotransposon-induced mutations among in vitro manipulated plants. Addition-ally, a better understanding of the regulation of expression and transposition ofretrotransposons should give some clues to avoiding somaclonal variation amongin vitro manipulated plants.
CONCLUSIONS AND FUTURE PROSPECTS
We now know that plant retrotransposons are responsible not only for generatingmutations in plant genomes but also for significantly increasing the size of manyplant genomes. Their replicative modes of transposition have equipped them withthe ability to rapidly proliferate within their hosts, allowing them to become amajor component of higher plant genomes. Retrotransposons are also one of themost fluid of genomic components, varying greatly in copy number, genomiclocalization, and sequence structure over relatively short evolutionary times. Theseproperties make retrotransposons one of the major forces influencing the structureof plant genomes. Although their initial and major continuing motivation appearsto be selfish and/or parasitic, retrotransposons will continue to be used wheneverpossible by their hosts.
Mutations caused by retrotransposon insertions within or in close proximity togenes can result in gene inactivation or alterations of the expression patterns ofthe genes, or the structure of the encoded proteins. Many gene promoters con-tain fragments of retrotransposons that now contribute to that gene’s regulation.Ectopic recombination between these mobile sites of homology could amplify ordecrease genome size and gene family number. However, it remains to be provenwhether any retrotransposon-induced changes have been positively selected innatural populations.
For the future, many questions remain to be answered. These can be conve-niently separated into four categories: origins, specificities, effects, and uses.There is no definitive information on the origin of any plant retrotransposon fam-ily. Are some LTR retrotransposons actually defective insect retroviruses that havearrived in plant genomes via horizontal transfer? Or, do horizontal transfers ofintracellular elements occur through wide crosses or by rare incorporation of se-quences from a pathogen or parasite? Perhaps horizontal transfers do not happen atall, or at least do not account for many of the elements. Their differential presenceand/or copy numbers may be determined by whether they have amplified or have
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 521
been lost in one species or another. At a more detailed level, how do retroelementswithin a family change; how fast and by what mechanisms? These will be diffi-cult questions to answer, partly because their great diversity, high copy numbers,unpredictable amplification potential, and rapid evolutionary change can easilyobscure simple patterns of element relatedness or origin. However, these prob-lems will be partly offset by the huge amounts of genome sequence data that willbecome available in upcoming years.
Numerous studies have documented specificities for retrotransposon expres-sion and insertion. Several groups have begun characterizing the regulation ofLTR promoters by transacting factors associated with host developmental andstress response pathways. Studies of epigenetic gene regulation, including thosepertinent to mobile DNAs and transgenes, also will expand our understanding ofretrotransposon biology. We have not yet identified any factors in plants that mightdetermine the different insertion specificities of different retrotransposon families.We also do not know the degree to which a retrotransposon can use transacting fac-tors from other retroelements for replication and insertion. What is the relationshipbetween an element’s transcription and its potential for transposition?
The effects of retrotransposons on gene and genome structure, including genomesize, have been well documented. However, how often do retrotransposons am-plify to high copy numbers? What are the processes that remove retrotransposonsfrom a genome, and how rapidly do they act? How often do retrotransposonsinsert into or near genes in standard plant populations, and how often/rapidly dothese sequences evolve into a form that is utilized by the affected gene? Howoften have retrotransposons been used for non-genic tasks, such as contributinga functional component to centromeres? If retrotransposons are used for variousroles in genome structure or function, how did the elements change to permit thisuse, and how does their use differ between species?
Finally, we are just beginning to see some of the ways that plant retrotransposonscan be used for the study and improvement of plants. Their outstanding potentialas genetic tools for plant genome analysis, linkage mapping, phylogeny and ge-netic biodiversity studies, gene tagging, functional analysis of genes, evaluationof somaclonal variants, and in gene transfer processes appear to be most promis-ing. Understanding the possible horizontal transfer of retrotransposons could alsobe very valuable, perhaps to set an upper limit for the rate at which transgenesmight escape from genetically engineered crops. Despite comprising the majorityof many plant genomes and their many roles in the mutation and evolution of genesand genomes, only a handful of laboratories investigate these elements. Much fu-ture work is needed, and the tools are all in hand, so we can expect continuing greatgains in our understanding of plant retrotransposons for the foreseeable future.
ACKNOWLEDGMENTS
We thank all of our colleagues for providing their published and unpublishedpapers, especially Michael Ashburner, Andreas Bachmir, Eugene Berezikov, BenBowen, Jean-Marc Deragon, Tom Eickbush, Noel Ellis, David Finnegan, Andy
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?522 KUMAR ■ BENNETZEN
Flavell, Marie-Angele Grandbastien, Pat Heslop-Harrison, Hirohiko Hirochika,Howard Laten, Rob Martienssen, John McDonald, Norihiro Okada, Steve Pearce,Ingo Schubert, Alan Schulman, Daniel Voytas, and Susan Wessler. We are gratefulto Sue and Hiro for their comments on the manuscript, and to Pat and Ingo forthe LINE, SINE, andgypsypictures, respectively. AK acknowledges the ScottishOffice Agricultural, Environmental and Food Department and EC BiotechnologyFramework IV for financial support. JLB acknowledges financial support from theUSDA CSREES program (97-35300-4594).
Visit the Annual Reviews home page at http://www.AnnualReviews.org
LITERATURE CITED
1. Ananiev EV, Phillips RL, Rines HW. 1998.Complex structure of knob DNA on maizechromosome 9. Retrotransposon invasioninto heterochromatin.Genetics149:2025–37
2. Ananiev EV, Riera-Lizarazu O, Rines HW,Phillips RL. 1997. Oat-maize chromosomeaddition lines: a new system for mapping themaize genome.Proc. Natl. Acad. Sci. USA94:3524–28
3. Arumuganathan K, Earle ED. 1991. Nu-clear DNA content of some importantplant species.Plant Mol. Biol. Rep.9:208–19
4. Avramova Z, Tikhonov A, SanMiguel P, JinYK, Lui C, et al. 1996. Gene identifica-tion in a complex chromosomal continuumby local genomic cross-referencing.Plant J.10:1163–68
5. Bennetzen JL. 1996. The contributions ofretroelements to plant genome organisation,function and evolution.Trends Microbiol.4:347–53
6. Bennetzen JL. 1998. The structure and evo-lution of angiosperm nuclear genomes.Curr.Opin. Plant Biol.1:103–8
7. Bennetzen JL. 1999. Transposable elementcontributions to plant gene and genome evo-lution. Plant Mol. Biol.In press
8. Bennetzen JL, Kellogg EA. 1997. Do plantshave a one-way ticket to genomic obesity?Plant Cell9:1509–14
9. Bennetzen JL, Schrick K, Springer PS,
Brown WE, SanMiguel P. 1994. Activemaize genes are unmodified and flanked bydiverse classes of modified, highly repetitiveDNA. Genome37:565–76
10. Bennetzen JL, Springer PS, Cresse AD,Hendrickx M. 1993. Specificity and regu-lation of theMutator transposable elementssystem in maize.Crit. Rev. Plant Sci.12:57–95
11. Bevan M, Bancroft I, Bent E, Love K, Good-man H, et al. 1998. Analysis of 1.9 Mb ofcontiguous sequence from chromosome 4of Arabidopsis thaliana. Nature 391:485–88
12. Bhattacharyya MK, Gonzales RA, Kraft M,Buzzell RI. 1997. Acopia-like retrotranspo-sonTgmrclosely linked to theRps1-k allelethat confers race-specific resistance of soy-bean toPhytophthora sojae.Plant Mol. Biol.34:255–64
13. Bingham PM, Zachar Z. 1989. Retrotrans-posons and FB elements fromDrosophilamelanogaster. In Mobile DNA, ed. by DEBerg, MM Howe, pp. 485–502. Washing-ton, DC: Am. Soc. Microbiol.
14. Blanchard JL, Schmidt GW. 1995. Pervasivemigration of organellar DNA to the nucleusin plants.J. Mol. Biol.41:397–406
15. Boeke JD. 1997. LINEs and Alus—thepolyA connection.Nat. Genet.16:6–7
16. Boeke JD, Corces VG. 1989. Transcrip-tion and reverse-transcription of retrotrans-posons.Annu. Rev. Microbiol.43:403–34
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 523
17. Brandes A, Heslop-Harrison JS, KammA, Kubis S, Doudrick RL, Schmidt T.1997. Comparative analysis of the chro-mosomal and genomic organization ofTy1-copia-like retrotransposons in pteri-dophytes, gymnosperms and angiosperms.Plant Mol. Biol.33:11–21
18. Bureau TE, White SE, Wessler SR. 1994.Transduction of a cellular genes by a plantretroelement.Cell 77:479–80
19. Bureau TE, Ronald PC, Wessler SR. 1996.A computer-based systematic survey re-veals the predominance of small inverted-repeat elements in wild-type rice genes.Proc. Natl. Acad. Sci. USA93:8524–29
20. Camirand A, St-Pierre B, Morineau C,Brisson N. 1990. Occurrence of acopia-like transposable element in one of the in-trons of the potato starch phosphorylasegene.Mol. Gen. Genet.224:33–39
21. Carrington JC, Kasschau K, Mahajan SK,Schaad MC. 1996. Cell-to-cell and long-distance transport of viruses in plants.Plant Cell8:1669–81
22. Casacuberta JM, Grandbastien MA. 1993.Characterisations of LTR sequences in-volved in the protoplast specific expressionof the tobaccoTnt1 retrotransposon.Nu-cleic Acid Res.21:2087–93
23. Casacuberta JM, Vernhettes S, Audeon C,Grandbastien MA. 1997. Quasispecies inretrotransporons: Sequence variability inTnt1evolution.Genetica100:109–17
24. Chavanne F, Zhang DX, Liaud MF, CerffR. 1998. Structure and evolution ofCy-clops: a novel giant retrotransposon of theTy3/Gypsyfamily highly amplified in peaand other legume species.Plant Mol. Biol.37:363–75
25. Chen M, SanMiguel P, Bennetzen JL.1998. Sequence organization and conser-vation in sh2/a1-homologous regions ofsorghum and rice.Genetics148:435–43
26. Cost GJ, Boeke JD. 1998. Targeting of hu-man retrotransposon integration is directedby the specificity of the L1 endonuclease
for regions of unusual DNA structure.Bio-chemistry37:18081–93
27. Craig NL. 1998. Target site selection intransposition.Annu. Rev. Biochem.66:437–74
28. Csink AK, McDonald JF. 1995. Analysis ofCopia sequences variation within and be-tweenDrosophilaspecies.Mol. Biol. Evol.12:83–93
29. Curcio MJ, Garfinkel DJ. 1999. New linesof host defense: inhibition of Ty1 retro-transposon by Fus3p and NER/TFIIH.Trend Genet.15:43–45
30. Danilevskaya ON, Arkhipova JR, TraverseKL, Pardue ML. 1997. Promoting in tan-dem: the promoter for telomere transposonHeT-Aand implications for the evolution ofretroviral LTRs.Cell 88:647–55
31. Deragon JM, Gilbert N, Rouquet L, LenoirA, Arnaud P, Picard G. 1996. A transcrip-tional analysis of the S1Bn (Brassica na-pus) family of SINE retroposons.PlantMol. Biol. 32:869–78
32. Deragon JM, Landry BS, Pelissier T, Tu-tois S, Tourmente S, Picard G. 1994. Ananalysis of retroposition in plants based ona family of SINEs fromBrassica napus. J.Mol. Evol.39:378–86
33. Ding D, Lipshitz H. 1994. Spatially regu-lated expression of retroviruses-like trans-posons duringDrosophila melanogasterembryogenesis.Genet. Res.64:167–81
34. Doolittle WF, Sapienza C. 1980. Self-ish genes, the phenotype paradigm andgenome evolution.Nature284:601–3
35. Dooner HK, Belachew A. 1989. Transpo-sition pattern of the maize elementAcfromthebz-m2(Ac) allele.Genetics122:447–57
36. Dorer D, Henikoff S. 1994. Expansionsof transgene repeats cause heterochro-matin formation and gene silencing inDrosophila. Cell 77:993–1002
37. Drouin G, Dover GA. 1987. A plant pro-cessed pseudogene.Nature328:557–58
38. Edwards KJ, Veuskens J, Rawles H, DalyA, Bennetzen JL. 1996. Characterization offour dispersed repetitive DNA sequences in
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?524 KUMAR ■ BENNETZEN
Zea maysand their use in constructing con-tiguous DNA fragments using YAC clones.Genome39:811–17
39. Eickbush TH. 1992. Transposing withoutend: the non-LTR retrotransposable ele-ments.New Biol.4:430–40
40. Eickbush TH. 1997. Telomerase and retro-transposons: Which came first?Science277:911–12
41. Ellis THN, Poyser SJ, Knox MR, VershininAV, Ambrose MJ. 1998. Polymorphism ofinsertion sites of Ty1-copia class retro-transposons and its use for linkage and di-versity analysis in pea.Mol. Gen. Genet.260:9–19
42. Feuchter A, Mager D. 1990. Functionalheterogeneity of a large family of humanLTR-like promoters and enhancers.Nu-cleic Acids Res.18:1261–70
43. Feuerbach F, Drouaud J, Lucas H. 1997.Retrovirus-like end processing of the to-baccoTnt1retrotransposon linear interme-diates of replication.J. Virol. 71:4005–15
44. Flavell RB, Bennet MD, Smith JB, SmithDB. 1974. Genome size and proportionof repeated nucleotide sequence DNA inplants.Biochim. Genet.12:257–69
45. Flavell AJ, Smith DB, Kumar A. 1992.Extreme heterogeneity of Ty1-copiagroupretrotransposons in plants.Mol. Gen.Genet.231:233–42
46. Flavell AJ, Dunbar E, Anderson R, PearceSR, Hartley R, Kumar A. 1992. Ty1-copiagroup retrotransposons are ubiquitous andheterogeneous in higher plants.NucleicAcids Res.20:3639–44
47. Flavell AJ, Knox MR, Pearce SR, El-lis THN. 1998. Retrotransposon-based in-sertion polymorphisms (RBIP) for highthroughput marker analysis.Plant J. 16:643–50
48. Flavell AJ, Pearce SR, Kumar A. 1994.Plant transposable elements and thegenome.Curr. Opin. Genet. Dev.4:838–44
49. Gabriel A, Willems M, Mules EH, BoekeJD. 1996. Replication infidelity duringa single cycle ofTy1 retrotransposition.
Proc. Natl. Acad. Sci. USA93:7767–7150. Gai X, Voytas DF. 1998. A single amino
acid change in the yeast retrotransposonTy5 abolishes targeting to silent chromatin.Mol. Cell 1:1051–55
51. Garber K, Bilic I, Pusch O, Tohme J,Bachmair A, et al. 1999. TheTpv2familyof retrotransposons ofPhaseolus vulgaris:structure, integration characteristics, anduse for genotype classification.Plant Mol.Biol. 39:797–807
52. Gesteland RF, Atkin JF. 1993.The RNAWorld. New York: Cold Spring HarborPress
53. Goubely C, Arnaud P, Tatout C, Heslop-Harrison JS, Deragon J-M. 1999. S1 SINEretroposons are methylated at symmetricaland non-symmetrical positions inBrassicanapus: identification of preferred targetsite for asymmetrical methylation.J. Mol.Biol. 39:243–55
54. Grandbastien M-A. 1992. Retroelementsin higher-plants.Trends Genet.8:103–8
55. Grandbastien M-A. 1998. Activation ofplant retrotransposons under stress condi-tions.Trends Plant Sci.3:181–87
56. Grandbastien M-A, Spielmann A, CabocheM. 1989. Tnt1, a mobile retroviral-liketransposable element of tobacco isolatedby plant cell genetics.Nature337:376–80
57. Gribbon BM, Pearce SR, Kalendar R,Schulman A, Paulin L, et al. 1999. Phy-logeny and transpositional activity of Ty1-copia group retrotransposons in cerealgenomes.Mol. Gen. Genet.261:883–91
58. Grunstein M. 1998. Yeast heterochro-matin: regulation of its assembly and in-heritance by histones.Cell 93:325–28
59. Hake S, Walbot V. 1980. The genome ofZea mays, its organization and homologyto related grasses.Chromosoma79:251–70
60. Henikoff S, Comai L. 1998. Trans-sensingeffects.Cell 93:329–32
61. Heslop-Harrison JS, Brandes A, Taketa S,Schmidt T, Vershinin AV, et al. 1997. TheChromosomaldistributions of Ty1-copiagroup retrotransposable elements in higher
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 525
plants and their implications for genomeevolution.Genetica100:197–204
62. Higashiyama T, Noutoshi Y, Fujie M, Ya-mada T. 1995. Zepp, a LINE-like retro-transposon accumulated in theChlorellatelomeric region.EMBO J.16:3715–23
63. Hirano HY, Mochizuki K, Umeda M, Oht-subo H, Ohtsubo E, Sano Y. 1994. Retro-transposition of a plant SINE into thewxlocus during evolution of rice.J. Mol. Evol.38:132–37
64. Hirochika H. 1993. Activation of to-bacco retrotransposons during tissue-cul-ture.EMBO J.12:2521–28
65. Hirochika H. 1997. Retrotransposons ofrice: their regulation and use for genomeanalysis.Plant Mol. Biol.35:231–40
66. Hirochika H, Fukuchi A, Kikuchi F. 1992.Retrotransposon families in rice.Mol. Gen.Genet.233:209–16
67. Hirochika H, Kakutani T, Okamoto H.1999. Silencing of the tobacco retrotrans-posonTto1in Arabidopsisand reactivationby DNA hypomethylation mutation.PlantCell. In press
68. Hirochika H, Otsuki H. 1995. Extra chro-mosomal circular forms of the tobaccoretrotransposonTto1. Gene165:229–32
69. Hirochika H, Otsuki H, Yoshikawa M,Otsuki Y, Sugimoto K, Takeda S. 1996.Autonomous transposition of the tobaccoretrotransposonTto1 in rice. Plant Cell8:725–34
70. Hirochika H, Sugimoto K, Otsuki Y, Tsug-awa H, Kanda M. 1996. Retrotransposonsof rice involved in mutations induced bytissue culture.Proc. Natl. Acad. Sci. USA93:7783–88
71. Hollick JB, Dorweiler JE, Chandler VL.1997. Paramutation and related allelic in-teractions.Trends Genet.13:302–8
72. Hu W, Das OP, Messing J. 1995.Zeon-1,a member of a new maize retrotransposonfamily. Mol. Gen. Genet.248:471–80
73. Hulbert SH, Richter TE, Axtell JD, Bennet-zen JL. 1990. Genetic mapping and charac-terization of sorghum and related crops by
means of maize DNA probes.Proc. Natl.Acad. Sci. USA87:4251–55
74. Iwamoto M, Nagashima H, Nagamine T,Higo H, Higo K. 1999.p-SINE1–like intronof theCatAcatalase homologs and phylo-genetic relationships among AA-genomeOryza and related species.Theor. Appl.Genet.98:853–61
75. Jensen S, Gassama MP, Heidmann T. 1999.Taming of retrotransposable elements byhomology-dependent gene silencing.Nat.Genet.21:209–12
76. Jin Y-K, Bennetzen JL. 1989. Struc-ture and coding properties ofBs1, amaize retrovirus-like transposon.Proc.Natl. Acad. Sci. USA86:6235–39
77. Jin Y-K, Bennetzen JL. 1994. Integrationand non-random mutation of a plasmamembrane proton ATPase gene fragmentwithin theBs1retroelement of maize.PlantCell 6:1177–86
78. Johns MA, Mottinger J, Freeling M. 1985.A low copy number,copia-like transposonin maize.EMBO J.4:1093–102
79. Jordan IK, McDonald JF. 1998. Evidencefor the role of recombination in regulatoryevolution ofSaccharomyces cerevisiaeTyelements.J. Mol. Evol.47:14–20
80. Jordan IK, McDonald JF. 1999. Tempo andmode of Ty element evolution inSaccha-romyces cerevisiae. Genetics151:1341–51
80a. Jordan IK, Matyunina LV, McDonald JF.1999. Evidence for the recent horizontaltransfer of an LTR retrotransposon.Proc.Natl. Acad. Sci. USA66: In press
81. Kalendar R, Grob T, Regina M, SuoniemiA, Schulman A. 1999. IRAP and REMAP:Two new retrotransposon-based DNA fin-gerprinting techniques.Theor. Appl. Genet.98:704–11
82. Kamm A, Doudrick RL, Heslop-HarrisonJS, Schmidt T. 1996. The genomic andphysical organisation of Ty1-copia-like se-quences as a component of large genomesin Pinus elliottii var. elliottii and othergymnosperms.Proc. Natl. Acad. Sci. USA93:2708–13
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?526 KUMAR ■ BENNETZEN
83. Katsiotis A, Schmidt T, Heslop-HarrisonJS. 1996. Chromosomal and genomic or-ganisation of Ty1-copia-like retrotranspo-son sequences in the genusAvena. Genome39:410–17
84. Kempken F, Kuck U. 1998. Transposons infilamentous fungi-facts and perspectives.BioEssays20:652–59
85. Kenward KD, Bai D, Ban MR, BrandleJE. 1999. Isolation and characterisation ofTnd-1, a retrotransposon marker linked toblack root resistance in tobacco.Theor.Appl. Genet.98:387–95
86. Kidwell MG, Lisch D. 1997. Transposableelements as sources of variation in ani-mals and plants.Proc. Natl. Acad. Sci. USA94:7704–11
87. Kim JM, Vanguri S, Boeke JD, Gabriel A,Voytas DF. 1998. Transposable elementsand genome organisation: A compara-tive survey of retrotransposons revealedby the completeSaccharomyces cerevisiaegenome sequence.Genome Res.8:464–78
88. Kirchner J, Connolly CM, Sandmeyer SB1995. Requirement of RNA-polymerase-III transcription factors forin vitro posi-tion-specific integration of a retrovirus-likeelement.Science267:1488–91
89. Konieczny A, Voytas DF, Cummings MP,Ausubel FM. 1991. A superfamily ofAra-bidopsis thalianaretrotransposons.Genet-ics127:801–9
90. Kossack DS, Kinlaw CS. 1999. IFG,a gypsy-like retrotransposon inPinus(Pinaceae), has an extensive history inpines.Plant Mol. Biol.39:417–26
91. Kubis SE, Heslop-Harrison JS, Desel C,Schmidt T. 1998. The genomic organisa-tion of non-LTR retrotransposons (LINEs)from threeBetaspecies and five other an-giosperms.Plant Mol. Biol.36:821–31
92. Kuipers AGJ, Heslop-Harrison JS, Jacob-sen E. 1998. Characterisation and phys-ical localisation of Ty1-copia-like retro-transposons in fourAlstroemeriaspecies.Genome41:357–67
93. Kumar A. 1996. The adventures of the Ty1-copiagroup of retrotransposons in plants.Trends Genet.12:41–43
94. Kumar A. 1998. The evolution of plantretroviruses: moving to green pasture.Trends Plant Sci.3:371–74
95. Kumar A, Pearce SR, McLean K, Harri-son G, Heslop-Harrison JS, et al. 1997.The Ty1-copia group of retrotransposonsin plants: genomic organisation, evolution,and use as molecular markers.Genetica100:205–17
96. Kumekawa N, Ohtsubo H, Horiuchi T, Oht-subo E. 1999. Identification and charac-terisation of novel retrotransposons of thegypsytype in rice.Mol. Gen. Genet.260:593–602
97. Kunze R, Saedler H, Lonnig WE. 1997.Plant transposable elements.Adv. Bot. Res.27:331–470
98. Kurose K, Hata K, Hattori M, Sakaki Y.1995. RNA polymerase III dependence ofthe human L1 promoter and possible par-ticipation of the RNA polymerase II factorYY1 in the RNA polymerase III transcrip-tion system.Nucleic Acids Res.23:3704–9
99. Labrador M, Corces VG. 1997. Transpos-able element-host interactions: regulationof insertion and excision.Annu. Rev. Genet.31:381–404
100. Laten HM, Majumdar A, Gaucher EA.1998. SIRE-1, a copia/Ty1-like retroele-ment from soybean, encodes a retroviralenvelope-like protein.Proc. Natl. Acad.Sci. USA95:6897–902
101. Leeton PR, Smyth DR. 1993. An abun-dant LINE-like element amplified in thegenome ofLilium speciosum. Mol. Gen.Genet.237:97–104
102. Leitch IJ, Bennett MD. 1997. Polyploidy inin angiosperms.Trends Plant Sci.2:470–75
103. Lenoir A, Cournoyer B, Warwick S, PicardG, Deragon JM. 1994. Evolution of SINES1 retroposons inCruciferaeplant species.Mol. Biol. Evol.14:934–41
104. Lim JK, Simmons JM. 1994. Grosschromosome rearrangements mediated
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 527
by transposable elements inDrosophilamelanogaster. BioEssays16:269–73
105. Llaca V, Messing J. 1998. Amplicons ofmaize zein genes are conserved withingenic but expanded and constricted in in-tergenic regions.Plant J.15:211–20
106. Loguercio LL, Wilkins TA. 1998. Struc-tural analysis of a hmg-coA reductasepseudogene: insights into evolutionaryprocesses affecting thehmgrgene familyin allotetraploid cotton (Gossypium hir-sutumL). Curr. Genet.34:241–49
107. Luan DD, Korman MH, Jakubczak JL,Eickbush TH. 1993. Reverse transcrip-tion of R2Bm RNA is primed by a nickat the chromosomal target site: a mecha-nism for non-LTR retrotransposition.Cell72:595–605
108. Lucas H, Feuerbach F, Kunert K,Grandbastien M-A, Caboche M. 1995.RNA-mediated transposition of the to-bacco retrotransposonTnt1 in Arabidop-sis thaliana. EMBO J.14:2364–73
109. Maier RM, Neckermann K, Igloi GL,Kossel H. 1995. Complete sequence of themaize chloroplast genome, gene content,hotspots of divergence and fine tuning ofgenetic information by transcript editing.J. Mol. Evol.25:614–28
110. Marillonnet S, Wessler SR. 1997. Retro-transposon insertion into the maizewaxygene results in tissue-specific RNA pro-cessing.Plant Cell9:967–78
111. Marillonnet S, Wessler SR. 1998. Ex-treme structural heterogeneity among themembers of a maize retrotransposon fam-ily. Genetics150:1245–56
112. Martienssen RA. 1996. Epigenetic phe-nomena: paramutation and gene silenc-ing in plants.Curr. Biol. 6:810–13
113. Martinez-Izquierdo JA, Garcia-MartinezJ, Vicient CM. 1997. What makesGrande1 retrotransposon different?Ge-netica100:15–28
114. Matsuoka Y, Tsunewaki K. 1999. Evo-lutionary dynamics of Ty1-copia groupretrotransposons in grass shown by re-
verse transcriptase domain analysis.Mol.Biol. Evol.16:208–17
115. Matzke AJM, Matzke MA. 1998. Posi-tion effects and epigenetic silencing ofplant transgenes.Curr. Opin. Plant Biol.1:142–8
116. McClintock B. 1984. The significance ofresponses of the genome to challenge.Sci-ence226:792–801
117. McDonald JF, Matyunina LV, Wilson S,Jordan IK, Brown NJ, Miller WJ. 1997.LTR retrotransposons and evolution ofeukaryotic enhancers.Genetica100:3–13
118. Meyers BC, Chin DB, Shen KA, Sivara-makrishnan S, Lavelle DO, et al. 1998.The major resistance gene cluster in let-tuce is highly duplicated and spans severalmegabases.Plant Cell10:1817–32
119. Mhiri C, De Wit PJGM, GrandbastienM-A. 1999. Activation of the promoter ofthe Tnt1 retrotransposon in tomato afterinoculation with the fungal pathogenCla-dosporium fulvum. Mol. Plant-MicrobeInteract.12:592–603
120. Mhiri C, Morel JB, Vernhettes S, Casacu-berta JM, Lucas H, Grandbastien MA.1997. The promoter of the tobaccoTnt1retrotransposon is induced by woundingand by abiotic stress.Plant Mol. Biol.33:257–66
121. Miller JT, Dong F, Jackson SA, SongJ, Jiang J. 1998. Retrotransposon-relatedDNA sequences in the centromeres ofgrass chromosomes.Genetics150:1615–23
122. Moore G, Devos KM, Wang Z, Gale MD.1995. Grasses, line up and form a circle.Curr. Biol. 5:737–39
123. Moore JK, Haber JE. 1996. Capture ofretrotransposon DNA at the sites of chro-mosomal double-strand DNA breaks.Na-ture383:644–45
124. Moore G, Lucas H, Batty N, Flavell R.1991. A family of retrotransposons andassociated genomic variation in wheat.Genomics10:461–68
125. Moreau-Mhiri C, Morel JB, Audeon C,
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?528 KUMAR ■ BENNETZEN
Ferault M, Grandbastien MA, Lucas H.1996. Regulation of expression of the to-baccoTnt1 retrotransposon in heterolo-gous species following pathogen-relatedstresses.Plant J.9:409–19
126. Motohashi R, Mochizuiki K, OhtsuboH, Ohtsubo E. 1997. Structures and dis-tribution of p-SINE1 members in ricegenomes.Theor. Appl. Genet.95:359–68
127. Mount SM, Rubin GM. 1985. Completenucleotide sequence of theDrosophilatransposable elementcopia: homologybetweencopia and retroviral proteins.Mol. Cell. Biol.5:1630–38
128. Noma K, Nakajima R, Ohtsubo H, Oht-subo E. 1997.RIRE1, a retrotranspo-son from wild rice (Oryza australiensis).Genes Genet. Syst.72:131–40
129. Noma K, Ohtsubo E, Ohtsubo H. 1999.Non-LTR retrotransposon LINEs areubiquitous components of plant genomes.Mol. Gen. Genet.261:71–79
130. Noutoshi Y, ItoY, Kanetani S, Fuji M,Usami S, Yamada T. 1998. Molecularanatomy of a small chromosome in thegreen algaChlorella vulgaris. NucleicAcids Res.26:3900–7
131. Nozue K, Kanegae T, Wada M. 1997. Afull length Ty3/gypsy-type retrotranspo-son in the fernAdiantum. J. Plant Res.110:495–99
132. O’Neill RJW, O’Neill MJ, Graves MA.1998. Undermethylation associated withretroelements activation and chromo-somes remodelling in an interspecificmammalian hybrid.Nature393:68–72
133. Okada N, Hamada M, Ogiwara I,Ohshima K. 1997. SINEs and LINEsshare common 3′ sequences: a review.Gene205:229–43
134. Orgel LE, Crick FHC. 1980. Selfish DNA:the ultimate parasite.Nature284:604–7
135. Pal-Bhadra M, Bhadra U, Birchler JA.1997. Cosuppression inDrosophila: genesilencing of alcohol dehydrogenasebywhite-adhtransgenes ispolycombdepen-dent.Cell 90:479–90
136. Panstruga R, Buschges R, Piffanelli P,Schulze-Lefert P. 1998. A contiguous 60kb genomic stretch from barley revealsmolecular evidence for gene islands ina monocot genome.Nucleic Acids Res.26:1056–62
137. Pardue ML, Danilevskaya ON, TraverseKL, Lowenhaupt K. 1997. Evolutionarylinks between telomeres and transposableelements.Genetica100:73–84
138. Pastugilia M, Ruffio-Chable V, DelormeV, Gaude T, Dumas C, Cock JM. 1997.A functional S locus anther gene is notrequired for the self-incompatibility re-sponse inBrassica oleracea. Plant Cell9:2065–76
139. Paul PK, Kunert E, Hunter E, Grand-bastien M-A. 1994. Expression of the to-baccoTnt1 retrotransposon promoter inheterologous species.Plant Mol. Biol.26:393–402
140. Pearce SR, Harrison G, Heslop-HarrisonJS, Flavell AJ, Kumar A. 1997. Character-isation and genomic organisation of Ty1-copia group retrotransposons in rye (Se-cale cereale). Genome40:617–25
141. Pearce SR, Harrison G, Li D, Heslop-Harrison JS, Kumar A, Flavell AJ. 1996.The Ty1-copiagroup retrotransposons inVicia species: copy number, sequenceheterogeneity and chromosomal localisa-tion. Mol. Gen. Genet.250:305–15
142. Pearce SR, Pich U, Harrison G, FlavellAJ, Heslop-Harrison JS, et al. 1996.The Ty1-copiagroup retrotransposons ofAllium cepa are distributed throughoutthe chromosomes but are enriched in theterminal heterochromatin.ChromosomeRes.4:357–64
143. Pearce SR, Stuart-Rogers CM, Kumar A,Ellis THN, Flavell AJ. 1999. Rapid iso-lation of plant Ty1-copia group retro-transposon LTR sequences for molecularmarker studies.Plant J.19:711–17
144. Pelissier T, Tutois S, Tourmente S, De-ragon JM, Picard G. 1996. DNA re-gions flanking the majorArabidopsis
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 529
thalianasatellite are principally enrichedin Athila retroelement sequences.Genet-ica 97:141–51
145. Petrov D. 1977. Slow but steady: Reduc-tion of genome size through biased muta-tion. Plant Cell11:1900–1
146. Pimpinelli S, Berloco M, Fanti L, DemitriP, Bonaccorsi E, et al. 1995. Trans-posable elements are stable compo-nents ofDrosophila melanogasterhete-rochromatin.Proc. Natl. Acad. Sci. USA92:3804–8
147. Pouteau S, Grandbastien MA, Boccara M.1994. Microbial elicitors of plant defenseresponse activate transcription of a retro-transposon.Plant J.5:535–42
148. Pouteau S, Huttner E, Grandbastien MA,Caboche M. 1991. Specific expression ofthe tobaccoTnt1 retrotransposon in pro-toplasts.EMBO J.10:1911–18
149. Pozueta-Romero J, Houlne G, SchantzR. 1998. Identification of a short in-terspersed repetitive element in partiallyspliced transcripts of the bell pepper(Capsicum annuum) PAP gene: new evo-lutionary and regulatory aspects on planttRNA-related SINEs.Gene214:51–58
150. Presting GG, Malysheva L, FuchsJ, Schubert I. 1998. A TY3/GYPSYretrotransposon-like sequence localises tothe centromeric regions of cereal chromo-somes.Plant J.16:721–28
151. Purugganan MD, Wessler SR. 1994.Molecular evolution of magellan, amaize Ty3/gypsy-like retrotransposon.Proc. Natl. Acad. Sci. USA91:11674–78
152. Purugganan MD, Wessler SR. 1995.Transposon signatures-species-specificmolecular markers that utilise a class ofmultiple-copy nuclear-DNA.Mol. Ecol.4:265–69
153. Ramsay L, Macaulay M, Cardle L, Mor-gante M, Degli-Ivanissevich S, et al. 1999.Intimate association of microsatellite re-peats with retrotransposons and other dis-persed repetitive elements in barley.PlantJ. 17:415–23
154. Razin A. 1998. CpG methylation, chro-matin structure and gene silencing—athree way connection.EMBO J.17:4905–8
155. Royo J, Nass N, Matton DP, OkamotoS, Clarke AE, Newbigin E. 1996. Aretrotransposon-like sequence linked tothe S-locus ofNicotiana alata is ex-pressed in styles in response to touch.Mol. Gen. Genet.250:180–88
156. SanMiguel P, Bennetzen JL. 1998. Ev-idence that a recent increase in maizegenome size was caused by the mas-sive amplification of intergene retrotrans-posons.Ann. Bot.81:37–44
157. SanMiguel P, Gaut BS, Tikhonov A,Nakajima Y, Bennetzen JL. 1998. Thepalaeontology of intergene retrotrans-posons of maize.Nat. Genet.20:43–45
158. SanMiguel P, Tikhonov A, Jin YK,Motchoulskaia N, Zakharov D, et al.1996. Nested retrotransposons in the in-tergenic regions of the maize genome.Sci-ence274:765–68
159. Sato Y, Sentoku N, Miura Y, HirochikaH, Kitano H, Matsuoka M. 1999. Loss-of-function mutations in the rice homeoboxgeneOSH15affect the architecture of in-ternodes resulting in dwarf plants.EMBOJ. 18:992–1002
160. Schmid CW. 1998. Does SINE evolutionpreclude Alu function? Nucleic AcidsRes.26:4541–50
161. Schmidt T, Kubis S, Heslop-Harrison JS.1995. Analysis and chromosomal local-isation of retrotransposons in sugar beet(Beta vulgaris) LINES and Ty1-copia-like elements as major components of thegenome.Chromosome Res.3:335–45
162. Schuster W, Brennicke A. 1987. Plastid,nuclear and reverse transcriptase se-quences in the mitochondrial genome ofOenothera: is genetic information trans-ferred between organelles via RNA?EMBO J.45:61–78
163. Schwarz-Sommer Z, Leclercq L, G¨obel
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?530 KUMAR ■ BENNETZEN
E, Saedler H. 1987.Cin4, an insert al-tering the structure of theA1 gene ofZea mays, exhibits properties of nonviralretrotransposons.EMBO J.6:3873–80
164. Sentry JW, Smyth DR. 1989. An elementwith long terminal repeats and its vari-ant arrangements in the genome ofLiliumhenryi. Mol. Gen. Genet.215:349–54
165. Shepherd NS, Schwarz-Sommer Z,Blumberg vel Spalve J, Gupta M, Wien-and U, Saedler H. 1984. Similarity oftheCin1 repetitive family ofZea maystoeukaryotic transposable elements.Nature307:185–87
166. Simmen MW, Leitgeb S, Charlton J, JonesSJ, Harris BR, et al. 1999. Nonmethy-lated transposable elements and methy-lated genes in chordate genome.Science11:64–67
167. Smit AFA. 1996. The origin of inter-spersed repeats in the human genome.Curr. Opin. Genet. Dev.6:743–48
168. Smyth DR, Kalitsis P, Joseph JL, SentryJW. 1989. Plant retrotransposon fromLil-ium henryiis related to Ty3 of yeast andthegypsygroup ofDrosophila.Proc. Natl.Acad. Sci. USA86:5015–19
169. Smyth DR. 1993. Plant retrotransposons.In Control of Gene Expression, ed. DPSVerma, pp. 1–15. London: CRC Press
170. Song SU, Gerasimova T, Kurkulos M,Boeke JD, Corces VG. 1994. An Env-likeprotein encoded byDrosophila retroele-ment: evidence thatgypsyis an infectiousretrovirus.Genes Dev.8:2026–57
171. Song W-Y, Pi L-Y, Wang G-L, GardnerJ, Holsten T, Ronald PC. 1997. Evolutionof the riceXa21disease resistance genefamily. Plant Cell9:1279–87
172. Suoniemi A, Anamthawat-Jonsson K,Arna T, Schulman AH. 1996. Retrotrans-posonBARE-1is a major, dispersed com-ponent of the barley (Hordeum vulgareL.) genome.Plant Mol. Biol. 30:1321–29
173. Suoniemi A, Narvanto A, Schulman AH.1996. The BARE-1 retrotransposon is
transcribed in barley from an LTR pro-moter active in transient assays.PlantMol. Biol. 31:295–306
174. Suoneimi A, Schmidt D, Schulman AH.1997.BARE-1insertion site preferencesand evolutionary conservation of RNAand cDNA processing sites.Genetica100:219–30
175. Suoniemi A, Tanskanen J, Pentikainen O,Johnson MS, Schulman AH. 1998. Thecore domain of retrotransposon integrasein Hordeum: predicted structure and evo-lution. Mol. Biol. Evol.15:1135–44
176. Suoniemi A, Tanskanen J, Schulman AH.1998. Gypsy-like retrotransposons arewidespread in the plant kingdom.PlantJ. 13:699–705
177. Takeda S, Sugimoto K, Otsuki H, Hi-rochika H. 1998. Transcriptional activa-tion of the tobacco retrotransposonTto1by wounding and methyl jasmonate.PlantMol. Biol. 36:365–76
178. Takeda S, Sugimoto K, Otsuki H, Hi-rochika H. 1999. 13–bpcis-regulatory el-ement in the LTR promoter of tobaccoretrotransposonTto1 is involved in re-sponsiveness to tissue culture, wound-ing, methyl jasmonate and fungal elici-tors.Plant J.18:1–11
179. Tatout C, Lavie L, Deragon JM. 1998.Similar target site selection occurs in in-tegration of plant and mammalian retro-posons.J. Mol. Evol.47:463–70
180. Temin HM. 1980. Origin of retrovirusesfrom cellular moveable genetic elements.Cell 21:599–600
181. Thomson KG, Thomas JE, Dietzgen RG.1998. Retrotransposon-like sequences in-tegrated into the genome of pineapple,Ananas comosus. Plant Mol. Biol. 38:461–65
182. Tikhonov AP, SanMiguel PJ, Nakajima Y,Gorenstein ND, Bennetzen JL, AvramovaZ. 1999. Colinearity and its exceptionsin orthologousadh regions of maize andsorghum.Proc. Natl. Acad. Sci. USA96:7409–14
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?PLANT RETROTRANSPOSONS 531
183. Trelogan S, Martin SL. 1995. Tightlyregulated, developmentally specific ex-pression of the first open reading frameLINE-1 during mouse embryogenesis.Proc. Natl. Acad. Sci. USA92:1520–24
184. Turcich MP, Bokharririza A, HamiltonDA, He CP, Messier W, et al. 1996.PREM-2, acopia-type retroelement inmaize is expressed preferentially in earlymicrospores.Sexual Plant Reprod.9:65–74
185. Unseld M, Marienfeld JR, Brandt P,Brennicke A. 1997. The mitochondrialgenome ofArabidopsis thalianacontains57 genes in 366,924 nucleotides.Nat.Genet.15:57–61
186. VanderWiel PL, Voytas DF, Wendel JF.1993. Copia-like retrotransposable ele-ment evolution in diploid and polyploidcotton (GossypiumL.). J. Mol. Evol.36:429–47
187. Varagona MJ, Purugganan M, WesslerSR. 1992. Alternative splicing inducedby insertion of retrotransposons into themaizewaxygene.Plant Cell4:811–20
188. Vegh Z, Vincze E, Kadirov R, TothG, Kiss GB. 1990. The nucleotide se-quence of a nodule-specific geneNms-25of Medicago sativa: its primary evolutionvia exon-shuffling and retrotransposon-mediated DNA rearrangements.PlantMol. Biol. 15:295–306
189. Vernhettes S, Grandbastien MA, Casacu-berta JM. 1997. In vivo characterisation oftranscriptional regulatory sequences in-volved in the defence-associated expres-sion of the tobacco retrotransposonTnt1.Plant Mol. Biol.35:673–79
190. Vernhettes S, Grandbastien MA, Casacu-berta JM. 1998. The evolutionary analy-sis of theTnt1 retrotransposon inNico-tiana species reveals the high variabilityof its regulatory sequences.Mol. Biol.Evol.15:827–36
191. Vicient CM, Suoniemi A, Anamthawat-Jonsson K, Tanskanen J, et al. 1999.RetrotransposonBARE-1and its role in
genome evolution in the genusHordeum.Plant Cell.11:1769–84
192. Vignols F, Rigau J, Torres MA, Capel-lades M, Puigdomenech P. 1995. Thebrown midrib3 (bm3)mutation in maizeoccurs in the gene encoding caffeic acidO-methyltransferase.Plant Cell 7:407–16
193. Voytas DF, Ausubel FM. 1988. Acopia-like transposable element family inAra-bidopsis thaliana. Nature336:242–44
194. Voytas DF, Cummings MP, KoniecznyA, Ausubel FM, Rodermel SR. 1992.Copia-like retrotransposons are ubiqui-tous among plants.Proc. Natl. Acad. Sci.USA89:7124–28
195. Voytas DF, Konieczny A, Cummings MP,Ausubel FM. 1990. The structure, dis-tribution and evolution of theTa1 retro-transposable element family ofArabidop-sis thaliana. Genetics126:713–21
196. Wakimoto B. 1998. Beyond the nucle-osome: epigenetic aspects of position-effect variegation inDrosophila. Cell 93:321–24
197. Wang G-L, Ruan DL, Song W-Y, SiderisS, Chen L, et al. 1998.Xa21D encodesa receptor-like molecule with a leucine-rich repeat domain that determines race-specific recognition and is subject toadaptive evolution.Plant Cell10:765–79
198. Wang S, Zhang Q, Maughan PJ, Saghai-Maroof MA. 1997.Copia-like retrotrans-posons in rice: Sequence heterogeneity,species distribution and chromosomal lo-cations.Plant Mol. Biol.33:1051–58
199. Wang SP, Liu N, Peng KM, Zhang QF.1999. The distribution and copy num-ber ofcopia-like retrotransposons in rice(Oryza sativa) and their implications inthe organization and evolution of therice genome.Proc. Natl. Acad. Sci. USA96:6824–28
200. Warner SA, Scott RJ, Draper J. 1993. Iso-lation of an asparagus intracellular PRgene (AoPR) wound-responsive promoterby the inverse polymerase chain reaction
P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE
November 2, 1999 22:53 Annual Reviews AR095-15
?532 KUMAR ■ BENNETZEN
and its characterisation in transgenic to-bacco.Plant J.3:191–201
201. Waugh R, McLean K, Flavell AJ, PearceSR, Kumar A, et al. 1997. Genetic distri-bution ofBARE-1–like retrotransposableelements in the barley genome revealedby sequence-specific amplification poly-morphisms (S-SAP).Mol. Gen. Genet.253:687–94
202. Wessler SR, Bureau TE, White SE. 1995.LTR-retrotransposons and MITES, im-portant players in the evolution of plantgenomes.Curr. Opin. Genet. Dev.5:814–21
203. White SE, Habera LF, Wessler SR. 1994.Retrotransposons in the flanking regionsof normal plant genes-a role forcopia-likeelements in the evolution of gene struc-ture and expression.Proc. Natl. Acad. Sci.USA91:11792–96
204. Williamson VM. 1983. Transposable el-ements in yeast.Int. Rev. Cytol.83:1–25
205. Wright DA, Ke N, Smalle J, Hauge BM,Goodman HM, Voytas DF. 1996. Mul-tiple non-LTR retrotransposons in thegenome ofArabidopsis thaliana. Genet-ics142:569–78
206. Wright DA, Voytas DF. 1998. Potentialretroviruses in plants:Tat1 is related to agroup ofArabidopsis thalianaTy3/gypsyretrotransposons that encode envelope-like proteins.Genetics149:703–15
207. Xiong Y, Eickbush TH. 1990. Origin andevolution of retroelements based upontheir reverse transcriptase sequences.EMBO J.9:3353–62
208. Yoder JA, Walsh CP, Bestor TH. 1997.Cytosine methylation and the ecology ofintragenomic parasites.Trends Genet.13:335–40
209. Youngman S, Van Luenen HGAM, Plas-terk RHA. 1996.Rte-1, a retrotransposon-like element inCaenorhabditis elegans.FEBS Lett.380:1–7
210. Yoshioka Y, Matsumoto S, Kjima S,Ohshima K, Okada N, Machida Y. 1993.Molecular characterization of a short in-terspersed repetitive element from to-bacco that exhibits sequence homologyto specific tRNAs.Proc. Natl. Acad. Sci.USA90:6562–66
211. Zou S, Voytas DF. 1997. Silent chromatindetermines target preference of theSac-charomycesretrotransposon Ty5.Proc.Natl. Acad. Sci. USA94:7412–16