PLANT RETROTRANSPOSONS - MTUforest.mtu.edu/faculty/joshi/publish/pdf/amar.pdf · P1: FNE/fgo/fok...

55
? Annu. Rev. Genet. 1999. 33:479–532 Copyright c 1999 by Annual Reviews. All rights reserved PLANT RETROTRANSPOSONS Amar Kumar and Jeffrey L. Bennetzen Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, Scotland and Department of Biological Sciences, Purdue University, West Lafayette, Indiana 47907-1392; e-mail: [email protected] Key Words DNA markers, evolution, genome organization, mutation, regulation, retroelements Abstract Retrotransposons are mobile genetic elements that transpose through reverse transcription of an RNA intermediate. Retrotransposons are ubiquitous in plants and play a major role in plant gene and genome evolution. In many cases, retrotrans- posons comprise over 50% of nuclear DNA content, a situation that can arise in just a few million years. Plant retrotransposons are structurally and functionally similar to the retrotransposons and retroviruses that are found in other eukaryotic organisms. How- ever, there are important differences in the genomic organization of retrotransposons in plants compared to some other eukaryotes, including their often-high copy num- bers, their extensively heterogeneous populations, and their chromosomal dispersion patterns. Recent studies are providing valuable insights into the mechanisms involved in regulating the expression and transposition of retrotransposons. This review de- scribes the structure, genomic organization, expression, regulation, and evolution of retrotransposons, and discusses both their contributions to plant genome evolution and their use as genetic tools in plant biology. CONTENTS Introduction .................................................... 480 Types, Structures, and Distributions of Retrotransposons in Plants ......... 481 LTR Retrotransposons of the Ty1-copia Group .......................... 482 LTR Retrotransposons of the Ty3-gypsy Group .......................... 489 LINEs ....................................................... 489 SINEs ....................................................... 490 Retrotransposon Dispersal and Organization in The Plant Genome ........ 490 Chromosomal Locations .......................................... 490 Retrotransposon Abundance in Plants Relative to Other Eukaryotes ........... 493 Local Structures ................................................ 495 Insertion Site Preferences ......................................... 496 In Organellar Genomes .......................................... 497 Regulation of Expression and Transposition .......................... 497 Transcriptional Regulation ........................................ 497 0066-4197/99/1215-0479$08.00 479

Transcript of PLANT RETROTRANSPOSONS - MTUforest.mtu.edu/faculty/joshi/publish/pdf/amar.pdf · P1: FNE/fgo/fok...

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?Annu. Rev. Genet. 1999. 33:479–532

Copyright c© 1999 by Annual Reviews. All rights reserved

PLANT RETROTRANSPOSONS

Amar Kumar and Jeffrey L. BennetzenScottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, Scotland andDepartment of Biological Sciences, Purdue University, West Lafayette, Indiana47907-1392; e-mail: [email protected]

Key Words DNA markers, evolution, genome organization, mutation, regulation,retroelements

■ Abstract Retrotransposons are mobile genetic elements that transpose throughreverse transcription of an RNA intermediate. Retrotransposons are ubiquitous in plantsand play a major role in plant gene and genome evolution. In many cases, retrotrans-posons comprise over 50% of nuclear DNA content, a situation that can arise in just afew million years. Plant retrotransposons are structurally and functionally similar to theretrotransposons and retroviruses that are found in other eukaryotic organisms. How-ever, there are important differences in the genomic organization of retrotransposonsin plants compared to some other eukaryotes, including their often-high copy num-bers, their extensively heterogeneous populations, and their chromosomal dispersionpatterns. Recent studies are providing valuable insights into the mechanisms involvedin regulating the expression and transposition of retrotransposons. This review de-scribes the structure, genomic organization, expression, regulation, and evolution ofretrotransposons, and discusses both their contributions to plant genome evolution andtheir use as genetic tools in plant biology.

CONTENTS

Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 480Types, Structures, and Distributions of Retrotransposons in Plants. . . . . . . . . 481

LTR Retrotransposons of the Ty1-copiaGroup . . . . . . . . . . . . . . . . . . . . . . . . . . 482LTR Retrotransposons of the Ty3-gypsyGroup . . . . . . . . . . . . . . . . . . . . . . . . . . 489LINEs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 489SINEs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 490

Retrotransposon Dispersal and Organization in The Plant Genome. . . . . . . . 490Chromosomal Locations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 490Retrotransposon Abundance in Plants Relative to Other Eukaryotes. . . . . . . . . . . 493Local Structures. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 495Insertion Site Preferences. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 496In Organellar Genomes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 497

Regulation of Expression and Transposition. . . . . . . . . . . . . . . . . . . . . . . . . . 497Transcriptional Regulation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 497

0066-4197/99/1215-0479$08.00 479

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?480 KUMAR ■ BENNETZEN

Activation by Biotic and Abiotic Stresses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 498Regulatory Sequences. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 499Epigenetic and Posttranscriptional Regulations. . . . . . . . . . . . . . . . . . . . . . . . . 500Retrotransposon Regulation as a Form of Host Defense. . . . . . . . . . . . . . . . . . . . 502

Plant Retrotransposon Origins and Evolution. . . . . . . . . . . . . . . . . . . . . . . . . 503The Frequency and Timing of Retrotransposition During Evolution. . . . . . . . . . . 503Heterogeneity. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 505Internal Mutation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 506Retrotransposons and Retroviruses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 507Horizontal Transmission?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 508

Contributions of Retrotransposons to the Evolution of Plant Genesand Genomes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510Gene Mutation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510Genome Size. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 511Genome Rearrangement. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 516Roles for Retrotransposons?. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 517

Retrotransposons as Genetic Tools. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 518Molecular Markers for Phylogenetic, Biodiversity, and Genetic

Linkage Analyses. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 518Gene Tagging and Functional Analysis of Genes. . . . . . . . . . . . . . . . . . . . . . . . . 519Evaluation of Somaclonal Variations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 519

Conclusions and Future Prospects. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 520

INTRODUCTION

The mobile genetic elements called retrotransposons are ubiquitous in the plantkingdom. They are present in high copy numbers in most plants, making themmajor constituents of plant genomes. Retrotransposons move to new chromosomallocations via an RNA intermediate that is converted into extrachromosomal DNAby the encoded reverse transcriptase/RNaseH enzymes prior to reinsertion intothe genome (13, 16). This replicative mode of transposition can rapidly increasethe copy numbers of elements and can thereby greatly increase plant genome size(93, 156). The DNA transposable elements (for example,Ac, Tam1, andEn/Spm)transpose by an excision/repair mechanism and usually do not greatly increaseplant genome size (97). Retrotransposons, like DNA transposable elements, cangenerate mutations by inserting within or near genes. Moreover, retrotransposon-induced mutations are relatively stable; because they transpose via replication, thesequence at the insertion site is retained.

Many of the properties of transposable elements suggest that they are parasiticor selfish DNAs (34, 134). Because they are expressed in the genome and subjectto many of the same rules of inheritance as are the genes of their genomic “host,”a retrotransposon can be co-opted for use in a biological process that benefitsthe organism. For instance, the preferential insertion of some retrotransposons inDrosophilaat telomeric locations has removed the need for a telomerase function inthese insects (137). However, discussion of plant/transposable element interactions

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 481

are often most informative from a plant/parasite perspective, and we often employthis approach of looking at the genome as an ecosystem in which different DNAcomponents vie for influence and survival (7, 34, 134).

Because retrotransposons have the potential to dramatically alter gene func-tion and host genome structure, it is not surprising that their transpositional ac-tivities are regulated both by retrotransposon- and host-encoded factors, possiblyto avoid deleterious effects on host and retrotransposon survival. The intimaterelationship between retrotransposons and their plant hosts has existed for manymillions of years. We are just beginning to understand how retrotransposons andtheir hosts’ genomes have co-evolved mechanisms to regulate transposition, in-sertion specificities, and mutational outcomes in order to optimize each other’ssurvival. Here, we review the current status of research on plant retrotransposonswith respect to their structure, genomic organization, expression, regulation, trans-positional activities, and evolution. We also discuss their contributions to plantgenome evolution and their applications as tools for phylogenetic studies, bio-diversity assessments, and gene tagging. Since the first publication on a plantretrotransposon in 1984 (165), many publications on this subject have appeared,and it is not possible to refer to all of them in this review. For additional back-ground, we suggest any of several previous reviews on plant retrotransposons(5, 48, 54, 55, 95, 97, 169, 202).

TYPES, STRUCTURES, AND DISTRIBUTIONSOF RETROTRANSPOSONS IN PLANTS

Retrotransposons are the most abundant and widespread class of eukaryotic trans-posable element, consisting of the long terminal repeat (LTR) and the non-LTRretrotransposons (Figure 1) (5, 54, 93, 97). LTR retrotransposons are further sub-classified into the Ty1-copia and the Ty3-gypsygroups that differ from eachother in both their degree of sequence similarity (207) and the order of en-coded gene products (Figure 1). Ty1-copiaretrotransposons are present through-out the plant kingdom, in species ranging from single-cell algae to bryophytes,gymnosperms, and angiosperms (44, 195). Ty3-gypsyretrotransposons also havebeen found to be widely distributed in the plant kingdom, including both gym-nosperms and angiosperms (90, 96, 131, 164, 176, 181). Furthermore, both thesegroups of retrotransposons are commonly found in high copy number (up to afew million copies per haploid nucleus) in plants with large genomes (Table 1)(5, 95, 97, 101, 147, 158). The non-LTR retrotransposons, LINEs (long interspersedrepetitive elements) and SINEs (short interspersed repetitive elements) can alsobe found in high copy numbers (up to 250,000) in the plant species studied to date(5, 97, 101, 169). Like the LTR retrotransposons, LINEs are present throughoutthe plant kingdom (129, 130). SINEs have been found in several angiosperms(32, 126, 130, 142, 210) and may be ubiquitous in plants. Table 1 shows a list ofsome well-characterized retrotransposons in plants.

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?482 KUMAR ■ BENNETZEN

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 483

LTR Retrotransposons of the Ty1-copia Group

The LTR retrotransposons have direct long terminal repeats (LTRs) that can rangefrom a few 100 bp to over 5 kb in size (Figure 1; Table 1). The LTRs do notencode any known proteins, but they do contain the promoters and terminatorsassociated with the transcription of LTR retrotransposons. LTRs terminate in shortinverted repeats, usually 5′—TG–3′ and 5′—CA–3′. The LTR retrotransposonsencode a number of proteins, specified by three major genes calledgag, pol, andint. These genes and proteins are all specified by a single mRNA molecule thathas the structure 5′—R–U5—PBS–coding region–PPT–U3—R–3′, where R, U5,PBS, PPT, and U3 stand for repeated RNA, unique 5′ RNA, primer binding site,polypurine tract, and unique 3′ RNA, respectively (Figure 1). Hence, transcriptioninitiates at the 5′ end of R in the 5′ LTR and terminates in the 3′ end of R inthe 3′ LTR. As this result suggests, the 3′ LTR also contains a promoter thatcan lead to readthrough transcription of sequences downstream from the insertedelement. The proteins encoded bygag, pol, andint are synthesized as a polyproteinthat is cleaved into functional peptides by apol-encoded protease (PR). Thegaggene encodes proteins involved in maturation and packaging of retrotransposonRNA and proteins into a form suitable for integration into the genome. Thepolgene encodes reverse transcriptase and RNase H activities that are required forreplication/transposition of the retrotransposon, whereasint encodes the integrasethat allows the DNA form of the retrotransposon to insert at a new chromosomallocation. In some cases,gag, pol, and int proteins are encoded within a singletranslational reading frame, but in other cases, two or more reading frames arepresent. This leads to a requirement for a frameshift or translational reinitiation to

←−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−−Figure 1 General structures of the Ty1-copia, Ty3-gypsy, LINE, and SINE retrotrans-posons. The LTR retrotransposons have long terminal repeats in direct orientation ateach end. Within the LTRs are U3, R, and U5 regions that contain signals for initiationand termination of transcription. The transcript (thin arrow) is shown below each block;it initiates at the 5′ end of R within the 5′ LTR and terminates at the 3′ end of R withinthe 3′ LTR. The genes within the retrotransposons encode capsid-like proteins (CP),endonuclease (EN), integrase (INT), protease (PR), reverse transcriptase (RT), andRNAase-H. Other sequences featured are PBS (primer binding sites), PPT (polypurinetracts), NA (nucleic acid binding moiety), IR (inverted terminal repeats), DR (flankingtarget direct repeat), 5′ UTR (5′ untranslated region), 3′ UTR (3′ untranslated region),and Pol III A and B-promoter recognition sites for RNA polymerase III. These figuresare not drawn to scale, as the LTR retrotransposons range from a few kb up to 15 kbin size. LINEs usually range in size from less than 1 kb to maybe 8 kb, while SINEsare normally 100 bp to 300 bp in size. The envelope (env) gene-like sequence in theposition of ORF 3, where a functionalenvgene is present in the animal retroviruses,has been found in both Ty1-copiaand Ty3-gypsygroups (not shown). The function oftheenvgene-like sequence in plant retrotransposons is unknown.

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?T

AB

LE

1Pr

oper

ties

ofso

me

plan

tret

rotr

ansp

oson

s

Size

Pla

nt&

Tota

lLT

Rge

nom

eR

NA

Ret

rotr

ansp

oson

kbbp

Cop

No.

size

(1C

)L

ocat

ion

tran

scri

ptC

omm

ents

Ref

eren

ce

LTR

retr

otra

nspo

sons

Ty1-

copi

agr

oup

BA

RE

-112

.118

29>

5000

0B

arle

yD

ispe

rsed

Lea

ves,

callu

sA

ctiv

ein

the

rece

nt17

2(

487

0 M

bp)

past

;the

mos

tabu

n-da

ntkn

own

fam

ilyin

the

barl

eyge

nom

e

Bs1

3.2

302

1–5

Mai

ze—

—L

acks

RT

sequ

ence

s;76

( 2

290 0

Mbp

)ha

sac

quir

edco

n-se

rved

tran

smem

bran

edo

mai

nsof

ace

llula

rpl

asm

am

embr

ane

prot

onA

TPa

sege

ne

Opi

e-1

8.7

1256

>30

000

Mai

zeD

ispe

rsed

Roo

ts,l

eave

sA

ctiv

ew

ithin

the

last

158

inte

rgen

icta

ssel

s2

to6

mill

ion

year

s;pr

efer

entia

llyin

sert

sin

toth

eLT

Rs

ofot

her

retr

otra

nspo

sons

PRE

M-2

9.5

1307

>10

000

Mai

zeD

ispe

rsed

Mic

rosp

ores

Act

ive

inth

ere

cent

past

;18

4in

sert

ions

atth

e5′

flank

ing

regi

ons

ofth

epo

lgal

gene

s,co

nsti-

tute

sab

out5

%of

the

mai

zege

nom

e

∼ ∼

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?SI

RE

-111

.010

00>

200

Soyb

ean

—Se

edlin

gs,l

eaf

Act

ive

inth

ere

cent

past

;10

0(1

115

Mbp

)tis

sues

cont

ains

enve

lope

-lik

ese

quen

ces

Tnt

15.

361

0>

100

Toba

cco

Dis

pers

edPr

otop

last

,A

ctiv

ely

tran

spos

ing;

56(

229

0

Mbp

)eu

chro

mat

inro

ots,

wou

ndin

g,is

olat

edas

anin

sert

ion

path

ogen

atta

cks

inth

eni

age

nein

tissu

e-cu

lture

cells

;tra

nspo

ses

intr

ansg

enic

tom

ato

and

Ara

bido

psis

ToS1

74.

113

81–

5R

ice

Dis

pers

edPr

otop

last

,T

rans

pose

sin

tissu

ecu

l-66

( 4

20M

bp)

tissu

ecu

lture

ture

;mos

tlyin

sert

sin

orne

arco

ding

regi

ons.

Tto

15.

357

4>

30To

bacc

oD

ispe

rsed

Prot

opla

st,t

issu

eT

rans

pose

sdu

ring

tissu

e64

cultu

re,v

iral

,cu

lture

;tra

nspo

ses

inw

ound

ing,

path

o-tr

ansg

enic

rice

and

gen

atta

cks

Ara

bido

psis

.

Ty3-

gyps

ygr

oup

Ath

ila

10.5

1539

30A

.tha

lian

aPa

race

ntro

mer

ic—

Ath

ila

fam

ilyre

pres

ents

144

( 1

30 M

bp)

regi

ons

0.3%

ofth

eA

rabi

dops

isge

nom

e

Ath

ila-

1-1

∼12.

013

2473

0A

.tha

lian

a—

—C

onta

ins

enve

lope

-lik

e20

6se

quen

ces

Cer

eba

∼10.

0—

Bar

ley

Cen

trom

eric

—E

xclu

sive

lycl

uste

red

in15

0ce

ntro

mer

icre

gion

Cin

ful-

18.

658

6M

aize

—L

eave

sC

onta

ins

enve

lope

-lik

e15

6se

quen

ces

Cyc

lops

-2∼1

2.5

1504

1000

Pea

Dis

pers

ed—

Con

tain

sen

velo

pe-l

ike

24(

440

0 M

bp)

sequ

ence

s

cont

inue

d(

)

∼∼ ∼∼ ∼

1500

2000

0

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?G

rand

e-1

13.5

600

>13

00M

aize

Dis

pers

ed—

Con

tain

sa

larg

e3′

regi

on11

3of

mor

eth

an7

kbth

atap

pare

ntly

harb

ors

anac

quir

edce

llula

rge

neth

atpr

oduc

esa

900-

bptr

ansc

ript

inal

ltis

sues

exam

ined

IFG

5.9

333

1000

0P

inus

radi

ata

——

Inac

tive,

the

IFG

retr

o-90

( 4

0000

Mbp

)tr

ansp

oson

repr

esen

ts0.

25–0

.5%

ofth

eP

inus

geno

me

RIR

E3

∼10.

523

1610

Ric

e—

—In

activ

e;a

fam

ilyof

96R

IRE

3el

emen

tsar

efo

und

inal

lpla

nts

test

ed

Zeo

n-1

7.3

648

>12

50M

aize

—E

ndos

perm

Lac

ksR

Tse

quen

ces,

72bu

tpro

duce

sga

gpr

otei

nde

tect

edby

antib

odie

s

Non

-LT

Rre

trot

rans

poso

nsL

INE

s

Cin

41–

6.8

—50

–100

Mai

ze—

—A

fam

ilyof

hete

ro-

163

gene

ousl

ytr

unca

ted

5′-e

nds

TA

BL

E1

Size

Pla

nt&

Tota

lLT

Rge

nom

eR

NA

Ret

rotr

ansp

oson

kbbp

Cop

No.

size

(1C

)L

ocat

ion

tran

scri

ptC

omm

ents

Ref

eren

ce

c

onti

nued

)

(

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?D

el2

4.5

—25

0000

Lil

ium

Dis

pers

ed—

Acc

ount

sfo

r4%

ofth

e10

1(2

9000

Mbp

)L

iliu

mge

nom

e

Tal1

6.1

—1–

6A

.tha

lian

aD

ispe

rsed

—A

fam

ilyof

non-

LTR

205

elem

ents

;17

addi

tiona

lre

vers

etr

ansc

ript

ase

sequ

ence

s(T

a12-

28)

wer

eid

entifi

ed;T

a17

islo

cate

din

the

mito

-ch

ondr

ialg

enom

e

Zep

p0.

987–

8943

—13

0C

hlor

ella

Telo

mer

ic—

Clu

ster

edin

the

telo

mer

ic62

vulg

aris

regi

on

SIN

Es

p-SI

NE

10.

125

—>

100

Ric

e—

—U

sefu

las

DN

Am

arke

rs74

S1B

n0.

170

—50

0B

rass

ica

napu

sD

ispe

rsed

Shoo

t,ro

ots,

Ele

men

tsar

etw

ofol

d32

( 6

30M

bp)

callu

sm

ore

met

hyla

ted

than

the

aver

age

met

hyla

tion

leve

lof

B.n

apus

nucl

ear

DN

A

TS

0.11

1—

>50

,000

Toba

cco

Dis

pers

ed—

TS-

hom

olog

ous

sequ

ence

s21

0ar

epr

esen

tin

the

intr

ons

and

flank

ing

regi

ons

ofm

any

gene

sin

the

Sola

nace

ae

Ts

0.20

6—

>10

00B

ellp

eppe

rD

ispe

rsed

—T

rans

pose

dw

ithin

the

149

( 3

420

Mbp

)PA

P2

gene

∼ ∼

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?488 KUMAR ■ BENNETZEN

make appropriate peptides downstream of the change in frame, and thereby canlead to a lower level of these downstream proteins relative to those upstream.

LTR retrotransposons undergo intracellular replication by a complex processthat first requires synthesis of a retrotransposon mRNA molecule (16). This mRNAspecifies the proteins needed for replication, and also serves as the template forreplication. The primer binding site on the mRNA molecule is complementary to acellular RNA, usually the 3′ end of a host tRNA. In vivo hybridization between theretrotransposon mRNA and the appropriate tRNA provides a short double-strandedRNA region with a free 3′ hydroxyl from the tRNA. Reverse transcriptase can usethis 3′ end as a primer, then synthesizing a DNA complement to the R and U5portions of the 5′ LTR. Reverse transcriptase cannot initially synthesize any addi-tional DNA, however, because it has come to the end of the template, the 5′ endof the retrotransposon mRNA. However, the RNase H molecule encoded by theretrotransposon specifically digests the RNA in any RNA:DNA hybrid, thus free-ing up a single-stranded DNA with homology to the R sequence that is also foundat the 3′ end of retrotransposon mRNAs. Hybridization between these sequencesleads to a circular structure that allows a continuation of the reverse transcrip-tion until a single-stranded DNA complementary to all of the element-internalsequences is synthesized to generate a single-stranded DNA circle. Second-strandDNA synthesis requires the action of RNase H and a polypurine tract (PPT) thatis just 5′ to the 3′ LTR. Once the double-stranded linear DNA molecule is synthe-sized, it can then be incorporated into the target genome by the action of integrase,which appears to cut both donor and target molecules. The DNA is apparentlycut with nicks that are staggered by 3 to 5 bp (a size that is consistent for anygiven integrase), thereby creating a flanking target direct repeat that is 3 to 5 bpin size.

In plants, only a few groups have attempted to study the replication cycle ofLTR retrotransposons. Initially, it was shown that tobaccoTnt1 transposes via anRNA-intermediate in the genome ofArabidopsis thaliana(108). Later, this groupdemonstrated that the replication cycle of the element involves generation of extra-chromosomal double-stranded DNA linear intermediates ofTnt1(43). They havealso shown that the primer binding site (PBS) and the PPT of theTnt1 used asprimers for the initiation of minus- and plus-strand DNAs are separated by twonucleotides from the 5′ and 3′ LTRs, respectively, giving rise to extrachromosomallinear intermediates that have two more base pairs at each end than the integratedcopies, as was observed for retroviruses and the yeast Ty3 retrotransposons (43).Prior to integration into the host genome, the two terminal nucleotides at the 3′

end of these linear intermediates are removed, as was seen with yeast Ty3 retro-transposons and retroviruses. Extrachromosomal circular DNA intermediates ofanother tobaccoTto1retrotransposon have also been identified and characterized(68). Interestingly, analyses on generated models for the secondary and tertiarystructure of the integrase (INT) core domain ofBARE-1retrotransposons, com-prising 80 clones from 28Hordeumaccessions, have revealed that they are similarto structures of human immunodeficiency virus and avian sarcoma virus INs and

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 489

thatHordeumretrotransposon INT has been under purifying selection to maintaina structure typical of retroviral INTs (175). However, further research is neededto determine whether the replication cycle of LTR retrotransposons is the same inplants as in other eukaryotes.

LTR Retrotransposons of the Ty3-gypsy Group

Ty3-gypsyretrotransposons encode the same functions as do Ty1-copiaretrotrans-posons, and they have a very similar genomic organization (Figure 1; Table 1).A significant difference is the order ofint andpol genes within the retrotranspo-son. Comparisons of reverse transcriptase sequences indicate that the Ty3-gypsyand Ty1-copia retrotransposons do form different lineages, so this difference ingene order makes good cladistic sense (207). By sequence and cladistic criteria,it appears that animal retroviruses are derived from Ty3-gypsyretrotransposons,presumably by the acquisition of an envelope (env) function that allowed themto be packaged into viral particles that permit intercellular infectivity (180, 207).When present,envis usually the last protein coding sequence upstream of the 3′

LTR. In some cases, it can be unclear whether a Ty3-gypsy-like element is actuallya retrotransposon or a retrovirus (5, 94). Based on present knowledge, the mechan-ics of intracellular element transcription, replication, and integration are the samefor Ty1-copiaand Ty3-gypsyretrotransposons, and for retroviruses.

LINEs

The LINE elements are simpler than LTR retrotransposons, but still encode manyof the same proteins (Figure 1; Table 1). LINEs havegag and pol genes, butlack an identified integrase. Thegag loci of LINEs usually specify a protein withendonucleolytic activity (EN), and this may be involved in integration of DNAversions of LINEs into chromosomal DNA (26). Alternatively, or additionally,LINEs may integrate into chromosomal DNA via the action of a low-frequencyDNA repair activity (123). By sequence diversity criteria, LINE elements appearto be the oldest class of eukaryotic retrotransposons, and cladistic studies suggestthat the first LTR retrotransposons may have arisen by acquisition of LTRs by aLINE (207).

In plants, little is known about the mechanisms of LINE transcription or inte-gration. Most of the LINEs that have been seen in plants are 5′ truncated, as areabout 95% of human L1 elements (160), suggesting similar transcription, replica-tion, and integration processes. In humans, active L1 elements are transcribed afterrecognition of an internal promoter, perhaps involving participation of both RNApolymerase II and RNA polymerase III (98). The bicistronic L1 mRNA is thentranslated, producinggagandpol proteins. Apparently, the endonuclease protein(EN) encoded bypol then cuts the target site, and homology at this target siteserves to prime reverse transcription of the L1 element bypol (107), subsequentlyleading to integration, presumably via nonhomologous end joining (123).

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?490 KUMAR ■ BENNETZEN

SINEs

The small retrotransposons called SINEs are very different from the other classesof retrotransposons in that they do not encode anytrans-acting transposition func-tions (Figure 1; Table 1). All known SINEs are derived from RNA polymerase IIIproducts (e.g. tRNAs) that appear to have evolved an ability to be efficiently repli-cated and integrated by proteins encoded by LINEs and/or LTR retrotransposons(15, 160). In this regard, SINEs are most like intronless pseudogenes, the mRNAmolecules that are occasionally reverse-transcribed and then inserted into eukary-otic genomes (37, 106). However, unlike the mRNAs that are RNA polymerase IIproducts, genes transcribed by RNA polymerase III usually contain their promotersequences within the RNA-coding region. Hence, if an RNA polymerase III prod-uct is rarely (perhaps inadvertently) utilized as a template to make DNA and thenintegrates into a genome, it will have a promoter that allows essentially normaltranscription. The simplest interpretation of natural selection says that evolutionof such sequences will lead to the origin of SINEs with better and better affinityfor the reverse transcription and integration processes (34, 134). Although it is notclear how SINEs replicate or integrate, the similarity in the varying sizes of theirflanking target DNAs, the shared presence of integrated polyA tails, and their fre-quent correlative abundance with LINEs and pseudogenes suggest that they mayuse LINE-specified functions (15, 39, 133, 179).

RETROTRANSPOSON DISPERSAL AND ORGANIZATIONIN THE PLANT GENOME

Major efforts have been made recently to study the genomic and physical organiza-tion of retrotransposons in gymnosperms and angiosperms. Their replicative modeof transposition and amplification has made retrotransposons highly successful ininvading almost all parts of plant genomes, where they can account for over 50%of the nuclear genome (90, 101, 124, 128, 141, 142, 156, 158, 210).

Chromosomal Locations

In situ hybridization data on metaphase chromosomes and prophase nuclei haverevealed that Ty1-copia retrotransposon sequences are dispersed throughout theeuchromatin, sometimes evenly and sometimes unevenly, depending on the plantspecies and the particular element employed (Figure 2) (17, 61, 124, 140–142).However, many elements are present in lower amounts or absent in certain regions(e.g. centromeres, interstitial and terminal heterochromatic regions, and ribosomalDNA sites) (Figure 2, Figure 3a,b) (17, 61, 124, 140, 141). There are exceptions tothis general observation, though, including elements that are preferentially abun-dant at the terminal heterochromatic regions ofAllium cepa(Figure 2) (142) orthe paracentromeric heterochromatic region ofA. thalianaandCicer arietinum(Figure 2) (17, 61). For example,Athila, which consists of LTRs flanking two

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 491

Ty1-

copi

a

in A

llium

Ty1-

copi

a is

no

t det

ecte

d in

Vic

ia, b

arle

y,

oat,

Pin

us

Ty1-

copi

a

in A

. ind

ora

Ty1-

copi

a &

Ty

3-gy

psy

in m

aize

Ty1-

copi

a &

Ty

3-gy

psy

LIN

Es

in A

rabi

dops

is,

cere

als,

Cic

er

Ty3-

gyps

y in

cer

eals

Not

de

tect

ed

yet

Kno

b D

NA

Nuc

leol

us

orga

nize

r re

gion

Inte

rstit

ial

hete

roch

rom

atic

re

gion

s

Term

inal

he

tero

chro

mat

ic

Cen

trom

ere

Par

acen

trom

eric

reg

ions

Ty1-

copi

a

is n

ot

dete

cted

in

rye

Ty1-

copi

a Ty

3-gy

psy

LIN

Es

Gen

eS

INE

sD

iffer

ent d

ensi

ty li

ne

indi

cate

s di

ffere

nt

retr

otra

nspo

sons

Fig

ure

2D

istr

ibut

ion

ofre

trot

rans

poso

nsin

am

odel

plan

tchr

omos

ome.

Inm

any

case

s,Ty

1-co

pia,

Ty3-

gyps

y,L

INE

and

SIN

Ere

trot

rans

poso

nsar

edi

sper

sed

wid

ely

thro

ugho

utpl

antc

hrom

osom

es.H

owev

er,e

ach

retr

otra

nspo

son

grou

pal

soha

sfa

mili

esth

atar

ecl

uste

red

orde

ficie

ntin

spec

ific

regi

ons

ofch

rom

osom

es.F

orex

ampl

e,so

me

Ty3-

gyps

yre

trot

rans

poso

nsar

ecl

uste

red

inth

ece

ntro

mer

icre

gion

sin

mai

ze,r

ye,a

ndw

heat

,whe

reas

som

eTy

1-co

pia

retr

otra

nspo

sons

are

pres

enti

nth

ete

rmin

alhe

tero

chro

mat

icre

gion

ofA

lliu

mce

pach

rom

osom

esor

abse

ntfr

omth

esa

me

regi

onin

rye

(Sec

ale

cere

ale)

.R

etro

tran

spos

onse

quen

ces

have

not

been

foun

din

the

nucl

eolu

sor

gani

zer

regi

ons

inpl

ants

.M

any

retr

otra

nspo

sons

are

pref

eren

tially

foun

din

euch

rom

atic

regi

ons

whe

rem

ost

gene

sar

elo

cate

d.Se

quen

cean

alys

isof

larg

ege

nom

iccl

ones

inm

aize

has

reve

aled

that

thes

eeu

chro

mat

icre

trot

rans

poso

nsar

em

ostly

loca

ted

inin

terg

enic

regi

ons.

P1: FDO

December 17, 1999 15:5 Annual Reviews AR095-02

?Figure 3 Physical locations of the Ty1-copia (a, b), Ty3-gypsy(c), LINE(d, e), and SINE (f, g) retroelements along metaphase chromosomes ofplants have been shown using in situ hybridization.

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?492 KUMAR ■ BENNETZEN

non-retrotransposon open reading frames (ORFs), is concentrated in the paracen-tromeric heterochromatin ofArabidopsis, where it is associated with the major180-bp centromeric satellite DNA (144).

Recently, several groups have identified retrotransposon sequences in the cen-tromeric regions of several cereal species (Figure 2) (1, 121, 150). Probes withsequence similarity to Ty3-gypsywere isolated from a BAC clone of a sorghumcentromere. The probes hybridized exclusively to centromeric regions of sorghumchromosomes and those of allGramineaetested (121). A barley homologue of thissorghum Ty3-gypsyintegrase sequence was used to clone a barley retrotransposon(cereba) that was subsequently found to be almost exclusively localized withinthe centromeric regions of several cereal species, including barley, wheat and rye(Figure 3c) (150).

The presence of Ty3-gypsyretrotransposon sequences in the centromeric re-gions of all investigated species ofGramineaeis intriguing. This suggests thatthese retrotransposons either are ancient insertions that amplified before the di-vergence of theGramineaeor have independently arrived and/or amplified, plusbecome preferentially located in centromeric regions, in all of these species. Thehigh degree of conservation ofcerebaretrotransposon sequences within the cen-tromeric regions of theGramineaespecies indicates a possible association withcentromeric function. Earlier studies have shown that someDrosophilaretrotrans-posons are stable components of the telomeric and centromeric heterochromatin(146). However, it remains to be determined whether plant retrotransposon se-quences have any direct roles in centromere and/or telomere functions.

Recently, individual full-size copies ofGrande, Zeon-1, PREM-2, RE-10, andRE-15 retrotransposons have been found to interrupt tandemly arranged 180-bpunits of knob DNA associated with cytologically detectable heterochromatic com-ponents of pachytene maize chromosomes (1). Knob DNA in maize is associatedwith several genetic effects, including segregation distortion in the female game-tophyte and late flowering time. Previously, genetic effects associated with knobheterochromatin have been attributed only to the 180-bp repeats. However, the oc-casional retrotransposon insertions could also be involved in attenuation or controlof the multiple genetic effects associated with knob DNA.

LINEs and SINEs, the non-LTR retrotransposons, also have been studied fortheir chromosomal distributions using in situ hybridization. Elements of both typesshow dispersed chromosomal patterns of distribution, but some elements show dis-tinct clustering in specific regions of chromosomes. For example, some LINEs insugar beets are preferentially clustered in the subtelomeric regions of most chromo-some arms (Figure 3d,e) (91). Like the Ty1-copia retrotransposons, these LINEsexhibited almost no hybridization signal in the region of the tandem 18S-5.8S-25SrDNA gene repeats (Figure 3e) (91, 161). In situ hybridization analyses with theSINE element S1Bn of Brassica napusyielded a dispersed pattern of chromoso-mal localization, but also showed hybridizational signal in the centromeric regionsand a tendency to co-localize with rDNA sites (Figure 3f, g) (53). Although S1Bnelements are methylated at a level twice that of the average methylation level of

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 493

theB. napusnuclear DNA, their chromosomal localization was not predominantlyassociated with some extensively methylated regions of chromosomes, like theheterochromatin associated with centromeres and telomeres (53).

From many in situ and genomic sequencing analyses, it initially appeared thatthe percentage of retrotransposons in euchromatic regions as compared to hete-rochromatic regions was generally high. However, these studies were biased byboth the common selection of retrotransposons originally found near genes asthe in situ hybridization probe (38), and by the fact that genomic sequencing ef-forts were focused on genic regions (11, 182). Recent data from the sequencing ofArabidopsiscentromeric regions suggest that retrotransposons are highly enrichednear centromeres, and often arranged as nested series like those seen in intergenicregions of the more complex maize genome (158; R Martienssen, R McCombie& R Wilson, personal communication; http://www.cshl.org/protarab).

Database searches have shown that retrotransposon sequences are present ei-ther within, or close to, many genes. These retrotransposon legacy sequences areoften fragmentary and apparently ancient (19, 202, 203). However, analysis of thedistribution of retrotransposons within a contiguous 240-kb stretch of the maizegenome revealed that most retrotransposons are located between genes (158). Itmay be that many retrotransposons have evolved to transpose primarily into rela-tively inactive regions of plant chromosomes, such as intergenic spacers and otherrepetitive sequences, to avoid mutating genes at a high frequency (158). Otherwise,the host cell would accumulate a lethal level of mutations. In this way, retrotrans-posons could proliferate as dispersed sequences without being deleterious to thehost genome. By minimizing the generally negative (e.g. mutational) effects ofindividual retrotransposons, these elements could actively increase host genomesize.

Retrotransposon Abundance in Plants Relativeto Other Eukaryotes

Genomic sequencing of the 80–90% of theArabidopsisgenome that does not in-clude centromeric heterochromatin, telomeres, or satellite repeats has revealed arelatively low density and simple arrangement of repetitive DNAs. In a contiguousregion of approximately 1.9 Mb on chromosome 4 ofA. thaliana(genome sizeof ca 130 Mbp) (11) a retrotransposon was found about every 130 kb, suggestingthat about 1000 of these elements will be found in the genic regions of the entiregenome. These would comprise about 4% of theArabidopsisgenome, with themore concentrated retrotransposons in the centromeric regions accounting foranother 4–6%. The 4-Mbp genome of the yeastSaccharomyces cerevisiaecon-tains 331 retrotransposon sequences that constitute about 377 kb, or 3.1%, of theyeast genome. These elements belong to five families of LTR retrotransposons;217 Ty1, 34 Ty2, 41 Ty3, 32 Ty4, and 7 Ty5 (87). The non-LTR retrotrans-posons, LINEs and SINEs, appear to be absent from theS. cerevisiaegenome.In contrast, the filamentous fungusMagnaporthe grisea(with a genome size of

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?494 KUMAR ■ BENNETZEN

about 40 Mbp) contains both LTR and non-LTR retrotransposons at copy num-bers varying from a few to 100 (84). InCaenorhabditis elegans, which has agenome size of about 100 Mbp, most of the retroelements appear to be non-LTRretrotransposons, including 59Rte-1family members, of which 9 are full-length(3298 bp) (209). A few Ty3-gypsygroup retrotransposons have been identifiedin this nematode, but it appears that the Ty1-copia group is either absent orrare. Furthermore, there have been no reports of either retrotransposon activity orretrotransposon-induced mutations inC. elegans. Most of the identified retrotrans-posons inC. eleganswere found to be defective (E Berezikov, personal communi-cation). InDrosophila melanogaster, which has a genome size of about 165 Mbp,both LTR-and non-LTR retrotransposons are present. Unlike yeast,C. elegans,andArabidopsis, Drosophilaalso contains active retroviruses (170). To date, thereare at least 23 families of LTR retrotransposons and retroviruses, and at least11 families of non-LTR retrotransposons, that have been identified inDrosophila(http://fly.ebi.ac.uk:7081/transposons/lk/melanogaster-transposon.html). Together,these retrotransposons constitute 5–10% of theDrosophilagenome. In stark con-trast toC. elegans, many of theDrosophilaretrotransposons are active and fre-quently generate insertional mutations.

Interestingly, SINEs are either absent or rare in the genomes ofArabidopsis,C. elegans, andDrosophila. These are likely to be exceptional cases, because closerelatives of these species do contain SINEs. For example,Brassica napus, whichbelongs to the sameCruciferaefamily as doesArabidopsis, contains numerousSINEs (32). It is not clear why these small genome eukaryotes (Arabidopsis, about130 Mbp;C. elegans, about 100 Mbp;Drosophila, about 165 Mbp) are deficientin SINEs. One possibility is that these organisms may have an effective process forremoval of retrotransposons and pseudogenes from their genomes. Alternatively,the relatively low activity of LINEs in these species (exceptDrosophila) mayhave led to a very low frequency of new SINE insertions (39). Further research isrequired to resolve this enigma.

Genome size variation is correlated both with the total mass of retrotrans-posons that are present and with the number of different retrotransposon families.As mentioned above, the 4 MbpS. cerevisiaegenome contains five different LTRretrotransposon families that account for about 3% of the genome (87). InAra-bidopsis, both LTR- and non-LTR retrotransposons have been identified, includingTy1-copia, Ty3-gypsy, and LINE elements, with 1–100 families of each group that,in total, probably account for 4–10% of total nuclear DNA, whereas in maize, thereare thousands of different families of LTR and non-LTR retrotransposons that to-gether account for 70–85% of the nuclear genome (156, 158).

In most mammalian genomes, including the human genome, LINEs and SINEs(e.g. L1 and ALU elements) appear to be the major mobile sequences, representingabout 35% of the total DNA, with up to 100,000 copies of each element type(167). Interestingly, although retroviruses and their remnants, including retroviralelements with deletedenvgene, (i.e. Ty3-gypsy-like elements), are widespread inmammals, LTR retrotransposons of the Ty1-copiagroups appear to be absent or

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 495

rare. In contrast, LINEs, SINEs, Ty3-gypsy, and Ty1-copiaretroelements appear tobe abundant in plants. All in all, plant genomes contain an exceptionally abundantand diverse set of retrotransposons compared to most other eukaryotic genomes,with the exception that functional retroviruses are either absent or present in verysmall numbers.

Local Structures

In most cases, plant retrotransposons have been studied initially as single insertions(12, 20, 56, 69, 70, 72, 76, 78, 91, 101, 128, 131, 155, 187, 188) or as PCR ampli-fied fragments of a population of elements (45, 46, 82, 83, 92, 141, 142, 172, 194).Hence, these studies did not provide much information regarding the local orga-nization of elements within a small chromosomal region. Similarly, standard insitu hybridization experiments only reveal gross general patterns. Recent genomicsequencing studies have begun to provide information on the local organization ofretrotransposons and their interspersion profiles relative to genes.

The first studies of the interspersion patterns of genes with repetitive DNAsin plants were initiated over 25 years ago, utilizing genomic DNA renaturationanalyses (44). It was not known then that most of the repetitive sequences underinvestigation would turn out to be retroelements, primarily LTR retrotransposons.However, these Cot analyses routinely indicated that most of the repetitive DNAswere interspersed with low copy-number, presumably gene-containing, sequences(44, 59). These studies had the disadvantage that they primarily provided resultsthat averaged the actual individual situations across an entire genome.

Genomic sequence analyses have largely confirmed the predictions from Cotstudies. The interspersed repetitive DNAs, mostly retrotransposons and a class ofsmall DNA transposable elements called miniature inverted repeat transposableelements (MITEs) (203), are intermixed with each other and with genes in domainsthat are usually 100 kb or less in size. Many transposable elements, commonlyincluding retrotransposon fragments (203), are actually found within functioninggenes. In small genome plants, these retroelements have usually been observed assingle elements or solo LTRs (11, 24). This simple pattern has also been observedin some gene-rich regions of large plant genomes, like those of maize and barley(105, 136). However, the largest segment of a complex plant genome sequenced todate, an approximate 225-kb region around maizeadh1, has shown a very differ-ent pattern of retroelement arrangement (158, 182). In this region, LTR retrotrans-posons were very abundant, and were found mainly as elements inserted withineach other. The insertions showed some other biases, including a fivefold prefer-ence for insertion into the LTRs rather than the internal retrotransposon domainsand the fact that 17 of 22 were in the same potential transcriptional orientation(158, 182). These nested LTR retrotransposons were found in six blocks, with oneof these blocks possibly interrupting one of the nine putative genes identified inthe area. In contrast, the three LINEs detected in this region were all found withinputative introns of genes (182). Although further genomic sequencing studies will

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?496 KUMAR ■ BENNETZEN

be needed to determine whether this organizational pattern is common in plants,or an exceptional property of the maize genome, it is interesting that the first se-quencing of DNA nearArabidopsiscentromeres has also uncovered a gene-poorregion that is rich in nested retrotransposons (R Martienssen, R McCombie &R Wilson, personal communication; http://www. cshl.org/protarab).

Insertion Site Preferences

All mobile DNAs exhibit some level of insertion site preference. Among theDNA transposable elements, the degree of specificity can vary tremendously.Even the most random insertion mutagens, like Tn5 ofE. coli, P elements ofD. melanogaster, or Mutator of maize, will insert into some genes at frequenciesover tenfold higher than into other genes. Many of these elements exhibit a pref-erence for insertion into genic regions of a genome, and into specific locationswithin or near a gene (e.g. promoters) (10, 86). Moreover, some elements, likeAcof maize, prefer to insert at sites linked to the location of the starting element (35).

Insertion specificities for retroviruses and retrotransposons have been reportedin many different eukaryotes (27, 99), but this question has not been extensivelyinvestigated in plants. In some cases, the preferential integration sites are locatedin actively transcribed chromosomal regions, particularly in or near promoters. Inother cases, preferential integration sites are located in repeated heterochromaticregions. In yeast, most or all retrotransposons show target specificity. For in-stance, the Ty1, Ty2, and Ty3 elements integrate preferentially upstream of genestranscribed by RNA polymerase III, including tRNA, 5S, and U6 genes, whileTy5 primarily inserts in silenced regions of the yeast genome, including silencedmating-type cassettes and telomeric regions (50, 211).

It is not entirely clear how and why retrotransposons target specific regions orsequences of the host genome. Selection of the insertion site may be made by anintegration complex, consisting of retrotransposon-encoded integrase, a reversetranscribed cDNA copy of the element, and various host-encoded factors (27, 99).The most precise information regarding preferential insertions has come fromresearch on yeast retrotransposons. For example, the Ty3 integration complex wasfound to have an affinity with transcription factors that interact with all tRNAgenes (88). Similarly, there is preferential insertion into silent chromatin by Ty5.This specificity can be abolished by a single amino acid change in the integraseprotein (50) or by mutations in nuclear proteins that specifically interact withsilent/heterochromatic regions of the yeast genome (211).

In plants, there is evidence that some retrotransposons show preferential inser-tion into some genomic regions. For example, many Ty1-copia retrotransposonsare found preferentially within euchromatic regions in those plant species studied todate (51, 69, 70). In contrast, some of the cereal Ty3-gypsygroup retrotransposonsaccumulate primarily within heterochromatic regions, such as near centromeresand the knob DNA in maize. Also, Zepp, a LINE-like retrotransposon inChlorella,which accumulates in the telomeric region, integrates preferentially into itself and

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 497

another Zepp sequence (62). It is not clear whether the biased accumulation inany of these cases is due to preferential insertion, or perhaps to a less active pro-cess for removal from these regions and/or less fitness selection acting againstretrotransposons in these regions.

When specificities are seen, some may be shared by many different retrotrans-poson families. For instance, theGrande, Zeon-1, and RE-15 retrotransposonseach were found inserted at the same site in the 180-bp repeat sequences of knobDNA (1). Both Ty1-copia and Ty3-gypsyretrotransposons have been found inintergenic locations within euchromatic regions of the maize genome (158). Incontrast, the low-copy-number Ty1-copiaretrotransposonsTos17of rice andTpv2of common bean,Phaseolus vulgaris, are mostly targeted near and within genes(51, 70). Therefore, it appears that each retrotransposon has evolved its own patternof insertion within a plant’s genome.

In Organellar Genomes

The completed sequences of both chloroplast and mitochondrial genomes fromplants (109, 185) have allowed investigation of whether these additional cellulargenomes contain any retrotransposons or retrotransposon legacies. There have notbeen any reports of retrotransposons or retrotransposon fragments within the smalland gene-dense chloroplast genome of plants. In contrast, mitochondrial genomesappear to contain many fragments of Ty1-copia, Ty3-gypsy, and LINE-like retro-transposons. No intact elements have been seen, and there is no evidence of mu-tations caused by retrotransposon insertion. About 4% or more of theArabidopsismitochondrial genome consists of these retrotransposon fragments, through 9 ormore independent acquisition events (185). It is possible that these fragments ac-cumulated within the relatively large and repeat-rich plant mitochondrial genomesas a passive outcome of DNA acquisition from the nuclear genome (14, 162).

REGULATION OF EXPRESSION AND TRANSPOSITION

The expression of retrotransposons in plants and in other eukaryotic organismsis regulated, thereby regulating their transposition frequency in the host genome.The evolution of control mechanisms for the transcription and transposition ofretrotransposons in the host genome may be crucial to minimize their possibledeleterious effects on the host. Thus, it is not surprising that plant retrotransposonsare transcriptionally silent in most plant tissues during development.

Transcriptional Regulation

Because retrotransposons cannot transpose without the presence of an RNA tem-plate available for reverse transcription, the simplest way to control their activitywould be via regulation of transcriptional initiation. Many retrotransposons showunique patterns of developmental and/or environmental regulation. For instance,

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?498 KUMAR ■ BENNETZEN

transcripts of tobaccoTnt1, barleyBARE-1, and maize PREM-2 have been detectedprimarily in roots, leaves, or young microspores, respectively (148, 172, 173, 184).Likewise, inS. cerevisiaeandDrosophila, the expression of retrotransposons isinduced by hormonal and developmental factors (16, 33).

There are only a few reports on the expression and regulation of non-LTR retro-transposons in plants. For example, the SIBn family of SINEs has been shown tobe expressed in shoots, roots, and callus tissues (31). Moreover, transcriptionalanalysis of SIBn has revealed that the major SIBn RNAs result from cotranscriptionderived from an element present in a POL II transcriptional unit. Only rarely dotissue-specific SIBn transcripts result from the expression of a small number of ge-nomic elements by POL III (31). It appears either that the SIBn POL III promoteris highly repressed inB. napuscells or that transcriptional signals external to theSINE element are necessary in vivo to obtain transcription. Our knowledge of theregulation of LINE expression in plants is limited. In the mouse genome, it has beendemonstrated that LINE-1 expression is under the control of a tightly regulated tem-poral and spatial program of events during development and differentiation (183).

A correlation between transcription and transposition of retrotransposons hasbeen demonstrated for the tobaccoTto1and riceTos17retrotransposons (64, 70).For example, transposition ofTto1andTos17was concomitant with an increase inthe levels of their RNAs, suggesting that transposition of these retrotransposonsis regulated mainly at the transcriptional level. For example, a tenfold increase inthe copy numbers ofTto1 elements was observed in cultured cells, whereTto1transcripts are very abundant (64). A similar pattern of increase in copy number forthe riceTos17elements during tissue culture conditions also has been demonstrated(70). However, this is not the situation for several other retrotransposons. Forexample, barleyBARE-1(an active element in the recent past) is highly transcribedin leaves, but its transposition has not been observed (172). This reminds us that,although transcription is a prerequisite for transposition of an element, other stepsare also required and may be regulated.

Activation by Biotic and Abiotic Stresses

Many of the plant retrotransposons studied to date are transcriptionally activatedby various biotic and abiotic stress factors (55). The expression of the tobaccoTnt1 and Tto1 retrotransposons is greatly increased by several abiotic stresses,including protoplast isolation, cell culture, wounding, methyl jasmonate, CuCl2,and salicylic acid (64, 120, 125, 147, 148, 177, 178). Similarly, various biotic stressfactors such as fungal extracts fromTrichoderma viride(148, 178) or inoculationwith various viral, bacterial, or fungal pathogens (119, 147) have been shownto activate transcription of these retrotransposons. In contrast toTnt1 andTto1,transcription ofTos17is induced only by tissue culture. TobaccoTto5was isolatedas a salicylic acid-inducible retrotransposons by the RT-PCR method (177). Atpresent, it is not known whether abiotic stresses such as cold and heat treatment canalso induce transcription in these retrotransposons. Thus, these results indicate that

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 499

retrotransposons carrycis-acting sequences that control their expression patternsin the host.

Regulatory Sequences

Transcription of LTR retrotransposons is controlled bycis-acting elements in the5′ LTR and adjacent untranslated regions. The structure and function of promotersfromTnt1A,Tto1, andBARE-1have been extensively studied using LTR-GUS tran-scriptional fusions in transformed plants. Transient and stable expression assays ofLTR-GUS fusions containing various lengths of 5′ LTR regulatory sequences haveshown that the majorcis-acting sequences controllingTnt1AandTto1expressionare located within the U3 region of the 5′ LTR (147, 148, 177, 178). In the case ofTnt1, the U3 region contains severalcis-acting sequences, including a short palin-dromic sequence named BI, and a segment of 31 bp tandemly repeated in three orfour copies, named BII. Both BI and BII behave as transcriptional activators. TheBII box binds specific protein factors in tobacco protoplasts (22), and has beenshown to be involved in the transcriptional activation ofTnt1by several biotic andabiotic factors (55, 189).

A similar study on the LTR promoter ofTto1using LTR-GUS constructs hasshown that the 5′ LTR containscis-acting sequences involved in the induction ofTto1expression by abiotic and biotic stress factors (177, 178). Deletion analysison the LTR promoter has shown that enhancer-like sequences upstream from theTATA box are required for a high level of expression in the callus. Recently, bygain-of-function analysis, a 13-bp repeated motif (TGGTAGGTGAGAT) has beenidentified as a positivecis-acting regulatory sequence in the U3 region. This motifis involved in expression in protoplasts and callus, as well as in wound- and methyljasmonate–induced expression in leaves (178). Furthermore, gene expression me-diated by the 13-bp motif was found to be associated with genes that conditionplant defense against microbial attack (178). For example, the complementarysequence of the 13-bp motif contains the box L sequence (also called AC-I) andthe core sequence of the H-box elements, which are highly conserved among thepromoters of phenylpropanoid biosynthesis genes (178).

To gain further insight into the regulation ofTto1 and defense-related genes,cDNAs encoding four different proteins that bind to the 13-bp motif have beenisolated (178). One protein (LBM1, LTR binding MYB) is identical to the pre-viously reported MYB-1, which is induced by virus infection. LBM1 contains aDNA binding domain conserved among the MYB class of transcription factors.Accumulation of LBM1 mRNA was induced rapidly and transiently by cuttingleaf tissues and by elicitors. LBM1 cantrans-activate reporter gene expressiondependent on its binding to the 13-bp motif. A 10-bp sequence (TTGGTAGGTG)containing a conserved motif between box L and the H-box has been shown tobe essential for the binding of LBM1 protein to the 13-bp motif. LBM1 also ac-tivated transcription from the promoter of the phenylalanine ammonia lyase gene(Pv-PAL2) in tobacco protoplasts through binding to the box L and the H-box-like

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?500 KUMAR ■ BENNETZEN

motif (box P). These results suggest that the LBM1 protein may be involved inthe stress response ofTto1and defense-related genes (178; H Hirochika, personalcommunication).

The activation of the expression of tobacco retrotransposonsTnt1 and Tto1by pathogen-derived factors that are involved in the activation of plant defenseresponses is intriguing. Moreover, thecis-acting regulatory motifs of both retro-transposons share similarities with the H-box motif involved in the activation ofseveral plant defense genes (55). The finding that the 13-bp motif contains theL/Ac-I/H-box motif, conserved among promoters of the phenylpropanoid biosyn-thetic genes, raises the question of whether an ancientTto1-like retrotransposoncontributed to acquisition ofcis-regulatory sequences in promoters of some de-fense genes (178). Alternatively,Tto1 may have acquired this sequence from adefense gene, or they could have evolved independently in defense genes andretrotransposons by convergent selection. Takeda and coworkers have analyzedthe promoter of the defense-activated asparagusAoPR1gene (200) and have foundthat it contains an H-box-like motif and several other regions homologous to thecomplementary sequence of theTto1LTR promoter. Interestingly, theAoPR1pro-moter can be activated in a similar fashion to theTto1LTR promoter by abioticand biotic stresses, including tissue culture, wounding, pathogen infection, andsalicylic acid. However, the biological significance of this, as well the origin ofthese regulatory sequences in retrotransposons and host genes, is unclear at presentand warrants further investigation.

LTR-GUS construct analyses have shown that theTnt1andTto1LTR promotersare active in heterologous systems, including transgenicArabidopsis, tomato, orrice and that the expression patterns are similar to those seen in the natural host,tobacco. Moreover, activation of theTnt1andTto1elements by abiotic and bioticstresses is maintained in heterologous hosts (69, 125). However, theTnt1promoter,with expression confined to roots in the homologous tobacco system, was alsoexpressed in the flowers of transgenicArabidopsisand tomato (125), and wasfound to be induced by treatment with auxins inArabidopsis(139). These resultssuggest that, beyond the conservedcis-acting regulators involved in abiotic andbiotic stress induction of these retrotransposons, there are also some host-specificregulators of the expression of theTnt1promoter.

Epigenetic and Posttranscriptional Regulations

Host cells, and possibly the retrotransposons themselves, have evolved mecha-nisms to minimize the negative effects of element retention and transposition. It hasbeen proposed that DNA methylation evolved as a means of containing the spreadof transposable elements and viruses in host genomes (208). This defense modelhas been supported recently by analysis of an interspecific mammalian hybridbetweenMacropus eugeniiandWallabia bicolor in which genome-wide under-methylation of the hybrid genome and a high level of amplification of retroelementsare correlated (132).

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 501

In plants, analyses of repetitive DNAs have routinely shown that they are cy-tosine methylated in the sequences 5′-CG-3′ and 5′-CNG-3′. These methylatedDNAs include those flanking standard plant genes in species like maize (9), andthese methylated sequences have now been shown to be mostly LTR retrotrans-posons (158). Cytosine methylation has been associated with genetic inactivityand a heterochromatic state in many higher eukaryotes, including plants where denovo methylation was first detected during the inactivation of DNA transposableelements of theMutator andAc/Ds families (10, 97). This cytosine methylationhas been associated with both transcriptional inactivity and a higher rate of C to Ttransitions, giving rise to enhanced transcriptional and mutational silencing (157).It is not clear, though, despite many years of study in many species, whether themethylation itself establishes the inactive state of a retrotransposon, or whether itis a secondary (possibly maintenance) outcome of inactivation initially caused bya change in chromatin structure.

The most recent evidence on the role of DNA methylation in the regulation ofretrotransposition comes from a study onTto1by the Hirochika group in Japan (67).It was observed that after the initial active retrotransposition in theArabidopsisgenome,Tto1 became silent. However, the silencing ofTto1 was only observedin those transgenicArabidopsislines whereTto1copy number was increased byactive transposition during culture-mediated transformation process. Thus, thissilencing appears to mimic repeat-induced gene silencing (115). This silencingcorrelated with a reduced level of theTto1 transcript and with methylation of theinactivatedTto1. To investigate the causal relationship between DNA methyla-tion and silencing ofTto1, the hypomethylation mutation,ddm1, was introducedinto the line carrying silencedTto1 copies through conventional crossing. In ahomozygousddm1background, the silencedTto1 became hypomethylated andits transcription was activated. This result suggests that DNA methylation may beresponsible for silencingTto1 in wild-typeArabidopsis(67).

Interestingly, in theDrosophilaandS. cerevisiaegenomes, which lack cytosinemethylation of DNA in general, an inactive chromatin state can be established andmaintained without the need for DNA methylation (36, 58). An even more drasticcontrast is shown by the exceptional case of the chordateCione intestinalis, wheretransposable elements are generally unmethylated and most genes are methylated(166). Hence, DNA methylation is not obligatory for genetic silencing in all eu-karyotes, although a standard structure for inactive chromatin with deacetylatedhistones may be universal (154).

It is not clear how an epigenetic silenced state is either initiated or maintainedfor retrotransposons. In many cases, epigenetic inactivation of an element may oc-cur because of its insertion in or near an already heterochomatic block of DNA, asis the likely case for the insertion of the most abundant maize retrotransposons intomethylated, intergenic blocks (9, 158, 182). Silencing in this case could occur by aprocess analogous to position effect variegation inDrosophila, where movement ofnormally euchromatic genes into a heterochromatic region leads to a progressive ra-diation of the silenced heterochromatic state into the translocated gene(s) (196). In

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?502 KUMAR ■ BENNETZEN

other cases, retrotransposons may be epigenetically modified and/or posttranscrip-tionally suppressed by a process similar to homology-based gene silencing, firstdetected in plants but now also identified in some animal systems (75, 135). Someforms of paramutation (71), the coordinate inactivation of a transposable elementfamily, and homology-based gene silencing all involve correlated DNA methyla-tion and transcriptional inactivation of repeated sequences scattered throughout aplant genome (112, 115). How homologous sequences widely dispersed through agenome are found and inactivated is not clear, nor is it completely clear why repeatslike transposable elements, transgenes, and their chromosomal homologues aretargeted for silencing while normal gene families largely escape this inactivation.Numerous studies are under way in this area (60), and the insights obtained will betremendously valuable to understanding the regulation of plant retrotransposons.

In the yeastS. cerevisiae, several studies have shown that the transpositionof Ty1 elements is controlled at posttranscriptional steps (99). A recent work il-lustrates the concept that conserved cellular functions, such as those involved indifferentiation and DNA repair/recombination, have been adapted by the host forpreventing rampant proliferation of retrotransposons (reviewed in 29). Host genesthat encode Fus3p (a MAP kinase involved in the mating pheromone signal trans-duction pathway) and TFIIH (an RNA polymerase II general transcription factorinvolved in initiation and promoter clearance, and in transcription-coupled repair)have been shown to regulate a posttranscriptional step in retroviral replication inyeast. It appears that Fus3p inhibits the accumulation of Ty1 proteins and thatTFIIH promotes degradation of Ty1 cDNAs. It is not known whether similar hostgene-mediated or other types of posttranscriptional control mechanisms exist forthe regulation of retrotransposition in plants.

Retrotransposon Regulation as a Form of Host Defense

Some retrotransposons might have beneficial effects on a plant, through mutationsthat provide new regulatory properties to a gene (202) or perhaps by contributingto DNA repair (123) or centromere function (121, 150). However, many of theexpression, mutation, and insertion properties of these elements suggest that theireffects are being minimized. With any highly adapted host/parasite interaction,the parasite will contribute as little as possible to decreased host fitness. The hostshould also evolve such minimization/defense processes, and several seem to beacting on plant retrotransposons.

Most of the known retrotransposons appear to be defective in their ability toencode all necessary transposition functions, owing to insertions, deletions, andother mutations (45, 46, 72, 76, 114, 141, 186, 194, 203). The epigenetic regulationof plant retrotransposons associated with DNA methylation and presumed hete-rochromatization may also be involved in keeping retrotransposon transcriptionat a low level (208). In maize, for example, the retrotransposons that make up50–80% of the DNA contribute only 10% or less of the mRNA in most tissuesunder most circumstances (B Bowen, personal communication). Moreover, thisDNA methylation is associated with a two- to threefold higher transition mutation

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 503

rate in these elements, thus causing them to decay to a nonfunctioning form morerapidly than other sequences.

Retrotransposons can inactivate other retrotransposons by inserting within thema structure often seen in the intergenic regions of the maize genome (158). In acompetition between retrotransposons for use of host resources, this inactivation ofone retrotransposon by insertion of another would be an ingenious approach (95).Moreover, the host might benefit from such an insertion pattern, because theseelements would not be generating mutations as often, and thus natural selectioncould act on host factors that regulate this insertion specificity.

PLANT RETROTRANSPOSON ORIGINS AND EVOLUTION

Very little is known about the timing or particular steps in the origin of retro-transposons overall, or about their arrival within a particular kingdom or species.Sequence comparisons of conserved gene products encoded by LINEs and LTRretrotransposons suggest that the first retrotransposons were LINE elements, andthat LTR retrotransposons evolved from them by the acquisition of terminal directrepeats (207). InDrosophila, tandem insertion by theHeT-Aelement suggests howthis process could simply create two flanking LTRs (30, 40). Retroviruses appearto have evolved in animals from a subclass of Ty3-gypsyretrotransposons by ac-quisition of an envelope function (180, 207). Some believe that retrotransposonsoriginated at the beginning of the transition from a putative RNA-based cellulargenome to the DNA-based genome now shared by all living cells (52).

The ubiquity of plant retrotransposons and their extreme sequence heterogeneitysuggests that they were present in the first plants (46, 95, 129, 176, 194). Alterna-tively, retrotransposons may have originated after the creation of the first eukary-otes, and have reached their current wide dispersal by a combination of verticaland horizontal transmission. Nonetheless, they have clearly been highly successfulin proliferating within all higher eukaryotic genomes. Their contributions to plantgenome evolution are many and significant (5, 48, 55, 93, 95, 97, 202). As with anyother DNA sequence present in a plant genome, selective pressures can act to uti-lize that retrotransposon sequence for the benefit of the entire organism, if anysuch benefit can be acquired by chance mutational alterations. Hence, we expectthat retrotransposons or retrotransposon components will occasionally be found tocontribute important factors to plant fitness, as has been seen with the contributionsof these elements to the promoters of many “wild-type” plant genes (203).

The Frequency and Timing of RetrotranspositionDuring Evolution

Although a great deal of information exists regarding the genomic organizationof retrotransposons, little is known concerning the frequency and possible period-icity of element transposition during plant evolution. These questions have beenmore deeply investigated with the DNA transposable elements.Mutatorof maize,

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?504 KUMAR ■ BENNETZEN

for instance, commonly transposes more than once per plant generation, therebyincreasing copy numbers several fold and making many mutations in a single grow-ing season (10, 97). Since the survival and evolution of retrotransposons and thelarge-scale structure of plant genomes are greatly influenced by the transpositionalactivity of retrotransposons, it is important to understand this aspect of retrotrans-poson behavior. Recently, element-anchored PCR analyses have been used to studythe insertional polymorphism of the Ty1-copia group retrotransposons in plants(41, 57, 143, 201). Those studies have shown thatBARE-1insertions, for example,are highly polymorphic in barley, wheat, rye, and oat, suggesting that the trans-positional activity of this retrotransposon has persisted for millions of years inseveral cereal species (57, 201). Recently, a similar study has shown that a fewretrotransposons have been transpositionally active in the recent past in severalPisumspecies (41, 143).

The transpositional activity of plant retrotransposons varies considerably onthe evolutionary time-scale, with some retrotransposons having been active in thedistant past whereas others have been active more recently. A Ty1-copiaelement,R9of rye [about 50,000 copies per genome (140)], is similar toBARE-1of barley[about 70,000 copies per genome (172)]. Gel blot hybridization analyses indicatethat a close relative of theR9subgroup is present in high copy numbers in barley,oat, and wheat. Although Northern analysis on RNAs from seedlings shows thatthe R9/BARE-1subgroup is transcribed in all these cereal plants, the amount oftranscript varies from high in barley, to moderate in wheat and rye, to extremely lowin oat (140). It will be interesting to determine how these retrotransposons havebecome different in transcriptional activity between these closely related cerealhosts, or whether differences in the host transcriptional machinery account for thevariable expression.

A different approach has been used to demonstrate that most retrotransposonsin the maize genome have transposed within the last 2–6 million years (157). Thisstudy was based on calculating the rate of mutations in the LTR sequences ofseveral retrotransposons in theadh1region of maize. As pointed out earlier, thetwo LTRs of a retrotransposon are synthesized from a single template during theirreplication cycle. Thus, the 5′ LTR is usually identical to the 3′ LTR at the time ofa retrotransposon’s insertion into the genome. Therefore, the date of insertion canbe estimated from sequence divergence between the two LTRs, assuming a fairlyconstant rate of nucleotide substitution since the insertion date.

Some retrotransposon families are essentially specific to one genome in plantsof a hybrid origin. For example, in Triticale, an allopolyploid with two wheatgenomes and one rye genome, a rye-origin retrotransposon probe hybridizesmuch more strongly to the rye-origin chromosomes than to those of the wheatgenomes (140). Likewise, in hexaploid oat,Avena sativa(2n = 6x = 42, withA, C, and D genomes), a retrotransposon probe isolated from a diploid A genomespecies hybridizes only weakly to C-genome chromosomes (83). This indicates thatretrotransposition of these elements has been relatively infrequent since the poly-ploid was created. Otherwise the retrotransposons would now be homogeneously

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 505

distributed across all genomes within the same nucleus. Alternatively, some retro-transposons may have evolved to a degree that they are compatible only withinsertion sites within one particular genome, as appears to be the case for retro-transposon amplification in an interspecific wallaby hybrid (132).

Heterogeneity

Fortuitous discovery by genomic sequencing, identification of retrotransposonsas the causal agents in various mutations, and the directed acquisition of retro-transposons by PCR techniques have all yielded information indicating there isa great variety of retrotransposons both within a single plant and between plantspecies. Both LTR and non-LTR retrotransposons, including the Ty1-copiagroup,the Ty3-gypsygroup, LINE, and SINE, are usually present in a plant genome(Table 1) (5, 55, 95, 97). Among these retrotransposons, the Ty1-copiaretrotrans-posons have been studied in most detail, and this has revealed that they are presentas highly heterogeneous populations in all higher plant genomes. For example,when 31 subcloned fragments of the reverse transcriptase genes of the Ty1-copiagroup retrotransposons from potato were sequenced, each was found to be differ-ent, with predicted amino acid similarities between individual fragments varyingfrom 5% to 75% (45). Such extreme heterogeneity has subsequently been observedamongst the reverse transcriptase genes of Ty1-copiagroup retrotransposons fromseveral monocotyledonous and dicotyledonous plant species (45, 46, 89, 114, 128,140–142, 186). BothD. melanogasterand S. cerevisiaepossess a small num-ber of homogeneous elements, with less than 5% divergence within the families(87, 127). These results indicated that plant Ty1-copiagroup retrotransposons aremore heterogeneous than those in many animals and lower eukaryotes.

There could be several reasons for differences in the degree of heterogeneityof Ty1-copiagroup retrotransposons between plants and other eukaryotes. Retro-transposons proliferate by a replicative mode of transposition, with each replicationcycle capable of generating defective elements. The mutation rate during a sin-gle transposition cycle of yeast Ty1 retrotransposons has been estimated to be2.5× 10−5 (49). Additionally, retrotransposons are capable of generating largepopulations in relatively short evolutionary times. In yeast, it has been found thathigh homogeneity in both sequence and size variation of Ty1 elements is due to ahigh level of genomic turnover of retrotransposons in the genome (80, 87). Thus,the majority of retrotransposons in the yeast genome are derived from transposedevents over a relatively recent evolutionary time. In contrast, a plant genome cantolerate accumulation of large amounts of DNA and does not appear to removethese sequences rapidly. It is possible, therefore, that early plants possessed onlya few active populations of retrotransposons that subsequently proliferated intolarge populations in the host genomes. The mixed populations of active and de-fective retrotransposons then accumulated mutations over time, and eventuallygave rise to highly heterogeneous populations. If, as appears likely (45, 46, 114,187, 191), the majority of plant retrotransposons have been transmitted vertically

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?506 KUMAR ■ BENNETZEN

into subsequent generations, with horizontal transfer between species being veryrare, then heterogeneous lineages should predominate. Another feature, whichmay contribute to the high level of heterogeneity in plant retrotransposons, is thedistinctive strategy for determination of germline in plants. Unlike animals, plantsdo not have a sequestered germline; they have meristematic cells that are capableof differentiating into somatic tissues or reproductive organs throughout the lifecycle. This means that somatic tissues, with mutations accumulated in them duringplant growth, can give rise to germ cells, as long as mutations are not lethal. In ani-mals, because the germ cells are selected early in embryogenesis, it may be that theanimal germ cells are better protected against mutation. All of these phenomenaare likely to be responsible for the current heterogeneity of plant retrotransposons.

Heterogeneity generated in the 5′ LTR sequences and adjacent leader sequencescan have effects on the evolution of retrotransposons. Indeed, extensive sequencevariation has been found among the LTRs and adjacent leader sequences ofcopiaretrotransposons inD. melanogasterpopulations (28). However, the functional sig-nificance of the sequence variation has not yet been demonstrated. In the humanretrovirus-like element, RTVL-H, it has been shown that the LTRs are functionallydiverse and can promote the expression of a linked gene (42). In plants, the tobaccoTnt1 retrotransposons members have been extensively studied for sequence het-erogeneity of both coding domains and U3 regulatory sequences from a number ofNicotianaspecies (23, 190). This study has revealed that there are three differentsubfamilies ofTnt1 elements,Tnt1A, Tnt1B, andTnt1C, that differ completelyin their U3 regions but share conserved flanking regions (190). U3 divergencebetween the three subfamilies is found in the region that contains the regulatorysequences of an activeTnt1-94retrotransposon. On the basis of these data, the au-thors have proposed that this high sequence variability could allow these elementsto evolve regulatory mechanisms in order to optimize their coexistence with theirhost genome. LINEs, SINEs, and Ty3-gypsyelements are also found as heteroge-neous populations in plant genomes, but detailed investigations of their possibleorigins and evolution have not been performed yet.

Internal Mutation

In most plants, the majority of the retrotransposons in the genome have structuressuggesting that they are defective. Many members of a retrotransposon familydiffer by large internal insertions or deletions (72, 111). In some cases, as with the5′ ends of LINE elements, truncated insertions suggest that the internal variationwas generated during transposition. The relatively low replication accuracy ofreverse transcriptase will also increase the rate of internal mutation.

In many cases, defective retrotransposons will continue to persist and evenamplify within a genome. By sequence criteria, many LTR retrotransposons appearto be lacking in some essential component for transpositon, such as the absenceof apol coding potential inBs1(76). The absence of complete coding capacity forone of these elements does not necessarily mean that it will not transpose.Bs1itselfwas first detected as a de novo insertion mutagen, indicating that it had utilized

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 507

POL and any other possible missing functions from some other retrotransposon(76, 78). Similarly, SINEs do not appear to encode any proteins, therefore requiringtrans-encoded factors for every enzymatic or packaging step in their transpositionprocess (133). It is not clear how broad the specificity is for an enzyme encodedby one retrotransposon to act on another retrotransposon of a different family.The similar structure of the target repeats flanking LINEs, SINEs, and intronlesspseudogenes suggests that they use the same mechanism for integration into thegenome, presumably the endonuclease encoded by LINEs (39), although they mayalso be inserted as repair templates at sites of double-stranded DNA breaks (123).The observation that LINEs, SINEs, and intronless pseudogenes all seem to becommon in some species (e.g. humans), while all are relatively rare in other species(e.g. maize) concurs with a LINE-endonuclease model. LTR retrotransposons mayshow some preference for use of their own integrase, but it is unlikely that thisbias is absolute. However, the knowledge that retrotransposons do not need to befully intact in order to transpose does not indicate how or why they are so variablein sequence and so often defective.

A second class of mutation seen for several LTR retrotransposons has beenan apparent deletion of some standard retrotransposon genes, often accompaniedby the presence of additional internal sequences (18, 72, 76, 77, 111, 182, 191).In many cases, these defective retrotransposons have been identified in multiplecopies or as de novo insertions, indicating that they can retrotranspose by the useof trans-acting factors. One class of internal mutation is observed in the case ofsolo LTRs, where an apparent unequal exchange has led to the loss of one LTRand all internal sequences (25, 158, 165, 191). It is not known whether other lossesof internal sequences might be due to ectopic recombination or through a standardDNA breakage/repair mechanism. Sequence acquisitions may occur by ectopicgene conversions like those proposed for theMutator elements of maize (10), orby recombination between an mRNA-derived template and the retrotransposonwithin the replication/integration complex (16).

Regardless of their origin(s), internal variation within retrotransposons canprovide the raw material for either new retrotransposon properties or new capabil-ities acquired by the host genome. Such changes could explain sudden bursts inretrotransposon activity, perhaps by random mutation, to create a retrotransposonpromoter with a new tissue specificity (e.g. in “germline” cells) and/or a higherexpression level. The relative inaccuracies of reverse transcriptases provides anindication that internal variation is programmed into the system, and many host“defense” activities (e.g. cytosine methylation) also will increase variability. Thesealtered elements should be selected for their ability to amplify and persist withina genome, while natural selection acting on the host will utilize this variabilitywhenever possible.

Retrotransposons and Retroviruses

Although LTR retrotransposons are strikingly similar to retroviruses in both ge-nomic organization and their coding sequences, they lack sequences encoding the

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?508 KUMAR ■ BENNETZEN

envelope (ENV) protein (Figure 1). Thus, the retrotransposon replication cycle isrestricted to the cell of the host genome. Theenvgene is required by retrovirusesfor cell-to-cell transfer, thereby allowing infectivity. Until recently, retroviruseswere thought to be restricted to vertebrates. The first conclusive evidence for thepresence of a retrovirus in an invertebrate,Drosophila melanogaster, was reportedin 1994. A LTR retrotransposon of the Ty3-gypsygroup (mdg-4) was shown topossess a functionalenvgene, and has now been reclassified as a retrovirus orendogenous retrovirus (170). As pointed out earlier, it is believed that retroviruseshave evolved from the Ty3-gypsygroup retrotransposons by acquiring theenvgene or vice versa (94, 177, 204). In plants, Ty3-gypsyretrotransposons have beenfound with additional sequences in the region usually occupied byenv, but it is notknown whether they have anyenv-like gene function (24, 72, 76, 206). Recently,a Ty1-copiagroup retrotransposon has been found in soybean that has novelenv-like sequences in the usualenvregion of the retrotransposon (100). This suggeststhat the acquisition ofenv-like and/orenv-localized sequences in Ty3-gypsyandTy1-copiagroups has occurred independently or occurred once in one element andthen was transferred to the others by recombination (94). Recombination betweenretrotransposons appears to be a more common occurrence than was consideredpreviously (79).

The presence of putativeenv-like sequences in plant LTR retrotransposonsis intriguing. In animals, theenvgene is involved in transferring virus particlesfrom one cell to another through the plasma membrane by a receptor-mediatedendocytosis. In plants, a scenario similar to that seen in animals is unlikely to occurbecause of the plant cell wall (94). Enveloped viruses might move from cell to cell inplants via the intercellular channels called plasmodesmata. Virus-encoded proteinsthat modify the molecular size exclusion limit of plasmodesmata (21) usually assistthe movement of plant viruses through plasmodesmata. Such transfer proteins arenot known to be encoded by retrotransposons. Perhaps some of theenv-localizedfunctions that appear to be encoded by some LTR retrotransposons in plants couldperform such a transfer function.

It is possible that the putativeenv-like genes in plants have evolved along dif-ferent lines, leading to ENV-like proteins with either a different function or nocurrent specific role in plants. Future work on this subject should reveal the pre-cise function, if any, of the putativeenv-like gene in plants. In animals, retrovirusesare responsible for some of most deadly diseases known, but such pathogenicityof retroviruses is not known in plants. It may be important to determine what it isabout the biology of plants that has restricted (or even eliminated) the pathogenic-ity of retroviruses in the absence of an identified humoral or cellular immunesystem.

Horizontal Transmission?

Phylogenetic analyses based on sequences of the reverse transcriptase and inte-grase genes from both Ty1-copiaand Ty3-gypsyretrotransposons in plants have

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 509

provided strong evidence, with some exceptions, for the vertical transmission ofretrotransposons (45, 46, 114, 187, 190). Simple inheritance thus appears to be themost frequent mode of transmission. In contrast, the horizontal transmission ofplant retrotransposons has not been demonstrated. This is not surprising, becausemost or all plant retrotransposons lack the envelope (env) gene that is required byretroviruses for cell-to-cell transmission.

Nonetheless,env-like gene sequences have been found in some plant retrotrans-posons. One possible explanation for this result would be that these sequences arelegacies ofenv genes that functioned in another species, perhaps in an insect(170). If an insect retrovirus, for instance, were transferred to a plant cell duringthe feeding process, it is possible that it would prove to be a competent intracel-lular retrotransposon. The success of some human-assisted horizontal transfers ofretrotransposons from one plant species to another indicates that the host factorsneeded for retrotransposition are often conserved in higher plants (69, 108, 139),so natural horizontal transfers should have an equal chance of success. A possibleorigin from insect retroviruses is suggested for many plant retrotransposons be-cause of their transcriptional activation by stress (55). This activation makes sensefor a virus, because damage-associated stress (e.g. insect feeding) suggests that avector is available for transfer to the next possible host.

Similarly, it is possible that retrotransposons could be passively hijacked, in rareevents, by true retroviruses or by some other invading plant pathogen, pest, or en-doparasite (7, 94). Given that eukaryotic cells, including those of plant protoplasts,have the capacity to take up foreign nucleic acids into their genomes, perhaps thechief limitation to horizontal transmissions will be the small number of cells thatcontribute to the next generation (i.e. the germline). However, horizontal transfersof retrotransposons that occur as rarely as once every few million years can betremendously important, particularly if the acquired element has the potential toamplify up to thousands of copies per nucleus (141, 156). Conclusive evidencefor a recent horizontal transfer ofcopia retrotransposon betweenDrosophilamelanogasterandD. willistoni has been demonstrated (80a). Although, the modeof this horizontal transfer is unknown since thecopiaelement lacks the envelopegene and therefore thought to be not infectious. The circumstantial evidence sug-gests that parasitic mites may be involved in vectoring DNA betweenDrosophilaspecies (80a).

One likely mode for the horizontal transfer of retrotransposons from one plantspecies into another would be by wide crosses. Such crosses can lead to at leasttransient loss of epigenetic control (i.e. suppression) of transposable elements(86, 132). Because many plant species use relatively nonspecific gametic inter-change processes (e.g. wind-assisted or insect-assisted pollination), usually unpro-ductive crosses between different species may be particularly common in plants.Rarely, such a wide cross may give rise to an allopolyploid (102), thereby bringingthe retrotransposon populations from two species into the same resultant species.More commonly in chance matings between two different species, the fertiliza-tion is not productive and/or embryogenesis aborts. In some exceptional cases,

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?510 KUMAR ■ BENNETZEN

however, the two genomes of very different species (e.g. maize and oats) may co-exist in the same nucleus for a few plant generations (2). Eventually, one parentalset of chromosomes may be eliminated, but the transfer of a retrotransposon fromone genome to another may occur in the interim.

CONTRIBUTIONS OF RETROTRANSPOSONS TO THEEVOLUTION OF PLANT GENES AND GENOMES

Gene Mutation

Retrotransposons are involved in generating mutations through insertions near orwithin genes. Both LTR and non-LTR retrotransposons have generated mutationsin plants: some examples for the Ty1-copia, Ty3-gypsy, LINE, and SINE retro-transposons are listed in Table 2. Many retrotransposon insertions within and inclose proximity to genes affect their expression in a negative fashion by decreasingor abolishing transcription of the gene or by detrimental alterations in transcriptprocessing and/or stability. In other cases, however, insertion of retrotransposonsequences within or near a gene has more complex effects on gene expression, in-cluding alterations of temporal and spatial patterns of transcription or the structureof the resultant protein (48, 97, 202).

The splicing of retrotransposon sequences can sometimes restore proper synthe-sis of a protein (187) or a truncated protein (60, 110). Insertion of a retrotransposoncan also result in gross changes in the prematuration of a transcript. Intron removalby splicing, using a cryptic splice site or sites within the element, will sometimesyield a protein product that differs from the wild type, usually having reducedfunctionality. For example, one of three retrotransposon-mutated maizewaxyalle-les,wxG, has an altered pattern of tissue-specific expression, with thirtyfold moreenzymatic activity in pollen than in the endosperm (110). It was demonstrated thatthe increase in Waxy enzymatic activity in pollen as compared to the endospermwas due to tissue-specific differences in RNA processing (i.e. there is approx-imately 30-fold more correctly spliced RNA in pollen than in the endosperm).Therefore, thewxGallele provides a good example of a retrotransposon insertionthat has resulted in tissue-specific alternative splicing.

Many plant genes currently considered wild type contain legacies of previ-ous retrotransposon insertion (Table 2) (202, 203). For example, over 30 genesfrom both dicotyledenous and monocotyledenous plants were found to have an-cient, degenerate retrotransposon insertions in 5′ or 3′ flanking regions, basedon computer-assisted database searches using Ty1-copia group retrotransposonsas query sequences (202, 203). This is probably a major underestimate, partlybecause elements that have greatly diverged will not be recognized. As databasesgrow and search programs become more sophisticated, most (perhaps all) plantgenes will likely be found to contain some transposable element legacy withintheir regulatory and/or structural components.

Three of four genes in the pearbcSfamily have the same retrotransposon se-quence in the promoter region, suggesting that the insertion predated the gene

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 511

duplication events. The combined expression level of three element-associatedrbcSgenes is 30–50% lower than that of the gene that does not contain the retro-transposon insertion, strongly suggesting that the 829-bp insertion sequence har-bors a negative regulator of transcription (203). Thus, it appears that retrotrans-poson insertions have been important contributors to the establishment of novelpatterns of transcription for a wide range of plant genes, over 100 reported sofar. The presence of remnants of retrotransposon sequences within and in closeproximity to genes suggests that these sequences have evolved and have been fixedunder selective pressures for regulatory roles (117).

Compared to the relatively intact retrotransposons found in recent insertion mu-tations or in the intergenic regions between maize genes (56, 65, 76, 158), manyof the retrotransposon sequences found within or close to wild-type genes appearto be truncated or rearranged (202, 203). Older retrotransposon-induced mutationsin maize often contain defective elements (151, 187, 193, 203). The degeneratestructure of these elements is probably due to a combination of their age and theprocess of natural selection. Once inserted near or within a gene, the retrotrans-poson sequences would be selected for a possible positive effect on that gene’sfunction.

Interestingly, the retrotransposons that are found to cause insertional mutationsin plants are usually present in the host genome at relatively low copy numbers.In maize, for example, theBs1retrotransposon is found in one to five copies (78),Hopscotchin two to six copies (203), andMagellanin four to eight copies (151).Similarly, the mutationalTnt1andTto1 families in tobacco andTos17family inrice are also present in relatively low copy numbers [about 1–100 (56, 64, 65, 70)].On the other hand, the abundant maize retrotransposons such asOpie, Grande,Ji, andHuck (each with copy numbers of 10,000 or more) have not been asso-ciated with any characterized maize mutations. Thus, it has been postulated thatthere might be a cause-and-effect relationship between retrotransposon familycopy number and the propensity to insert into genes (158). Very abundant fam-ilies of retrotransposons in maize andBARE-1(about 50,000 copies) in barleyappear to display a target site preference for intergenic regions and for previouslytransposed retrotransposons (158, 174). Alternatively, less fitness selection actingagainst retrotransposons in these regions may be responsible for such distributionpatterns of these high copy retrotransposons. The target site preference and activeselection hypotheses are not mutually exclusive.

Genome Size

Retrotransposons play a major role in determining the size of plant genomes (93).For example, small genomes in plants likeArabidopsis(1C equal to about 130Mbp) might be the consequence of a lack of retrotransposon proliferation (205).On the other hand, large genomes in plants like faba bean and maize (1C equal toabout 128,000 Mbp and 3200 Mbp, respectively) might be the result of success-ful colonization and amplification of retrotransposons (141, 158). Indeed, severaldifferent families of retrotransposons have amplified and attained copy numbers

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?512 KUMAR ■ BENNETZEN

TA

BL

E2

Exa

mpl

esof

the

cons

eque

nces

ofre

trot

rans

poso

nin

sert

ion

inor

near

plan

tgen

es

Gen

eR

etro

tran

spos

onP

lant

Com

men

tsR

efer

ence

Age

neex

pres

sed

Tna

1N

icot

iana

Inse

rtio

nw

ithin

age

neth

atis

155

inre

spon

seto

polli

natio

n(G

ypsy

)al

ata

expr

esse

din

the

styl

ein

resp

onse

topo

llina

tion

orto

uch;

chim

eric

tran

scri

pt;t

his

gene

isno

tess

entia

lfo

rfe

rtili

tyan

dis

loca

ted

abou

t10

kbup

stre

amof

the

S 6-R

Nas

ege

ne

Alc

ohol

dehy

drog

enas

eB

s1M

aize

Inse

rtio

nin

toex

on9

ofth

eA

dh1

gene

;76

,78

(Adh

1-S5

446)

(Cop

ia)

inac

tivat

ion

ofth

ege

ne;m

utan

tph

enot

ype:

ally

lalc

ohol

-res

ista

ntpo

llen

Ant

hocy

anin

Cin

4(L

INE

)M

aize

Inse

rtio

nw

ithin

codi

ngre

gion

;alte

red

162

pigm

entg

ene

stru

ctur

eof

the

A1

gene

;no

mut

ant

(A1)

phen

otyp

e

Caf

feic

acid

B5

Mai

zeIn

sert

ion

near

the

junc

tion

ofth

e3′

192

O-m

ethy

ltran

sfer

ase

(Cop

ia?)

codi

ngre

gion

ofth

eC

OM

Tge

ne(C

OM

T)

intr

on;a

ltere

dtr

ansc

ript

sbo

thin

size

and

amou

nt;e

xhib

itsre

ddis

hbr

own

mut

antp

igm

enta

tion

ofth

ele

afm

idri

b

Cat

alas

ep-

SIN

E-1

Ric

eIn

sert

ion

betw

een

two

exon

s,ex

on-2

and

74ex

on-3

,of

Cat

Age

ne;p

SIN

E-1

elem

ent

beha

ves

asin

tron

inC

atA

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 513

Dm

3C

opia

-lik

eL

ettu

ceIn

sert

ion

ofin

ters

pers

edco

pia-

like

118

Gen

ere

gion

retr

otra

nspo

son

sequ

ence

sin

the

Dm

3ge

nere

gion

that

issu

rrou

nded

bya

fam

ilyof

resi

stan

cege

neca

ndid

ates

(RG

C2)

;ret

rotr

ansp

oson

-med

iate

dre

arra

ngem

ents

Glu

telin

Wis

2-1A

Whe

atIn

sert

ion

with

inco

ding

regi

onof

the

124

(Glu

-1)

(Cop

ia)

Glu

1ge

ne;n

otr

ansc

ript

Her

oge

ne(G

ypsy

)To

mat

oIn

sert

ion

ofin

ters

pers

edgy

psy-

like

—re

gion

retr

otra

nspo

son

sequ

ence

sin

the

regi

onof

Her

oge

nefa

mily

Nitr

ate

redu

ctas

eT

nt1

Toba

cco

Inse

rtio

nsin

toex

on2

&3;

alte

red

56(N

R)

(Cop

ia)

tran

scri

pts

ofth

eN

Rge

ne;N

Rde

ficie

ntm

utan

ts

Nitr

ate

redu

ctas

eT

np2

Nic

otia

naIn

sert

ion

into

exon

;alte

red

tran

scri

pts

55(N

R)

(Cop

ia)

plum

bagi

nifo

lia

ofth

eN

Rge

ne;N

Rde

ficie

ntm

utan

ts

Nod

ule-

spec

ific

gene

Tm

s1M

edic

ago

Inse

rtio

nin

toin

tron

9w

hich

may

188

(Nm

s-25

)(C

opia

)sa

tiva

bere

spon

sibl

efo

rdo

wns

trea

mde

letio

ns;

retr

otra

nspo

son-

med

iate

dre

arra

ngem

ents

Ory

zasa

tiva

Tos1

7R

ice

Inse

rtio

nin

toex

on4;

notr

ansc

ript

;15

9ho

meo

box

(OSH

15)

(Cop

ia)

defe

cts

inin

tern

ode

elon

gatio

n

(con

tinu

ed )

PAP

2T

SB

ellp

eppe

rIn

sert

ion

into

the

first

intr

onof

the

PAP

214

9(S

INE

)ge

ne;s

plic

ing

defe

ctre

sulti

ngin

inco

mpl

etel

ysp

liced

tran

scri

pts;

nom

utan

tphe

noty

pe

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?514 KUMAR ■ BENNETZEN

TA

BL

E2

Gen

eR

etro

tran

spos

onP

lant

Com

men

tsR

efer

ence

Poly

gala

ctur

anas

eP

RE

M-2

Mai

zeIn

sert

ion

atth

eid

entic

al5′

flank

ing

184

(Pol

gal)

(Cop

ia)

posi

tion

ofth

ree

Polg

alge

nes

ofa

mul

tigen

efa

mily

mem

bers

;no

know

nef

fect

son

gene

expr

essi

on

Star

chph

osph

oryl

ase

Tst

1Po

tato

Inse

rtio

nof

the

elem

enti

nth

e20

(Cop

ia)

oppo

site

orie

ntat

ion

toth

atof

the

gene

with

inin

tron

5;un

impa

ired

tran

scri

pt;n

om

utan

tphe

noty

pe

Self

-inc

ompa

tibili

tyM

elm

oth

B.o

lera

cea

Inse

rtio

nw

ithin

tran

scri

bed

regi

ons;

138

(Slo

cus)

(Cop

ia)

notr

ansc

ript

;thi

sin

sert

ion

mut

atio

nan

alys

isre

veal

edth

atfu

nctio

nalS

LA

gene

isno

treq

uire

dfo

rth

eS1

resp

onse

inB

rass

ica

Soyb

ean

fung

alT

gmr

Soyb

ean

Inse

rtio

nin

the

flank

ing

regi

onof

the

12re

sist

ance

(Cop

ia)

Rps

1-k

gene

;no

know

nm

utan

t(R

ps1-

K)

phen

otyp

es

(con

tinu

ed )

Wax

y-M

(wx-

M)

Mag

ella

nM

aize

Inse

rtio

nw

ithin

wx

exon

3ju

st2

bp15

1lo

cus

(Gyp

sy)

dow

nstr

eam

ofth

ein

tron

2/ex

on3

splic

eju

nctio

n;w

axy

mut

ant

phen

otyp

e.

a

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 515

Wx

Ston

orSt

onor

Mai

zeIn

sert

ion

with

inw

xex

on11

0lo

cus

(Cop

ia)

ofth

ein

tron

5/ex

on6

splic

eju

nctio

n;nu

llfo

xw

xex

pres

sion

inpo

llen;

reta

ins

norm

alex

pres

sion

inth

een

dosp

erm

Xa2

1R

etro

fitR

ice

Inse

rtio

nco

ding

regi

onof

171

(are

cept

or-l

ike

(Cop

ia)

Xa2

1fa

mily

mem

ber

Dle

ads

toki

nase

)a

trun

cate

dpr

otei

n w

ith

Xa2

1ETr

unca

tor

Ric

eIn

sert

ion

with

inco

ding

regi

onof

171,

197

(Cop

ia)

Xa2

1fa

mily

mem

ber

Ele

ads

toa

trun

cate

dpr

otei

n w

ith n

o ac

tivity

;ge

nera

tion

ofa

rece

ptor

-lik

eO

RF

stru

ctur

ally

sim

ilar

toth

eto

mat

ofu

ngal

gene

Cf-

9

27kD

aze

inZ

eon-

1M

aize

Inse

rtio

nin

tosi

te1.

1kb

upst

ream

of72

(mai

zest

orag

e(G

ypsy

)th

eA

TG

stan

dard

27kD

aze

inge

ne;

prot

ein)

mito

ticre

arra

ngem

ent;

redu

ced

leve

lsof

stor

age

prot

ein

inm

atur

ese

ed

dow

nstr

eam

Din

to th

e

res

ista

nce

activ

ity; t

hepr

edic

ted

enc

odes

a n

ew c

lass

of

LR

R r

esis

tanc

e ge

ne p

rodu

cts,

lack

tran

smem

bran

e an

d a

kina

sedo

mai

n.

pred

icte

d

and

a ki

nase

sim

ilar

to th

e to

mat

o ge

ne P

TO

.

prot

ein

dise

ase

ing

a

part

ial

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?516 KUMAR ■ BENNETZEN

greater than 10,000 in the maize genome. The total copy number of all types ofmaize retrotransposons is estimated to be approximately 300,000, thereby compris-ing about 50–80% of the maize nuclear genome (156). An estimated divergencetime between maize and sorghum is about 15–20 million years ago, and bothhave 10 chromosome pairs, but the maize nuclear genome is 3–4 times longer thanthe sorghum genome (3). Molecular hybridization and sequence analysis data haverevealed that these two grass genomes are very similar in the content and order oftheir genes (6, 73, 122). However, the nearly two dozen retrotransposons that makeup over 70% of the region surrounding the maizeadh1gene are all absent fromthe orthologousadh region in sorghum (4, 156) and most are not detected in thesorghum genome by gel blot hybridization (9). Hence, these results indicate thatthe increase in maize genome size was largely due to an extensive proliferationof retrotransposons. This result also suggests that these elements inserted afterthe divergence of maize and sorghum, a suggestion that was confirmed by datingexperiments that indicated these retrotransposons had all arrived and/or amplifiedin the maize genome within the last 2 to 6 million years (157).

It is possible that analogous patterns of retrotransposon distribution and orga-nization will be found in the genomes of other plant and animal species that havehigh retrotransposon copy numbers. Genomic sequencing in large genome eukary-otes continues to be limited primarily to gene-rich regions, so we do not know thearrangement of DNAs in the largest genomic component of any large genome.However, preliminary sequence analysis of the DNA adjacent toArabidopsiscen-tromeres has uncovered regions rich in LTR retrotransposons, arranged in a nestedorganization like that observed in maize (R Martienssen, R McCombie & R Wilson,personal communication, http://www. cshl.org/protarab).

Although retrotransposons have played a major role in the expansion of plantgenomes (8, 93), it is also likely that many retrotransposon sequences have beeneliminated by various mechanisms, such as unequal recombination between theLTRs of an LTR retrotransposon (to generate solo LTRs) (25, 158, 165, 191) orother types of deletion (145). Nonetheless, the dispersed organization of retro-transposons makes it unlikely that known genomic elimination processes suchas large deletions can effectively remove a significant amount of the genomicsequences within a relatively short evolutionary time period without having dele-terious affects on the survival of the host by concurrent elimination of the genesthat are interspersed with retrotransposons (8, 95).

Genome Rearrangement

Retrotransposons can rearrange genomes with or without transpositional compe-tence. At least one retrotransposon,Bs1, has acquired a portion of another gene, andthen amplified it by transposition throughout the maize genome (18, 77, 78). Thisis highly significant, as it shows how genes or parts of genes might be amplifiedand dispersed in genomes. Similarly, the rare action of retrotransposon reversetranscription and integration functions on normal cellular mRNAs has created

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 517

intronless pseudogenes in plants (37, 106). These amplified gene sequences canserve as the raw material for the evolution of new genes.

Because of their high copy numbers and dispersal throughout plant genomes,retrotransposons could serve as sites of unequal or ectopic recombination. Thelegacies of such unequal recombinations between the LTRs of a single retro-transposon have been noted many times in plants by the existence of solo LTRs(25, 158, 165, 191). Even in methylated, presumably heterochromatic regionswhere these elements often accumulate, rare solo LTR are observed (158). Ectopicrecombination between retrotransposons of the same family that are in direct ori-entation on the same chromosome can cause reciprocal duplications and deletions,whereas unequal recombinations between elements on the same chromosome thatare in opposite orientation will cause a chromosomal inversion of the sequencesbetween the two retrotransposons. Ectopic recombination between two elementson different chromosomes can cause reciprocal translocations. These types ofretrotransposon-mediated genome rearrangement have been observed in yeast andDrosophilaas products of unequal recombination (104, 204). In plants, however,the structural and mechanistic foundations of such chromosomal rearrangementshave not yet been studied in great numbers, so we do not yet know how frequentlythey can be attributed to retrotransposon involvement.

Roles for Retrotransposons?

In most eukaryotic organisms, centromeres, telomeres, and other heterochromaticregions of chromosomes contain large numbers of retrotransposons, making itpossible that retrotransposons have acquired some structural and/or functionalroles in these heterochromatic regions in plants. Indeed, a role ofHeT-A andTART retrotransposons in buffering the shortening of chromosome ends has beenclearly demonstrated inDrosophila(137).

The exceptional conservation of a particular family of retrotransposons in allgrass centromeric regions suggests that these elements are involved in centromerefunction (121, 150). Similarly, the presence of retrotransposon fragments withinthe regulatory regions of genes indicates that they have evolved to a form that isinvolved in specific gene regulation. Because LTR retrotransposons, for instance,carry a promoter within each LTR, it is not surprising that their insertions shouldsometimes cause new patterns of gene expression or at least have the potential toevolve into new patterns of gene expression (48, 202). It was partly this capacity ofmobile DNAs to serve as controlling elements that caused McClintock to believethat their major role in evolution was to rearrange genomes under stresses so severethat only a rare individual with a totally changed genome might survive (116).Although this model does not appear to make evolutionary sense, it is certain thatthe presence of transposable elements within a genome will allow them to be usedby their hosts in whatever way might be beneficial, even if the elements themselvesare present primarily for selfish or parasitic reasons (34, 134). Future studies willindicate how plant genomes have evolved to utilize retrotransposon sequences.

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?518 KUMAR ■ BENNETZEN

RETROTRANSPOSONS AS GENETIC TOOLS

Molecular Markers for Phylogenetic, Biodiversity,and Genetic Linkage Analyses

There are several advantages in using retrotransposon sequences as molecularmarkers. They are ubiquitous, present in high copy numbers as highly heteroge-neous populations, are widely dispersed on chromosomes, and show insertionalpolymorphism both within and between species in plants (95). Furthermore, activeretrotransposons will produce new insertions in the genome, leading to polymor-phism. The new insertions may then be detected and used to temporally orderinsertion events in a lineage, helping to establish phylogenies. These genetic prop-erties have recently allowed retrotransposons to be exploited as DNA markersto study biodiversity in maize, pea, and barley (41, 81, 152) and to generate ge-netic linkage maps in barley and pea (41, 95, 198, 199, 202). Furthermore, becausemany retrotransposons are widely distributed within the euchromatin domains ofchromosomes, it should be possible to generate markers linked to agronomicallyimportant genes. Indeed, retrotransposonTnd-1 has provided a marker that islinked to black root resistance in tobacco (85).

One of the most useful retrotransposon-based marker systems in revealing largenumbers of highly polymorphic markers is sequence-specific amplification poly-morphism (SSAP). SSAP is a multiplex amplified fragment length polymorphisms(AFLP)-like technique that displays individual retrotransposon insertion as bandson a sequencing gel. Moreover, SSAP-based markers appear to be better for esti-mating phylogenetic relationship in plants compared with the conventional AFLP-based makers, because one of the primers is based on specific retrotransposonsequences (41). Furthermore, a multi-retrotransposon approach has been recentlyused to estimate phylogenetic relationships within species and between speciesin legume (143) and cereal (57) plants. This approach is particularly informativebecause each element has a unique transpositional history. For example, retroele-ments that have transposed in the recent past should be extremely polymorphicwithin a species, and these can be used for genetic linkage and intraspecific geneticdiversity studies. Some that were active only a million years ago or more shouldbe useful for determining relationships between species or even between genera.

In the past, a major hurdle in using the retrotransposon-based SSAP method wasthe limited numbers of LTR sequences available. The recent development of a newtechnique for rapid isolation of plant Ty1-copia retrotransposon LTR sequencesfor molecular studies (143) has made the application of a retrotransposon-basedSSAP method highly accessible to all plant species. The SSAP method should alsobe applicable for determining the integration patterns of specific elements in thehost genome by subcloning, sequencing, and analyzing SSAP bands.

A PCR-based approach to detect individual retrotransposon insertions has beendeveloped using primers derived from the element and its flanking DNA (47, 51).This method produces a dominant marker system, where the different allelic states

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 519

(i.e. presence and absence of the retrotransposon insertion) at a locus can be re-vealed. A major advantage of this retrotransposon-based insertion polymorphism(RBIP) method is that it could be fully automated for high throughput markeranalysis (47). Sequence analysis of clones from an enriched barley microsatellitelibrary has revealed that retrotransposons sequences are intimately associated withmicrosatellites (153). This property has been exploited to develop a molecularmarker method, REMAP (Retrotransposon-Microsatellite Amplified Polymorp-phism), that can be used to distinguish between barley varieties and to producefingerprint patterns for species across a genus (81).

Non-LTR retrotransposons also have been used as molecular markers to studygenetic diversity at the species and genus levels (63, 126). For example, ricep-SINE1elements have been used to demonstrate thatpSINE-1-r2transposed tothewx locus after the divergence ofOryza rufipogonandO. sativafrom theOryzaancestor that carried the AA genome. In contrast,pSINE-1-r1transposed at anearlier time to another position in thewx locus before the divergence of variousrice species (63). Also,BrassicaSINE elements have been used to study geneticdiversity inCruciferaespecies and to determine the extent to which introgressionhas occurred between cultivated and wild species inBrassica(103; JM Deragon,unpublished data).

Gene Tagging and Functional Analysis of Genes

Some retrotransposons have features that make them ideal genetic tools for genetagging in plants (65). These include (a) retrotransposon-mediated insertion muta-tions that are stable because they transpose by a replicative mode, (b) transpositiontarget sites that are unlinked to the site of the original copy, making it relativelyeasy to generate a large collection of random insertions for saturation mutagenesis,(c) transposition that can be regulated by abiotic and biotic stress conditions,(d) high mutagenicity due to preferential transposition into genes, (e) low copynumbers, thereby facilitating the identification of the retrotransposon insertion re-sponsible for a specific mutation, and (f ) endogenous retrotransposons that areubiquitous in plants and may be active in many species. In rice, the endogenousTos17retrotransposon meets all the conditions mentioned above for its use in tag-ging genes.Tos17is highly active under tissue culture conditions and preferentiallyinserts into or near genes (65). It has been used to generate a large collection ofrandom insertions for saturation mutagenesis and has been used to clone a numberof genes in rice (65; H Hirochika, unpublished data). Additionally,Tos17has beenused in a reverse genetic approach to identify null mutant lines for theOSH15gene(Oryza sativahomeobox; aknotted-type homeobox gene), thereby demonstratingthatOSH15has a role in rice internode development (159).

Evaluation of Somaclonal Variations

Several techniques for genetic manipulation and vegetative propagation of plantsare dependent upon in vitro plant regeneration systems. However, in vitro

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?520 KUMAR ■ BENNETZEN

manipulation processes are prone to somaclonal variations, which may be highlyundesirable (70). Somaclonal variation is the collective term given to the highfrequency of mutations encountered among plants regenerated from cell and tis-sue cultures under in vitro conditions. There is now good evidence to show thata significant percentage of mutations among in vitro manipulated plants are dueto the induced activation of retrotransposons during cell and tissue culture condi-tions. For example, over 10% of regenerated rice plants possessed retrotransposon-induced mutations (70; H Hirochika, unpublished data). The retrotransposon-based molecular markers described earlier provide an excellent tool for evaluatingretrotransposon-induced mutations among in vitro manipulated plants. Addition-ally, a better understanding of the regulation of expression and transposition ofretrotransposons should give some clues to avoiding somaclonal variation amongin vitro manipulated plants.

CONCLUSIONS AND FUTURE PROSPECTS

We now know that plant retrotransposons are responsible not only for generatingmutations in plant genomes but also for significantly increasing the size of manyplant genomes. Their replicative modes of transposition have equipped them withthe ability to rapidly proliferate within their hosts, allowing them to become amajor component of higher plant genomes. Retrotransposons are also one of themost fluid of genomic components, varying greatly in copy number, genomiclocalization, and sequence structure over relatively short evolutionary times. Theseproperties make retrotransposons one of the major forces influencing the structureof plant genomes. Although their initial and major continuing motivation appearsto be selfish and/or parasitic, retrotransposons will continue to be used wheneverpossible by their hosts.

Mutations caused by retrotransposon insertions within or in close proximity togenes can result in gene inactivation or alterations of the expression patterns ofthe genes, or the structure of the encoded proteins. Many gene promoters con-tain fragments of retrotransposons that now contribute to that gene’s regulation.Ectopic recombination between these mobile sites of homology could amplify ordecrease genome size and gene family number. However, it remains to be provenwhether any retrotransposon-induced changes have been positively selected innatural populations.

For the future, many questions remain to be answered. These can be conve-niently separated into four categories: origins, specificities, effects, and uses.There is no definitive information on the origin of any plant retrotransposon fam-ily. Are some LTR retrotransposons actually defective insect retroviruses that havearrived in plant genomes via horizontal transfer? Or, do horizontal transfers ofintracellular elements occur through wide crosses or by rare incorporation of se-quences from a pathogen or parasite? Perhaps horizontal transfers do not happen atall, or at least do not account for many of the elements. Their differential presenceand/or copy numbers may be determined by whether they have amplified or have

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 521

been lost in one species or another. At a more detailed level, how do retroelementswithin a family change; how fast and by what mechanisms? These will be diffi-cult questions to answer, partly because their great diversity, high copy numbers,unpredictable amplification potential, and rapid evolutionary change can easilyobscure simple patterns of element relatedness or origin. However, these prob-lems will be partly offset by the huge amounts of genome sequence data that willbecome available in upcoming years.

Numerous studies have documented specificities for retrotransposon expres-sion and insertion. Several groups have begun characterizing the regulation ofLTR promoters by transacting factors associated with host developmental andstress response pathways. Studies of epigenetic gene regulation, including thosepertinent to mobile DNAs and transgenes, also will expand our understanding ofretrotransposon biology. We have not yet identified any factors in plants that mightdetermine the different insertion specificities of different retrotransposon families.We also do not know the degree to which a retrotransposon can use transacting fac-tors from other retroelements for replication and insertion. What is the relationshipbetween an element’s transcription and its potential for transposition?

The effects of retrotransposons on gene and genome structure, including genomesize, have been well documented. However, how often do retrotransposons am-plify to high copy numbers? What are the processes that remove retrotransposonsfrom a genome, and how rapidly do they act? How often do retrotransposonsinsert into or near genes in standard plant populations, and how often/rapidly dothese sequences evolve into a form that is utilized by the affected gene? Howoften have retrotransposons been used for non-genic tasks, such as contributinga functional component to centromeres? If retrotransposons are used for variousroles in genome structure or function, how did the elements change to permit thisuse, and how does their use differ between species?

Finally, we are just beginning to see some of the ways that plant retrotransposonscan be used for the study and improvement of plants. Their outstanding potentialas genetic tools for plant genome analysis, linkage mapping, phylogeny and ge-netic biodiversity studies, gene tagging, functional analysis of genes, evaluationof somaclonal variants, and in gene transfer processes appear to be most promis-ing. Understanding the possible horizontal transfer of retrotransposons could alsobe very valuable, perhaps to set an upper limit for the rate at which transgenesmight escape from genetically engineered crops. Despite comprising the majorityof many plant genomes and their many roles in the mutation and evolution of genesand genomes, only a handful of laboratories investigate these elements. Much fu-ture work is needed, and the tools are all in hand, so we can expect continuing greatgains in our understanding of plant retrotransposons for the foreseeable future.

ACKNOWLEDGMENTS

We thank all of our colleagues for providing their published and unpublishedpapers, especially Michael Ashburner, Andreas Bachmir, Eugene Berezikov, BenBowen, Jean-Marc Deragon, Tom Eickbush, Noel Ellis, David Finnegan, Andy

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?522 KUMAR ■ BENNETZEN

Flavell, Marie-Angele Grandbastien, Pat Heslop-Harrison, Hirohiko Hirochika,Howard Laten, Rob Martienssen, John McDonald, Norihiro Okada, Steve Pearce,Ingo Schubert, Alan Schulman, Daniel Voytas, and Susan Wessler. We are gratefulto Sue and Hiro for their comments on the manuscript, and to Pat and Ingo forthe LINE, SINE, andgypsypictures, respectively. AK acknowledges the ScottishOffice Agricultural, Environmental and Food Department and EC BiotechnologyFramework IV for financial support. JLB acknowledges financial support from theUSDA CSREES program (97-35300-4594).

Visit the Annual Reviews home page at http://www.AnnualReviews.org

LITERATURE CITED

1. Ananiev EV, Phillips RL, Rines HW. 1998.Complex structure of knob DNA on maizechromosome 9. Retrotransposon invasioninto heterochromatin.Genetics149:2025–37

2. Ananiev EV, Riera-Lizarazu O, Rines HW,Phillips RL. 1997. Oat-maize chromosomeaddition lines: a new system for mapping themaize genome.Proc. Natl. Acad. Sci. USA94:3524–28

3. Arumuganathan K, Earle ED. 1991. Nu-clear DNA content of some importantplant species.Plant Mol. Biol. Rep.9:208–19

4. Avramova Z, Tikhonov A, SanMiguel P, JinYK, Lui C, et al. 1996. Gene identifica-tion in a complex chromosomal continuumby local genomic cross-referencing.Plant J.10:1163–68

5. Bennetzen JL. 1996. The contributions ofretroelements to plant genome organisation,function and evolution.Trends Microbiol.4:347–53

6. Bennetzen JL. 1998. The structure and evo-lution of angiosperm nuclear genomes.Curr.Opin. Plant Biol.1:103–8

7. Bennetzen JL. 1999. Transposable elementcontributions to plant gene and genome evo-lution. Plant Mol. Biol.In press

8. Bennetzen JL, Kellogg EA. 1997. Do plantshave a one-way ticket to genomic obesity?Plant Cell9:1509–14

9. Bennetzen JL, Schrick K, Springer PS,

Brown WE, SanMiguel P. 1994. Activemaize genes are unmodified and flanked bydiverse classes of modified, highly repetitiveDNA. Genome37:565–76

10. Bennetzen JL, Springer PS, Cresse AD,Hendrickx M. 1993. Specificity and regu-lation of theMutator transposable elementssystem in maize.Crit. Rev. Plant Sci.12:57–95

11. Bevan M, Bancroft I, Bent E, Love K, Good-man H, et al. 1998. Analysis of 1.9 Mb ofcontiguous sequence from chromosome 4of Arabidopsis thaliana. Nature 391:485–88

12. Bhattacharyya MK, Gonzales RA, Kraft M,Buzzell RI. 1997. Acopia-like retrotranspo-sonTgmrclosely linked to theRps1-k allelethat confers race-specific resistance of soy-bean toPhytophthora sojae.Plant Mol. Biol.34:255–64

13. Bingham PM, Zachar Z. 1989. Retrotrans-posons and FB elements fromDrosophilamelanogaster. In Mobile DNA, ed. by DEBerg, MM Howe, pp. 485–502. Washing-ton, DC: Am. Soc. Microbiol.

14. Blanchard JL, Schmidt GW. 1995. Pervasivemigration of organellar DNA to the nucleusin plants.J. Mol. Biol.41:397–406

15. Boeke JD. 1997. LINEs and Alus—thepolyA connection.Nat. Genet.16:6–7

16. Boeke JD, Corces VG. 1989. Transcrip-tion and reverse-transcription of retrotrans-posons.Annu. Rev. Microbiol.43:403–34

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 523

17. Brandes A, Heslop-Harrison JS, KammA, Kubis S, Doudrick RL, Schmidt T.1997. Comparative analysis of the chro-mosomal and genomic organization ofTy1-copia-like retrotransposons in pteri-dophytes, gymnosperms and angiosperms.Plant Mol. Biol.33:11–21

18. Bureau TE, White SE, Wessler SR. 1994.Transduction of a cellular genes by a plantretroelement.Cell 77:479–80

19. Bureau TE, Ronald PC, Wessler SR. 1996.A computer-based systematic survey re-veals the predominance of small inverted-repeat elements in wild-type rice genes.Proc. Natl. Acad. Sci. USA93:8524–29

20. Camirand A, St-Pierre B, Morineau C,Brisson N. 1990. Occurrence of acopia-like transposable element in one of the in-trons of the potato starch phosphorylasegene.Mol. Gen. Genet.224:33–39

21. Carrington JC, Kasschau K, Mahajan SK,Schaad MC. 1996. Cell-to-cell and long-distance transport of viruses in plants.Plant Cell8:1669–81

22. Casacuberta JM, Grandbastien MA. 1993.Characterisations of LTR sequences in-volved in the protoplast specific expressionof the tobaccoTnt1 retrotransposon.Nu-cleic Acid Res.21:2087–93

23. Casacuberta JM, Vernhettes S, Audeon C,Grandbastien MA. 1997. Quasispecies inretrotransporons: Sequence variability inTnt1evolution.Genetica100:109–17

24. Chavanne F, Zhang DX, Liaud MF, CerffR. 1998. Structure and evolution ofCy-clops: a novel giant retrotransposon of theTy3/Gypsyfamily highly amplified in peaand other legume species.Plant Mol. Biol.37:363–75

25. Chen M, SanMiguel P, Bennetzen JL.1998. Sequence organization and conser-vation in sh2/a1-homologous regions ofsorghum and rice.Genetics148:435–43

26. Cost GJ, Boeke JD. 1998. Targeting of hu-man retrotransposon integration is directedby the specificity of the L1 endonuclease

for regions of unusual DNA structure.Bio-chemistry37:18081–93

27. Craig NL. 1998. Target site selection intransposition.Annu. Rev. Biochem.66:437–74

28. Csink AK, McDonald JF. 1995. Analysis ofCopia sequences variation within and be-tweenDrosophilaspecies.Mol. Biol. Evol.12:83–93

29. Curcio MJ, Garfinkel DJ. 1999. New linesof host defense: inhibition of Ty1 retro-transposon by Fus3p and NER/TFIIH.Trend Genet.15:43–45

30. Danilevskaya ON, Arkhipova JR, TraverseKL, Pardue ML. 1997. Promoting in tan-dem: the promoter for telomere transposonHeT-Aand implications for the evolution ofretroviral LTRs.Cell 88:647–55

31. Deragon JM, Gilbert N, Rouquet L, LenoirA, Arnaud P, Picard G. 1996. A transcrip-tional analysis of the S1Bn (Brassica na-pus) family of SINE retroposons.PlantMol. Biol. 32:869–78

32. Deragon JM, Landry BS, Pelissier T, Tu-tois S, Tourmente S, Picard G. 1994. Ananalysis of retroposition in plants based ona family of SINEs fromBrassica napus. J.Mol. Evol.39:378–86

33. Ding D, Lipshitz H. 1994. Spatially regu-lated expression of retroviruses-like trans-posons duringDrosophila melanogasterembryogenesis.Genet. Res.64:167–81

34. Doolittle WF, Sapienza C. 1980. Self-ish genes, the phenotype paradigm andgenome evolution.Nature284:601–3

35. Dooner HK, Belachew A. 1989. Transpo-sition pattern of the maize elementAcfromthebz-m2(Ac) allele.Genetics122:447–57

36. Dorer D, Henikoff S. 1994. Expansionsof transgene repeats cause heterochro-matin formation and gene silencing inDrosophila. Cell 77:993–1002

37. Drouin G, Dover GA. 1987. A plant pro-cessed pseudogene.Nature328:557–58

38. Edwards KJ, Veuskens J, Rawles H, DalyA, Bennetzen JL. 1996. Characterization offour dispersed repetitive DNA sequences in

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?524 KUMAR ■ BENNETZEN

Zea maysand their use in constructing con-tiguous DNA fragments using YAC clones.Genome39:811–17

39. Eickbush TH. 1992. Transposing withoutend: the non-LTR retrotransposable ele-ments.New Biol.4:430–40

40. Eickbush TH. 1997. Telomerase and retro-transposons: Which came first?Science277:911–12

41. Ellis THN, Poyser SJ, Knox MR, VershininAV, Ambrose MJ. 1998. Polymorphism ofinsertion sites of Ty1-copia class retro-transposons and its use for linkage and di-versity analysis in pea.Mol. Gen. Genet.260:9–19

42. Feuchter A, Mager D. 1990. Functionalheterogeneity of a large family of humanLTR-like promoters and enhancers.Nu-cleic Acids Res.18:1261–70

43. Feuerbach F, Drouaud J, Lucas H. 1997.Retrovirus-like end processing of the to-baccoTnt1retrotransposon linear interme-diates of replication.J. Virol. 71:4005–15

44. Flavell RB, Bennet MD, Smith JB, SmithDB. 1974. Genome size and proportionof repeated nucleotide sequence DNA inplants.Biochim. Genet.12:257–69

45. Flavell AJ, Smith DB, Kumar A. 1992.Extreme heterogeneity of Ty1-copiagroupretrotransposons in plants.Mol. Gen.Genet.231:233–42

46. Flavell AJ, Dunbar E, Anderson R, PearceSR, Hartley R, Kumar A. 1992. Ty1-copiagroup retrotransposons are ubiquitous andheterogeneous in higher plants.NucleicAcids Res.20:3639–44

47. Flavell AJ, Knox MR, Pearce SR, El-lis THN. 1998. Retrotransposon-based in-sertion polymorphisms (RBIP) for highthroughput marker analysis.Plant J. 16:643–50

48. Flavell AJ, Pearce SR, Kumar A. 1994.Plant transposable elements and thegenome.Curr. Opin. Genet. Dev.4:838–44

49. Gabriel A, Willems M, Mules EH, BoekeJD. 1996. Replication infidelity duringa single cycle ofTy1 retrotransposition.

Proc. Natl. Acad. Sci. USA93:7767–7150. Gai X, Voytas DF. 1998. A single amino

acid change in the yeast retrotransposonTy5 abolishes targeting to silent chromatin.Mol. Cell 1:1051–55

51. Garber K, Bilic I, Pusch O, Tohme J,Bachmair A, et al. 1999. TheTpv2familyof retrotransposons ofPhaseolus vulgaris:structure, integration characteristics, anduse for genotype classification.Plant Mol.Biol. 39:797–807

52. Gesteland RF, Atkin JF. 1993.The RNAWorld. New York: Cold Spring HarborPress

53. Goubely C, Arnaud P, Tatout C, Heslop-Harrison JS, Deragon J-M. 1999. S1 SINEretroposons are methylated at symmetricaland non-symmetrical positions inBrassicanapus: identification of preferred targetsite for asymmetrical methylation.J. Mol.Biol. 39:243–55

54. Grandbastien M-A. 1992. Retroelementsin higher-plants.Trends Genet.8:103–8

55. Grandbastien M-A. 1998. Activation ofplant retrotransposons under stress condi-tions.Trends Plant Sci.3:181–87

56. Grandbastien M-A, Spielmann A, CabocheM. 1989. Tnt1, a mobile retroviral-liketransposable element of tobacco isolatedby plant cell genetics.Nature337:376–80

57. Gribbon BM, Pearce SR, Kalendar R,Schulman A, Paulin L, et al. 1999. Phy-logeny and transpositional activity of Ty1-copia group retrotransposons in cerealgenomes.Mol. Gen. Genet.261:883–91

58. Grunstein M. 1998. Yeast heterochro-matin: regulation of its assembly and in-heritance by histones.Cell 93:325–28

59. Hake S, Walbot V. 1980. The genome ofZea mays, its organization and homologyto related grasses.Chromosoma79:251–70

60. Henikoff S, Comai L. 1998. Trans-sensingeffects.Cell 93:329–32

61. Heslop-Harrison JS, Brandes A, Taketa S,Schmidt T, Vershinin AV, et al. 1997. TheChromosomaldistributions of Ty1-copiagroup retrotransposable elements in higher

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 525

plants and their implications for genomeevolution.Genetica100:197–204

62. Higashiyama T, Noutoshi Y, Fujie M, Ya-mada T. 1995. Zepp, a LINE-like retro-transposon accumulated in theChlorellatelomeric region.EMBO J.16:3715–23

63. Hirano HY, Mochizuki K, Umeda M, Oht-subo H, Ohtsubo E, Sano Y. 1994. Retro-transposition of a plant SINE into thewxlocus during evolution of rice.J. Mol. Evol.38:132–37

64. Hirochika H. 1993. Activation of to-bacco retrotransposons during tissue-cul-ture.EMBO J.12:2521–28

65. Hirochika H. 1997. Retrotransposons ofrice: their regulation and use for genomeanalysis.Plant Mol. Biol.35:231–40

66. Hirochika H, Fukuchi A, Kikuchi F. 1992.Retrotransposon families in rice.Mol. Gen.Genet.233:209–16

67. Hirochika H, Kakutani T, Okamoto H.1999. Silencing of the tobacco retrotrans-posonTto1in Arabidopsisand reactivationby DNA hypomethylation mutation.PlantCell. In press

68. Hirochika H, Otsuki H. 1995. Extra chro-mosomal circular forms of the tobaccoretrotransposonTto1. Gene165:229–32

69. Hirochika H, Otsuki H, Yoshikawa M,Otsuki Y, Sugimoto K, Takeda S. 1996.Autonomous transposition of the tobaccoretrotransposonTto1 in rice. Plant Cell8:725–34

70. Hirochika H, Sugimoto K, Otsuki Y, Tsug-awa H, Kanda M. 1996. Retrotransposonsof rice involved in mutations induced bytissue culture.Proc. Natl. Acad. Sci. USA93:7783–88

71. Hollick JB, Dorweiler JE, Chandler VL.1997. Paramutation and related allelic in-teractions.Trends Genet.13:302–8

72. Hu W, Das OP, Messing J. 1995.Zeon-1,a member of a new maize retrotransposonfamily. Mol. Gen. Genet.248:471–80

73. Hulbert SH, Richter TE, Axtell JD, Bennet-zen JL. 1990. Genetic mapping and charac-terization of sorghum and related crops by

means of maize DNA probes.Proc. Natl.Acad. Sci. USA87:4251–55

74. Iwamoto M, Nagashima H, Nagamine T,Higo H, Higo K. 1999.p-SINE1–like intronof theCatAcatalase homologs and phylo-genetic relationships among AA-genomeOryza and related species.Theor. Appl.Genet.98:853–61

75. Jensen S, Gassama MP, Heidmann T. 1999.Taming of retrotransposable elements byhomology-dependent gene silencing.Nat.Genet.21:209–12

76. Jin Y-K, Bennetzen JL. 1989. Struc-ture and coding properties ofBs1, amaize retrovirus-like transposon.Proc.Natl. Acad. Sci. USA86:6235–39

77. Jin Y-K, Bennetzen JL. 1994. Integrationand non-random mutation of a plasmamembrane proton ATPase gene fragmentwithin theBs1retroelement of maize.PlantCell 6:1177–86

78. Johns MA, Mottinger J, Freeling M. 1985.A low copy number,copia-like transposonin maize.EMBO J.4:1093–102

79. Jordan IK, McDonald JF. 1998. Evidencefor the role of recombination in regulatoryevolution ofSaccharomyces cerevisiaeTyelements.J. Mol. Evol.47:14–20

80. Jordan IK, McDonald JF. 1999. Tempo andmode of Ty element evolution inSaccha-romyces cerevisiae. Genetics151:1341–51

80a. Jordan IK, Matyunina LV, McDonald JF.1999. Evidence for the recent horizontaltransfer of an LTR retrotransposon.Proc.Natl. Acad. Sci. USA66: In press

81. Kalendar R, Grob T, Regina M, SuoniemiA, Schulman A. 1999. IRAP and REMAP:Two new retrotransposon-based DNA fin-gerprinting techniques.Theor. Appl. Genet.98:704–11

82. Kamm A, Doudrick RL, Heslop-HarrisonJS, Schmidt T. 1996. The genomic andphysical organisation of Ty1-copia-like se-quences as a component of large genomesin Pinus elliottii var. elliottii and othergymnosperms.Proc. Natl. Acad. Sci. USA93:2708–13

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?526 KUMAR ■ BENNETZEN

83. Katsiotis A, Schmidt T, Heslop-HarrisonJS. 1996. Chromosomal and genomic or-ganisation of Ty1-copia-like retrotranspo-son sequences in the genusAvena. Genome39:410–17

84. Kempken F, Kuck U. 1998. Transposons infilamentous fungi-facts and perspectives.BioEssays20:652–59

85. Kenward KD, Bai D, Ban MR, BrandleJE. 1999. Isolation and characterisation ofTnd-1, a retrotransposon marker linked toblack root resistance in tobacco.Theor.Appl. Genet.98:387–95

86. Kidwell MG, Lisch D. 1997. Transposableelements as sources of variation in ani-mals and plants.Proc. Natl. Acad. Sci. USA94:7704–11

87. Kim JM, Vanguri S, Boeke JD, Gabriel A,Voytas DF. 1998. Transposable elementsand genome organisation: A compara-tive survey of retrotransposons revealedby the completeSaccharomyces cerevisiaegenome sequence.Genome Res.8:464–78

88. Kirchner J, Connolly CM, Sandmeyer SB1995. Requirement of RNA-polymerase-III transcription factors forin vitro posi-tion-specific integration of a retrovirus-likeelement.Science267:1488–91

89. Konieczny A, Voytas DF, Cummings MP,Ausubel FM. 1991. A superfamily ofAra-bidopsis thalianaretrotransposons.Genet-ics127:801–9

90. Kossack DS, Kinlaw CS. 1999. IFG,a gypsy-like retrotransposon inPinus(Pinaceae), has an extensive history inpines.Plant Mol. Biol.39:417–26

91. Kubis SE, Heslop-Harrison JS, Desel C,Schmidt T. 1998. The genomic organisa-tion of non-LTR retrotransposons (LINEs)from threeBetaspecies and five other an-giosperms.Plant Mol. Biol.36:821–31

92. Kuipers AGJ, Heslop-Harrison JS, Jacob-sen E. 1998. Characterisation and phys-ical localisation of Ty1-copia-like retro-transposons in fourAlstroemeriaspecies.Genome41:357–67

93. Kumar A. 1996. The adventures of the Ty1-copiagroup of retrotransposons in plants.Trends Genet.12:41–43

94. Kumar A. 1998. The evolution of plantretroviruses: moving to green pasture.Trends Plant Sci.3:371–74

95. Kumar A, Pearce SR, McLean K, Harri-son G, Heslop-Harrison JS, et al. 1997.The Ty1-copia group of retrotransposonsin plants: genomic organisation, evolution,and use as molecular markers.Genetica100:205–17

96. Kumekawa N, Ohtsubo H, Horiuchi T, Oht-subo E. 1999. Identification and charac-terisation of novel retrotransposons of thegypsytype in rice.Mol. Gen. Genet.260:593–602

97. Kunze R, Saedler H, Lonnig WE. 1997.Plant transposable elements.Adv. Bot. Res.27:331–470

98. Kurose K, Hata K, Hattori M, Sakaki Y.1995. RNA polymerase III dependence ofthe human L1 promoter and possible par-ticipation of the RNA polymerase II factorYY1 in the RNA polymerase III transcrip-tion system.Nucleic Acids Res.23:3704–9

99. Labrador M, Corces VG. 1997. Transpos-able element-host interactions: regulationof insertion and excision.Annu. Rev. Genet.31:381–404

100. Laten HM, Majumdar A, Gaucher EA.1998. SIRE-1, a copia/Ty1-like retroele-ment from soybean, encodes a retroviralenvelope-like protein.Proc. Natl. Acad.Sci. USA95:6897–902

101. Leeton PR, Smyth DR. 1993. An abun-dant LINE-like element amplified in thegenome ofLilium speciosum. Mol. Gen.Genet.237:97–104

102. Leitch IJ, Bennett MD. 1997. Polyploidy inin angiosperms.Trends Plant Sci.2:470–75

103. Lenoir A, Cournoyer B, Warwick S, PicardG, Deragon JM. 1994. Evolution of SINES1 retroposons inCruciferaeplant species.Mol. Biol. Evol.14:934–41

104. Lim JK, Simmons JM. 1994. Grosschromosome rearrangements mediated

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 527

by transposable elements inDrosophilamelanogaster. BioEssays16:269–73

105. Llaca V, Messing J. 1998. Amplicons ofmaize zein genes are conserved withingenic but expanded and constricted in in-tergenic regions.Plant J.15:211–20

106. Loguercio LL, Wilkins TA. 1998. Struc-tural analysis of a hmg-coA reductasepseudogene: insights into evolutionaryprocesses affecting thehmgrgene familyin allotetraploid cotton (Gossypium hir-sutumL). Curr. Genet.34:241–49

107. Luan DD, Korman MH, Jakubczak JL,Eickbush TH. 1993. Reverse transcrip-tion of R2Bm RNA is primed by a nickat the chromosomal target site: a mecha-nism for non-LTR retrotransposition.Cell72:595–605

108. Lucas H, Feuerbach F, Kunert K,Grandbastien M-A, Caboche M. 1995.RNA-mediated transposition of the to-bacco retrotransposonTnt1 in Arabidop-sis thaliana. EMBO J.14:2364–73

109. Maier RM, Neckermann K, Igloi GL,Kossel H. 1995. Complete sequence of themaize chloroplast genome, gene content,hotspots of divergence and fine tuning ofgenetic information by transcript editing.J. Mol. Evol.25:614–28

110. Marillonnet S, Wessler SR. 1997. Retro-transposon insertion into the maizewaxygene results in tissue-specific RNA pro-cessing.Plant Cell9:967–78

111. Marillonnet S, Wessler SR. 1998. Ex-treme structural heterogeneity among themembers of a maize retrotransposon fam-ily. Genetics150:1245–56

112. Martienssen RA. 1996. Epigenetic phe-nomena: paramutation and gene silenc-ing in plants.Curr. Biol. 6:810–13

113. Martinez-Izquierdo JA, Garcia-MartinezJ, Vicient CM. 1997. What makesGrande1 retrotransposon different?Ge-netica100:15–28

114. Matsuoka Y, Tsunewaki K. 1999. Evo-lutionary dynamics of Ty1-copia groupretrotransposons in grass shown by re-

verse transcriptase domain analysis.Mol.Biol. Evol.16:208–17

115. Matzke AJM, Matzke MA. 1998. Posi-tion effects and epigenetic silencing ofplant transgenes.Curr. Opin. Plant Biol.1:142–8

116. McClintock B. 1984. The significance ofresponses of the genome to challenge.Sci-ence226:792–801

117. McDonald JF, Matyunina LV, Wilson S,Jordan IK, Brown NJ, Miller WJ. 1997.LTR retrotransposons and evolution ofeukaryotic enhancers.Genetica100:3–13

118. Meyers BC, Chin DB, Shen KA, Sivara-makrishnan S, Lavelle DO, et al. 1998.The major resistance gene cluster in let-tuce is highly duplicated and spans severalmegabases.Plant Cell10:1817–32

119. Mhiri C, De Wit PJGM, GrandbastienM-A. 1999. Activation of the promoter ofthe Tnt1 retrotransposon in tomato afterinoculation with the fungal pathogenCla-dosporium fulvum. Mol. Plant-MicrobeInteract.12:592–603

120. Mhiri C, Morel JB, Vernhettes S, Casacu-berta JM, Lucas H, Grandbastien MA.1997. The promoter of the tobaccoTnt1retrotransposon is induced by woundingand by abiotic stress.Plant Mol. Biol.33:257–66

121. Miller JT, Dong F, Jackson SA, SongJ, Jiang J. 1998. Retrotransposon-relatedDNA sequences in the centromeres ofgrass chromosomes.Genetics150:1615–23

122. Moore G, Devos KM, Wang Z, Gale MD.1995. Grasses, line up and form a circle.Curr. Biol. 5:737–39

123. Moore JK, Haber JE. 1996. Capture ofretrotransposon DNA at the sites of chro-mosomal double-strand DNA breaks.Na-ture383:644–45

124. Moore G, Lucas H, Batty N, Flavell R.1991. A family of retrotransposons andassociated genomic variation in wheat.Genomics10:461–68

125. Moreau-Mhiri C, Morel JB, Audeon C,

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?528 KUMAR ■ BENNETZEN

Ferault M, Grandbastien MA, Lucas H.1996. Regulation of expression of the to-baccoTnt1 retrotransposon in heterolo-gous species following pathogen-relatedstresses.Plant J.9:409–19

126. Motohashi R, Mochizuiki K, OhtsuboH, Ohtsubo E. 1997. Structures and dis-tribution of p-SINE1 members in ricegenomes.Theor. Appl. Genet.95:359–68

127. Mount SM, Rubin GM. 1985. Completenucleotide sequence of theDrosophilatransposable elementcopia: homologybetweencopia and retroviral proteins.Mol. Cell. Biol.5:1630–38

128. Noma K, Nakajima R, Ohtsubo H, Oht-subo E. 1997.RIRE1, a retrotranspo-son from wild rice (Oryza australiensis).Genes Genet. Syst.72:131–40

129. Noma K, Ohtsubo E, Ohtsubo H. 1999.Non-LTR retrotransposon LINEs areubiquitous components of plant genomes.Mol. Gen. Genet.261:71–79

130. Noutoshi Y, ItoY, Kanetani S, Fuji M,Usami S, Yamada T. 1998. Molecularanatomy of a small chromosome in thegreen algaChlorella vulgaris. NucleicAcids Res.26:3900–7

131. Nozue K, Kanegae T, Wada M. 1997. Afull length Ty3/gypsy-type retrotranspo-son in the fernAdiantum. J. Plant Res.110:495–99

132. O’Neill RJW, O’Neill MJ, Graves MA.1998. Undermethylation associated withretroelements activation and chromo-somes remodelling in an interspecificmammalian hybrid.Nature393:68–72

133. Okada N, Hamada M, Ogiwara I,Ohshima K. 1997. SINEs and LINEsshare common 3′ sequences: a review.Gene205:229–43

134. Orgel LE, Crick FHC. 1980. Selfish DNA:the ultimate parasite.Nature284:604–7

135. Pal-Bhadra M, Bhadra U, Birchler JA.1997. Cosuppression inDrosophila: genesilencing of alcohol dehydrogenasebywhite-adhtransgenes ispolycombdepen-dent.Cell 90:479–90

136. Panstruga R, Buschges R, Piffanelli P,Schulze-Lefert P. 1998. A contiguous 60kb genomic stretch from barley revealsmolecular evidence for gene islands ina monocot genome.Nucleic Acids Res.26:1056–62

137. Pardue ML, Danilevskaya ON, TraverseKL, Lowenhaupt K. 1997. Evolutionarylinks between telomeres and transposableelements.Genetica100:73–84

138. Pastugilia M, Ruffio-Chable V, DelormeV, Gaude T, Dumas C, Cock JM. 1997.A functional S locus anther gene is notrequired for the self-incompatibility re-sponse inBrassica oleracea. Plant Cell9:2065–76

139. Paul PK, Kunert E, Hunter E, Grand-bastien M-A. 1994. Expression of the to-baccoTnt1 retrotransposon promoter inheterologous species.Plant Mol. Biol.26:393–402

140. Pearce SR, Harrison G, Heslop-HarrisonJS, Flavell AJ, Kumar A. 1997. Character-isation and genomic organisation of Ty1-copia group retrotransposons in rye (Se-cale cereale). Genome40:617–25

141. Pearce SR, Harrison G, Li D, Heslop-Harrison JS, Kumar A, Flavell AJ. 1996.The Ty1-copiagroup retrotransposons inVicia species: copy number, sequenceheterogeneity and chromosomal localisa-tion. Mol. Gen. Genet.250:305–15

142. Pearce SR, Pich U, Harrison G, FlavellAJ, Heslop-Harrison JS, et al. 1996.The Ty1-copiagroup retrotransposons ofAllium cepa are distributed throughoutthe chromosomes but are enriched in theterminal heterochromatin.ChromosomeRes.4:357–64

143. Pearce SR, Stuart-Rogers CM, Kumar A,Ellis THN, Flavell AJ. 1999. Rapid iso-lation of plant Ty1-copia group retro-transposon LTR sequences for molecularmarker studies.Plant J.19:711–17

144. Pelissier T, Tutois S, Tourmente S, De-ragon JM, Picard G. 1996. DNA re-gions flanking the majorArabidopsis

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 529

thalianasatellite are principally enrichedin Athila retroelement sequences.Genet-ica 97:141–51

145. Petrov D. 1977. Slow but steady: Reduc-tion of genome size through biased muta-tion. Plant Cell11:1900–1

146. Pimpinelli S, Berloco M, Fanti L, DemitriP, Bonaccorsi E, et al. 1995. Trans-posable elements are stable compo-nents ofDrosophila melanogasterhete-rochromatin.Proc. Natl. Acad. Sci. USA92:3804–8

147. Pouteau S, Grandbastien MA, Boccara M.1994. Microbial elicitors of plant defenseresponse activate transcription of a retro-transposon.Plant J.5:535–42

148. Pouteau S, Huttner E, Grandbastien MA,Caboche M. 1991. Specific expression ofthe tobaccoTnt1 retrotransposon in pro-toplasts.EMBO J.10:1911–18

149. Pozueta-Romero J, Houlne G, SchantzR. 1998. Identification of a short in-terspersed repetitive element in partiallyspliced transcripts of the bell pepper(Capsicum annuum) PAP gene: new evo-lutionary and regulatory aspects on planttRNA-related SINEs.Gene214:51–58

150. Presting GG, Malysheva L, FuchsJ, Schubert I. 1998. A TY3/GYPSYretrotransposon-like sequence localises tothe centromeric regions of cereal chromo-somes.Plant J.16:721–28

151. Purugganan MD, Wessler SR. 1994.Molecular evolution of magellan, amaize Ty3/gypsy-like retrotransposon.Proc. Natl. Acad. Sci. USA91:11674–78

152. Purugganan MD, Wessler SR. 1995.Transposon signatures-species-specificmolecular markers that utilise a class ofmultiple-copy nuclear-DNA.Mol. Ecol.4:265–69

153. Ramsay L, Macaulay M, Cardle L, Mor-gante M, Degli-Ivanissevich S, et al. 1999.Intimate association of microsatellite re-peats with retrotransposons and other dis-persed repetitive elements in barley.PlantJ. 17:415–23

154. Razin A. 1998. CpG methylation, chro-matin structure and gene silencing—athree way connection.EMBO J.17:4905–8

155. Royo J, Nass N, Matton DP, OkamotoS, Clarke AE, Newbigin E. 1996. Aretrotransposon-like sequence linked tothe S-locus ofNicotiana alata is ex-pressed in styles in response to touch.Mol. Gen. Genet.250:180–88

156. SanMiguel P, Bennetzen JL. 1998. Ev-idence that a recent increase in maizegenome size was caused by the mas-sive amplification of intergene retrotrans-posons.Ann. Bot.81:37–44

157. SanMiguel P, Gaut BS, Tikhonov A,Nakajima Y, Bennetzen JL. 1998. Thepalaeontology of intergene retrotrans-posons of maize.Nat. Genet.20:43–45

158. SanMiguel P, Tikhonov A, Jin YK,Motchoulskaia N, Zakharov D, et al.1996. Nested retrotransposons in the in-tergenic regions of the maize genome.Sci-ence274:765–68

159. Sato Y, Sentoku N, Miura Y, HirochikaH, Kitano H, Matsuoka M. 1999. Loss-of-function mutations in the rice homeoboxgeneOSH15affect the architecture of in-ternodes resulting in dwarf plants.EMBOJ. 18:992–1002

160. Schmid CW. 1998. Does SINE evolutionpreclude Alu function? Nucleic AcidsRes.26:4541–50

161. Schmidt T, Kubis S, Heslop-Harrison JS.1995. Analysis and chromosomal local-isation of retrotransposons in sugar beet(Beta vulgaris) LINES and Ty1-copia-like elements as major components of thegenome.Chromosome Res.3:335–45

162. Schuster W, Brennicke A. 1987. Plastid,nuclear and reverse transcriptase se-quences in the mitochondrial genome ofOenothera: is genetic information trans-ferred between organelles via RNA?EMBO J.45:61–78

163. Schwarz-Sommer Z, Leclercq L, G¨obel

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?530 KUMAR ■ BENNETZEN

E, Saedler H. 1987.Cin4, an insert al-tering the structure of theA1 gene ofZea mays, exhibits properties of nonviralretrotransposons.EMBO J.6:3873–80

164. Sentry JW, Smyth DR. 1989. An elementwith long terminal repeats and its vari-ant arrangements in the genome ofLiliumhenryi. Mol. Gen. Genet.215:349–54

165. Shepherd NS, Schwarz-Sommer Z,Blumberg vel Spalve J, Gupta M, Wien-and U, Saedler H. 1984. Similarity oftheCin1 repetitive family ofZea maystoeukaryotic transposable elements.Nature307:185–87

166. Simmen MW, Leitgeb S, Charlton J, JonesSJ, Harris BR, et al. 1999. Nonmethy-lated transposable elements and methy-lated genes in chordate genome.Science11:64–67

167. Smit AFA. 1996. The origin of inter-spersed repeats in the human genome.Curr. Opin. Genet. Dev.6:743–48

168. Smyth DR, Kalitsis P, Joseph JL, SentryJW. 1989. Plant retrotransposon fromLil-ium henryiis related to Ty3 of yeast andthegypsygroup ofDrosophila.Proc. Natl.Acad. Sci. USA86:5015–19

169. Smyth DR. 1993. Plant retrotransposons.In Control of Gene Expression, ed. DPSVerma, pp. 1–15. London: CRC Press

170. Song SU, Gerasimova T, Kurkulos M,Boeke JD, Corces VG. 1994. An Env-likeprotein encoded byDrosophila retroele-ment: evidence thatgypsyis an infectiousretrovirus.Genes Dev.8:2026–57

171. Song W-Y, Pi L-Y, Wang G-L, GardnerJ, Holsten T, Ronald PC. 1997. Evolutionof the riceXa21disease resistance genefamily. Plant Cell9:1279–87

172. Suoniemi A, Anamthawat-Jonsson K,Arna T, Schulman AH. 1996. Retrotrans-posonBARE-1is a major, dispersed com-ponent of the barley (Hordeum vulgareL.) genome.Plant Mol. Biol. 30:1321–29

173. Suoniemi A, Narvanto A, Schulman AH.1996. The BARE-1 retrotransposon is

transcribed in barley from an LTR pro-moter active in transient assays.PlantMol. Biol. 31:295–306

174. Suoneimi A, Schmidt D, Schulman AH.1997.BARE-1insertion site preferencesand evolutionary conservation of RNAand cDNA processing sites.Genetica100:219–30

175. Suoniemi A, Tanskanen J, Pentikainen O,Johnson MS, Schulman AH. 1998. Thecore domain of retrotransposon integrasein Hordeum: predicted structure and evo-lution. Mol. Biol. Evol.15:1135–44

176. Suoniemi A, Tanskanen J, Schulman AH.1998. Gypsy-like retrotransposons arewidespread in the plant kingdom.PlantJ. 13:699–705

177. Takeda S, Sugimoto K, Otsuki H, Hi-rochika H. 1998. Transcriptional activa-tion of the tobacco retrotransposonTto1by wounding and methyl jasmonate.PlantMol. Biol. 36:365–76

178. Takeda S, Sugimoto K, Otsuki H, Hi-rochika H. 1999. 13–bpcis-regulatory el-ement in the LTR promoter of tobaccoretrotransposonTto1 is involved in re-sponsiveness to tissue culture, wound-ing, methyl jasmonate and fungal elici-tors.Plant J.18:1–11

179. Tatout C, Lavie L, Deragon JM. 1998.Similar target site selection occurs in in-tegration of plant and mammalian retro-posons.J. Mol. Evol.47:463–70

180. Temin HM. 1980. Origin of retrovirusesfrom cellular moveable genetic elements.Cell 21:599–600

181. Thomson KG, Thomas JE, Dietzgen RG.1998. Retrotransposon-like sequences in-tegrated into the genome of pineapple,Ananas comosus. Plant Mol. Biol. 38:461–65

182. Tikhonov AP, SanMiguel PJ, Nakajima Y,Gorenstein ND, Bennetzen JL, AvramovaZ. 1999. Colinearity and its exceptionsin orthologousadh regions of maize andsorghum.Proc. Natl. Acad. Sci. USA96:7409–14

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?PLANT RETROTRANSPOSONS 531

183. Trelogan S, Martin SL. 1995. Tightlyregulated, developmentally specific ex-pression of the first open reading frameLINE-1 during mouse embryogenesis.Proc. Natl. Acad. Sci. USA92:1520–24

184. Turcich MP, Bokharririza A, HamiltonDA, He CP, Messier W, et al. 1996.PREM-2, acopia-type retroelement inmaize is expressed preferentially in earlymicrospores.Sexual Plant Reprod.9:65–74

185. Unseld M, Marienfeld JR, Brandt P,Brennicke A. 1997. The mitochondrialgenome ofArabidopsis thalianacontains57 genes in 366,924 nucleotides.Nat.Genet.15:57–61

186. VanderWiel PL, Voytas DF, Wendel JF.1993. Copia-like retrotransposable ele-ment evolution in diploid and polyploidcotton (GossypiumL.). J. Mol. Evol.36:429–47

187. Varagona MJ, Purugganan M, WesslerSR. 1992. Alternative splicing inducedby insertion of retrotransposons into themaizewaxygene.Plant Cell4:811–20

188. Vegh Z, Vincze E, Kadirov R, TothG, Kiss GB. 1990. The nucleotide se-quence of a nodule-specific geneNms-25of Medicago sativa: its primary evolutionvia exon-shuffling and retrotransposon-mediated DNA rearrangements.PlantMol. Biol. 15:295–306

189. Vernhettes S, Grandbastien MA, Casacu-berta JM. 1997. In vivo characterisation oftranscriptional regulatory sequences in-volved in the defence-associated expres-sion of the tobacco retrotransposonTnt1.Plant Mol. Biol.35:673–79

190. Vernhettes S, Grandbastien MA, Casacu-berta JM. 1998. The evolutionary analy-sis of theTnt1 retrotransposon inNico-tiana species reveals the high variabilityof its regulatory sequences.Mol. Biol.Evol.15:827–36

191. Vicient CM, Suoniemi A, Anamthawat-Jonsson K, Tanskanen J, et al. 1999.RetrotransposonBARE-1and its role in

genome evolution in the genusHordeum.Plant Cell.11:1769–84

192. Vignols F, Rigau J, Torres MA, Capel-lades M, Puigdomenech P. 1995. Thebrown midrib3 (bm3)mutation in maizeoccurs in the gene encoding caffeic acidO-methyltransferase.Plant Cell 7:407–16

193. Voytas DF, Ausubel FM. 1988. Acopia-like transposable element family inAra-bidopsis thaliana. Nature336:242–44

194. Voytas DF, Cummings MP, KoniecznyA, Ausubel FM, Rodermel SR. 1992.Copia-like retrotransposons are ubiqui-tous among plants.Proc. Natl. Acad. Sci.USA89:7124–28

195. Voytas DF, Konieczny A, Cummings MP,Ausubel FM. 1990. The structure, dis-tribution and evolution of theTa1 retro-transposable element family ofArabidop-sis thaliana. Genetics126:713–21

196. Wakimoto B. 1998. Beyond the nucle-osome: epigenetic aspects of position-effect variegation inDrosophila. Cell 93:321–24

197. Wang G-L, Ruan DL, Song W-Y, SiderisS, Chen L, et al. 1998.Xa21D encodesa receptor-like molecule with a leucine-rich repeat domain that determines race-specific recognition and is subject toadaptive evolution.Plant Cell10:765–79

198. Wang S, Zhang Q, Maughan PJ, Saghai-Maroof MA. 1997.Copia-like retrotrans-posons in rice: Sequence heterogeneity,species distribution and chromosomal lo-cations.Plant Mol. Biol.33:1051–58

199. Wang SP, Liu N, Peng KM, Zhang QF.1999. The distribution and copy num-ber ofcopia-like retrotransposons in rice(Oryza sativa) and their implications inthe organization and evolution of therice genome.Proc. Natl. Acad. Sci. USA96:6824–28

200. Warner SA, Scott RJ, Draper J. 1993. Iso-lation of an asparagus intracellular PRgene (AoPR) wound-responsive promoterby the inverse polymerase chain reaction

P1: FNE/fgo/fok P2: FDX/fgo QC: FhN/anil T1: FNE

November 2, 1999 22:53 Annual Reviews AR095-15

?532 KUMAR ■ BENNETZEN

and its characterisation in transgenic to-bacco.Plant J.3:191–201

201. Waugh R, McLean K, Flavell AJ, PearceSR, Kumar A, et al. 1997. Genetic distri-bution ofBARE-1–like retrotransposableelements in the barley genome revealedby sequence-specific amplification poly-morphisms (S-SAP).Mol. Gen. Genet.253:687–94

202. Wessler SR, Bureau TE, White SE. 1995.LTR-retrotransposons and MITES, im-portant players in the evolution of plantgenomes.Curr. Opin. Genet. Dev.5:814–21

203. White SE, Habera LF, Wessler SR. 1994.Retrotransposons in the flanking regionsof normal plant genes-a role forcopia-likeelements in the evolution of gene struc-ture and expression.Proc. Natl. Acad. Sci.USA91:11792–96

204. Williamson VM. 1983. Transposable el-ements in yeast.Int. Rev. Cytol.83:1–25

205. Wright DA, Ke N, Smalle J, Hauge BM,Goodman HM, Voytas DF. 1996. Mul-tiple non-LTR retrotransposons in thegenome ofArabidopsis thaliana. Genet-ics142:569–78

206. Wright DA, Voytas DF. 1998. Potentialretroviruses in plants:Tat1 is related to agroup ofArabidopsis thalianaTy3/gypsyretrotransposons that encode envelope-like proteins.Genetics149:703–15

207. Xiong Y, Eickbush TH. 1990. Origin andevolution of retroelements based upontheir reverse transcriptase sequences.EMBO J.9:3353–62

208. Yoder JA, Walsh CP, Bestor TH. 1997.Cytosine methylation and the ecology ofintragenomic parasites.Trends Genet.13:335–40

209. Youngman S, Van Luenen HGAM, Plas-terk RHA. 1996.Rte-1, a retrotransposon-like element inCaenorhabditis elegans.FEBS Lett.380:1–7

210. Yoshioka Y, Matsumoto S, Kjima S,Ohshima K, Okada N, Machida Y. 1993.Molecular characterization of a short in-terspersed repetitive element from to-bacco that exhibits sequence homologyto specific tRNAs.Proc. Natl. Acad. Sci.USA90:6562–66

211. Zou S, Voytas DF. 1997. Silent chromatindetermines target preference of theSac-charomycesretrotransposon Ty5.Proc.Natl. Acad. Sci. USA94:7412–16