Genetic diversity among sugarcane clones using Target ...

88
Louisiana State University LSU Digital Commons LSU Master's eses Graduate School 2005 Genetic diversity among sugarcane clones using Target Region Amplification Polymorphism (TP) markers and pedigree relationships Jie Alojado Arro Louisiana State University and Agricultural and Mechanical College, [email protected] Follow this and additional works at: hps://digitalcommons.lsu.edu/gradschool_theses is esis is brought to you for free and open access by the Graduate School at LSU Digital Commons. It has been accepted for inclusion in LSU Master's eses by an authorized graduate school editor of LSU Digital Commons. For more information, please contact [email protected]. Recommended Citation Arro, Jie Alojado, "Genetic diversity among sugarcane clones using Target Region Amplification Polymorphism (TP) markers and pedigree relationships" (2005). LSU Master's eses. 1094. hps://digitalcommons.lsu.edu/gradschool_theses/1094

Transcript of Genetic diversity among sugarcane clones using Target ...

Louisiana State UniversityLSU Digital Commons

LSU Master's Theses Graduate School

2005

Genetic diversity among sugarcane clones usingTarget Region Amplification Polymorphism(TRAP) markers and pedigree relationshipsJie Alojado ArroLouisiana State University and Agricultural and Mechanical College, [email protected]

Follow this and additional works at: https://digitalcommons.lsu.edu/gradschool_theses

This Thesis is brought to you for free and open access by the Graduate School at LSU Digital Commons. It has been accepted for inclusion in LSUMaster's Theses by an authorized graduate school editor of LSU Digital Commons. For more information, please contact [email protected].

Recommended CitationArro, Jie Alojado, "Genetic diversity among sugarcane clones using Target Region Amplification Polymorphism (TRAP) markers andpedigree relationships" (2005). LSU Master's Theses. 1094.https://digitalcommons.lsu.edu/gradschool_theses/1094

GENETIC DIVERSITY AMONG SUGARCANE CLONES USING TARGET REGION AMPLIFICATION POLYMORPHISM (TRAP) MARKERS AND PEDIGREE

RELATIONSHIPS

A Thesis Submitted to the Graduate Faculty of the

Louisiana State University and Agricultural and Mechanical College

in partial fulfilment of the requirements for the degree of

Master of Science

in

The Department of Agronomy and Environmental Management

by Jie A. Arro

B.S., University of the Philippines, Los Baños, 1995 May 2005

ACKNOWLEDGEMENTS

I am deeply thankful to my adviser, Dr. Collins Kimbeng, whose advice, patience, support and

encouragement have no doubt enabled me to hurdle the task of completing my degree. Among so many

other things, I am thankful for his decision in sending me to an established lab in Virginia for training

during my first semester. The skill I learned proved invaluable towards the completion of this research. I

am also grateful both for the recommendation letter and financial assistantship he provided during the

extension of my graduate program. Indeed, I am honoured to be one of his students and friends.

I also express gratitude to my committee advisers, Dr. Prasanta Subudhi and Dr. Kenneth Gravois,

for having the time to read and provide important comments to my manuscript at such a short notice.

Many thanks also to Chris Laborde of St. Gabriel Research Station and Drs. Yon-Bao Pan, John

Veremis and Tom Tew of USDA- ARS, Houma for all the readily-available help especially in providing

the sugarcane clones used in this study. Also to those who patiently taught me invaluable laboratory

skills: Chris Botanga (Timko lab, UVA), Mary Bowen and Alicia Ryan.

I sincerely thank the Fulbright-Philippine Agriculture Scholarship Program through the PAEF and IIE

for the rich “Fulbright experience” that has made me a better person.

I would like also to thank the following: Mr. Leon Arceo, director of PHILSURIN, for permitting me

to go on a study leave; my fellow graduate students in agronomy department especially Sreedhar and

Suman; the Filipino community especially A. Lilia and K. Chris, A. Jackie and K. Bong; my brethren at

Fil-Am Christian Fellowship especially Ptr Nelio and A. Lydia, Ron and Faith, Bill and A. Thelma and

Lorna; also to Ptr Rey Villamayor and my comrades at Punlaan.

I am thankful to my family: to Tatay and Nanay for the gift of life; my siblings: Nonoy and Jan-jan,

for the prayers; to my wife and son, Gina and Justin Judiael, for the support and unconditional love; and

finally to my Lord Jesus Christ, for the saving grace.

ii

TABLE OF CONTENTS

ACKNOWLEDGEMENTS............................................................................................................ ii

LIST OF TABLES.......................................................................................................................... v

LIST OF FIGURES ....................................................................................................................... vi

ABSTRACT................................................................................................................................. viii

INTRODUCTION AND LITERATURE REVIEW ...................................................................... 1 SUC Gene Family ....................................................................................................................... 7

SuSy ........................................................................................................................................ 7 SAI .......................................................................................................................................... 8 PPDK ...................................................................................................................................... 9

CT Gene Family........................................................................................................................ 10 CRT/DRE.............................................................................................................................. 10 COR15a................................................................................................................................. 11 Mischanthus-specific PPDK ................................................................................................. 11 CDPK.................................................................................................................................... 11

TRICH Gene Family................................................................................................................. 12 GL1 ....................................................................................................................................... 12 GL2 ....................................................................................................................................... 13 TTG1..................................................................................................................................... 13

Objectives ................................................................................................................................. 14 MATERIALS AND METHODS.................................................................................................. 15

Plant Materials .......................................................................................................................... 15 DNA Isolation........................................................................................................................... 15 TRAP - Fixed Primer Design.................................................................................................... 17 TRAP - Arbitrary Primer Design.............................................................................................. 19 TRAP - PCR Protocol............................................................................................................... 20 Statistical Analyses ................................................................................................................... 21

Marker Information............................................................................................................... 22 Bootstrap Analysis ................................................................................................................ 22 Cluster Analysis .................................................................................................................... 23 Mantel Test ........................................................................................................................... 24 Multidimensional Scaling (MDS)......................................................................................... 25 Minimum Spanning Tree (MST) .......................................................................................... 26 Mixed Model Analysis on MDS Axes.................................................................................. 26 Analysis of Molecular Variance (AMOVA)......................................................................... 27 Coefficient of Parentage ....................................................................................................... 27

RESULTS AND DISCUSSION................................................................................................... 29

TRAP Marker Polymorphism................................................................................................... 29 Polymorphism Information Content (PIC) Values ................................................................... 33

iii

Bootstrap Analysis .................................................................................................................... 35 Cluster Analysis ........................................................................................................................ 37 Cluster Analysis for Each Gene Family.................................................................................... 39 Comparison of GS Trends Among Gene Families ................................................................... 42 AMOVA on Gene Family Structure ......................................................................................... 43 Multi-dimensional Scaling (MDS) Analysis ........................................................................... 49 Analysis of Molecular Variance (AMOVA)............................................................................. 52 MDS With Mixed Model Analysis On Era Per Gene Families ................................................ 54

SUC....................................................................................................................................... 56 CT ......................................................................................................................................... 56 TRICH................................................................................................................................... 57

Pedigree Analysis...................................................................................................................... 57 COP vs. TRAP (Cluster and MDS Analyses)........................................................................... 60

SUMMARY AND CONCLUSION ............................................................................................. 64 BIBLIOGRAPHY......................................................................................................................... 68 VITA............................................................................................................................................. 78

iv

LIST OF TABLES

1. List of 63 clones used within the study and their breeding station origin……………………16

2. Gene family/genes within gene family and characteristics of genes included in the study…...18

3. Arbitrary 18-mer primer sequences used in this study derived from Li and Quiros, (2001).....19

4. TRAP protocol. The protocol is designed to run two separate reactions simultaneously…….20

5. TRAP PCR conditions (Hu and Vick, 2003)……...…….………………………………….…21

6. TRAP marker information generated on 60 sugarcane clones using 40 primer combinations…………..………………………………………………….……………….…..30 7. Pedigree relationship of the 63 clones.………………………………………..………………40

8. Analysis of Molecular Variance partitioning genetic variation for gene family, genes within gene family and genes using 63 sugarcane clones…………………..……….….49 9. AMOVA on 63 clones based era of release…………………………………………...………53

10. Least square means and t-test of the MDS axes for era of cultivar release……….…………54

11. Least square means and t-test of the MDS axes for era of cultivar release based on SUC GS……………………………………………………………………………55 12. Least square means and t-test of the MDS axes for Era of cultivar release based on CT GS…………………………………………………………………………..….56

13. Least square means and t test of the MDS axes for Era of cultivar release based on TRICH GS……………………………………………………………………………..…57 14. Foundation clones of the 63 clones………………..……………………………………..….58

v

LIST OF FIGURES

1. Example of TRAP amplification pattern……………………………….……………………..32 2. Box plot representation of mean and standard deviation for SUC, CT and TRICH gene family.............................................................................................33 3. Box plot representation of PIC mean and standard deviation for SUC, CT and TRICH gene families..….……………………………………………………...34 4. Plot of the coefficient of variation with its standard error (bars) estimated by Bootstrap analysis using different number of TRAP bands...…………………………..………………..36 5. Plot of the coefficient of variation estimated by bootstrap analysis using different number of resampled TRAP bands derived from SUC, CT and TRICH gene families………37 6. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph=0.85) using TRAP markers from all the gene families (SUC, CT, TRICH)….……….………………………………………………………44 7. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph=0.82) using the SUC TRAP markers………………………….45 8. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph=0.86) using the CT TRAP markers……………………………46 9. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph=0.85) using TRICH……………………………………………47 10. Smouse-Long-Sokal three way mantel test from SUC, CT, TRICH GS matrices…………..48 11. Bi-plot of the first two MDS scales among the ten genes derived from pairwise distance generated by AMOVA……………………………………………………………...50 12. Bi-plot of the first two MDS scales among the 63 clones using GSJaccard derived from TRAP markers………………………………………………………………...51 13 Bi-plot of the first two MDS scales among the 63 clones classified into old and new using GSJaccard derived from TRAP markers……………………………………54 14. Bi-plot of the first two MDS scales among the 63 clones classified into old (Pre-1980) and new (post-1980)using GSJaccard derived from TRAP markers of the SUC gene families…………………………………………………………..55

vi

15. Bi-plot of the first two MDS scales among the 63 clones classified into old (pre-1980) and new (post-1980) using GSJaccard derived from TRAP markers using CT genes………………………………………………………….……….….56 16. Bi-plot of the first two MDS scales among the 63 clones classified into old (pre-1980) and new (post-1980) using GSJaccard derived from TRICH markers….……..57 17. Percent presence of the foundation clones in the 63 clones…………………………………60 18. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GS estimates using Coefficient of Parentage (COP)..………………………61 19. Mantel test on COP vs. SUC, CT, TRICH GS matrices.…………..………………….….…62

vii

ABSTRACT

Genetic diversity is indispensable to sustain genetic gain in breeding programs.

Cultivated sugarcane is a highly heterozygous hybrid derived from crossing two highly

heterozygous parents. Sugarcane breeders traditionally rely on pedigree records to select

parents. Molecular markers now make it possible to assess genetic diversity at the DNA level.

Sixty three sugarcane clones were characterised using Target Region Amplification

Polymorphism (TRAP) markers and pedigree relationship (Coefficient of parentage (COP)). The

TRAP is a PCR-based marker with a fixed primer designed from Expressed Sequence Tag (EST)

sequences paired with an arbitrary primer. It is supposed to unravel trait based polymorphism in

the intronic- or exonic-regions of the genome. Ten genes evidently involved in sucrose

accumulation (SUC), cold tolerance (CT) and trichome development (TRICH) were paired with

four arbitrary primers. A total of 3,170 bands were scored of which 2,684 (85%) were

polymorphic. Cluster and Multi Dimensional Scaling (MDS) analyses revealed a very narrow

genetic diversity among the entries with genetic similarity (GS) ranging from 78 to 94%.

Parentage did not seem to contribute to the grouping pattern in either the overall or individual

gene family (SUC, CT and TRICH) clusters which was confirmed by the lack of correlation (r =

-0.008) between the TRAP and COP-derived GS matrices. The complex genome of sugarcane

and the strict assumptions inherent with estimating COP may account for this disparity. Analysis

of Molecular Variance (AMOVA) revealed no structure in the population with regards to era of

release (Pre- versus post-1980) with among clones accounting for up to 99% of the total

variation and only 1% of variation attributable to era of release. Mixed Model Analysis on the

MDS axes generally revealed no significant differences among era of release. Thus, pedigree

records can enhance interpretation of marker-derived information especially in an interspecific

viii

crop like sugarcane where ancestral species possess and contribute different characteristics.

Results from this study needs to be supplemented with sequence analysis of TRAP fragments to

definitively relate the derived information to trait variation.

ix

1

INTRODUCTION AND LITERATURE REVIEW

Saccharum is a complex genus characterized by high ploidy levels and composed of at

least six distinct species – S. officinarum, S. barberi, S. sinensi, S. spontaneum, S. robustum and

S. edule (Daniels and Roach, 1987; GRIN, 2004; Naidu and Sreenivasan, 1987). Described as

an allopolyploid, modern cultivated sugarcane have approximately 80-140 chromosomes with 8-

18 copies of a basic set (i.e. x = 8 or x = 10 haploid chromosome number) (D'hont et al., 1995;

Ha et al., 1999; Ming et al., 2001). Sugarcane was vegetatively cultivated as noble canes (i.e. S.

officinarum) until the end of the 20th century when it succumbed to the devastating sereh disease

which prompted plant breeders to hybridize it with its wild relative, S. spontaneum. In a

process termed as nobilization, the resulting hybrid progeny was repeatedly backcrossed to S.

officinarum to restore the high-sucrose producing plant types. S. officinarum, the noble cane

from Asia, is thought to comprise a large part of the cultivated sugarcane genome and confers the

genes for high sucrose content, low fiber, thick stalks, sparse pubescence, rare flowering and

limited tillering (Ming et al., 2001). The wild relative, S. spontaneum, comprise about 10% of

the cultivated sugarcane as evidenced from in situ hybridization (D'hont et al., 1996) and is

credited to impart the needed pest and disease resistance and abiotic stress tolerance due to its

wide ecogeographical adaptive distribution (Sreenivasan et al., 1987).

The nobilization endeavour proved successful in averting the threat of diseases. For the

most part of the last century, sugarcane breeding activity thrived essentially by intercrossing the

original nobilized clones and their derived progeny. It is well known that only a few clones were

involved in the original nobilization event (Arcenueaux, 1965). There is a growing concern

among sugarcane breeders regarding the narrowness of the genetic diversity in the existing

sugarcane germplasm worldwide. Data from various studies that looked at chloroplast (cpDNA)

and mitochondrial (mtDNA) DNA in the clones of Saccharum spp. and hybrids indicated a

2

narrow genetic base (Al-Janabi et al., 1994; Deren, 1995; D'hont et al., 1994; Mangelsdorf,

1983). Also, there are no active base broadening programs alongside cultivar development

programs in most sugarcane breeding stations. This is compounded by the practice of sugarcane

breeders to use the proven cross method of choosing parents. The proven cross method has the

bias of repeatedly using, in high frequency, parents from good performing crosses. Such

concentrated use of a few parental clones each crossing season seems to contradict the base

broadening efforts (Heinz and Tew, 1987; Kimbeng et al., 2004). Untested crosses and new

parents are relegated to exploratory evaluation and are thus proportionally lesser.

The U.S. sugarcane breeding program was started in 1919 at the USDA Sugarcane

Station at Canal Point, Florida where sugarcane is able to flower naturally. The sugarcane

research stations at Louisiana State University (LSU), Baton Rouge, Louisiana and USDA-ARS,

Houma, Louisiana were later established in 1922 and 1923 respectively. Seeds and seedcanes

bred from Canal Point, Florida were sent to Houma, Louisiana for site-specific tests and

evaluation. In 1924, Louisiana State University (LSU), United States Department of Agriculture

(USDA) and the American Sugarcane League agreed on a collaborative program of what is now

known as the “Three-Way Agreement”. This agreement made possible active exchange of

materials between the breeding stations and collaborative regional varietal testing and evaluation

(LSU AgCenter, 2001).

Sugarcane has been a major part of south Louisiana’s economy and culture since the last

two centuries. Cultivar development is the single most important factor that plays a critical role

in the industry’s productivity. Sugarcane is currently grown on about 475,000 acres and

contributed about $2.3 billion to the state’s economy in 2001 (LSU AgCenter, 2001). The major

contributing factor to this boost in production is the widespread adoption of LCP85-384 which

has grown in popularity since its release a decade ago. The most recent crop survey indicates

3

that it occupies about 88% of the acreage planted to sugarcane (Legendre and Gravois, 2004).

No other cultivars, including the other four that are recommended for commercial growing in

Louisiana, occupies more than 4% of the total acreage in the current survey. Moreover, it was

noted that only two cultivars – LCP85-384 and HoCP91-555, showed increase in the planted

acreage from the previous year. With this current state wide varietal landscape, researchers and

plant breeders are well aware of the genetic vulnerability to diseases of the Louisiana Sugar

Industry as was the case in Cuba (Diaz et al., 2001) and Australia (Kimbeng, Personal

Communication; (Braithwaite et al., 2004)).

The accurate quantification of the genetic diversity of major agricultural crops is

important both scientifically and socio-economically (Swanson, 1996). Concern has often been

expressed that the practice of modern intensive plant breeding leads inevitably to a reduction in

both diverse agricultural practices and genetic diversity of crops (Reeves et al., 1999). One way

of averting this impending problem is to broaden the genetic base of the breeding program.

A basic understanding of the genetic diversity that exists in the germplasm available for

breeding is fundamental to the success of a breeding program. This knowledge can be useful in

the utilization and management of genotypes and indeed genes in the breeding gene pool. In

sugarcane, for example, crosses could be planned between genotypes from divergent

backgrounds to maximize heterosis while increasing genetic diversity in the gene pool.

Sugarcane breeders have traditionally relied on pedigree records when planning such crosses.

Faulty genealogy and inadvertent mislabelling of clones can adversely complicate genetic

diversity estimates that rely solely on pedigree history.

Furthermore, as previously mentioned, sugarcane breeders are notorious for crossing

mostly parents that have attained the so-called proven cross status. That is, those parents that

4

produce elite progenies are retained for further crossing to the detriment of newer parents as

evident from the high number of vintage clones still involved in the parentage of newer cultivars

(Deren, 1995; Tew, 1987). Population improvement strategies such as recurrent selection are

generally not practiced in sugarcane breeding because of apparent problems with

synchronization of flowering. Most sugarcane progenies are derived from bi-parental crosses.

Potential parents are selected largely based on their performance as clones in advanced stage

trials. Therefore, continuous selection for the same traits may narrow genetic diversity to the

extent that it may be difficult to predict diversity based on pedigree history alone. With the

advent of molecular markers, it is now possible to make direct comparison of genetic diversity at

the DNA level without some of the over simplifying assumptions associated with calculating

genetic diversity based on pedigree history. For example, in calculating genetic diversity using

pedigree history prior selection, genetic drift and ancestral relationships are not adequately

accounted for.

Rapid advances in the field of molecular biology and its allied sciences, made the use of

molecular markers a routine practice providing plant breeders a precise tool in analyzing genetic

diversity for plant improvement (Andersen and Lubberstedt, 2003; Brar, 2002). Various genetic

diversity studies have been done on sugarcane using isoenzymes (Glaszmann et al., 1989),

Random Fragment Length Polymorphism (RFLP) (Coto et al., 2002; D'hont et al., 1994; Jannoo

et al., 1999), ribosomal DNA (Glaszmann et al., 1990), microsatellites (Cordeiro et al., 2003; Pan

et al., 2003; Piperidis et al., 2000), Amplified Fragment Length Polymorphism (AFLP®) (Besse

et al., 1998; Butterfield et al., 2004; Hoarau et al., 2001; Hoarau et al., 2002; Lima et al., 2002)

and molecular cytogenetics like in situ hybridization (Cuadrado et al., 2004; D'hont et al., 1996;

Ha et al., 1999; Jenkin et al., 1995; Piperidis et al., 2000). Results from these studies have

provided a clearer picture of the complex genetic architecture of sugarcane. Most of the

5

diversity found in modern sugarcane can be attributed to S. spontaneum, probably because S.

officinarum was used as the recurrent parent during nobilisation and transmitted 2n gametes to its

progeny (D'hont et al., 1996; Lu et al., 1994).

Although these markers portray the level of the diversity in sugarcane, there is still the

need to translate this information into a working system useful for sugarcane breeders. The

promise of marker assisted selection (MAS) is still to be realized in sugarcane. To date, most of

the markers used to genotype sugarcane and other crops like SSR, RFLP and AFLP® are

typically derived from polymorphic region randomly dispersed in the genome. These abundant

and phenotypically neutral random DNA markers have been very useful in biodiversity studies

and mapping of trait and QTL analysis (Andersen, 2003). Characterization of germplasm

genetic diversity was always seen by plant breeders as the most precise tool in improving the

genetic make-up of a cultivated crop species. The first wave of molecular marker application in

plant improvement saw voluminous work concentrated in dissecting and characterizing genetic

diversity using random molecular markers. However, the use of these random DNA markers are

severely limited in linkage and QTL analysis studies because such linkage can be broken by

genetic recombination. The advent of high throughput sequencing technology has generated

abundant information on DNA sequences for the genomes of many plant species. The

completion of the whole genome sequences for the model plant Thale cress (Arabidopsis

thaliana) and rice (Oryza sativa) have spurred excitement as to the practical and tangible impact

of genomics in crop and animal improvement. Consequently, several collaborative online

repositories/database have been set-up and are mostly publicly accessible and available.

Hundreds of thousands of annotated Expressed Sequence Tag (EST) libraries and putative

functional gene sequences are made available through powerful bioinformatics tools. This

milestone development provides geneticist and plant breeders the necessary tool to bridge the

6

gap between sequence information and molecular markers.

Gene targeted molecular markers may be more promising and meaningful than random

DNA markers in terms of characterizing genetic diversity. Whereas random DNA markers are

derived from polymorphic sites genome wide, gene targeted markers are derived from

polymorphisms within genes and thus reflect functional polymorphism (Andersen and

Lubberstedt, 2003). The potential of using genetic diversity to form sugarcane heterotic groups

to eventually predict offspring performance was described as a less promising prospect because

sugarcane has consistently shown no clear pattern of population structure due to its high level of

heterozygosity (Lu et al., 1994). High molecular polymorphism in sugarcane as revealed using

random DNA marker systems may be due to tremendous redundancy in the genome which may

further complicate results from genetic diversity studies based on random markers. Besides,

because of the sheer large genome size of sugarcane, emulating efforts in other crops in

saturating whole genomic maps with markers seem to be an enormous task that might find little

impact in sugarcane. To be of practical significance, genetic diversity and inter-relationship

measurements using molecular markers especially in sugarcane must somehow be based on

functionally characterized genes or traits. Access to increasing numbers of EST sequences

obtained from diverse cDNA libraries coupled with freely available bioinformatics tools now

allows us to explore new opportunities in sugarcane molecular marker research.

Geneticists and molecular biologists have accumulated a vast array of molecular markers

replete with conspicuous acronyms in the last five years. Recently, markers have been designed

that take advantage of the increasingly growing open online access to EST sequences obtained

from diverse cDNA libraries. Among the most recent is the Target Region Amplification

Polymorphism (TRAP) introduced by Hu and Vick (2003). TRAP is a novel polymerase chain

reaction (PCR)-based marker system that takes advantage of the available EST database

7

sequence information to generate polymorphic markers targeting candidate genes. Essentially it

derives an 18-mer primer from the EST sequence and pairs it with an arbitrary primer that targets

the intronic and/or exonic region (AT- or GC- rich core) (see Li and Quiros (2001)). Since

TRAP is based on PCR technology using anchored and arbitrary primers to amplify coding

regions in the genome, the resulting polymorphism should be reflective of diversity within

functional genes.

This study uses the TRAP markers to characterize the released and commercially

important sugarcane cultivars and some of their parents. Three gene families or groups of genes

directly or indirectly responsible for sucrose metabolism (SUC), cold tolerance (CT) and

trichome (TRICH) development were used in this study and are briefly described below.

SUC Gene Family

Sucrose (SUC) plays a central role in plant growth and development as it is a major

product of photosynthesis and is the major carbohydrate form used as energy source for growth

or storage reserves. Sugarcane belongs to a group of plants that are very productive in

assimilating its food by efficiently utilizing the C4 mechanism of CO2 fixation during

photosynthesis (Grof, 2001). Enzymes involved in sucrose metabolism are of particular interest

in sugarcane as sucrose is the main storage carbohydrate (Grivet and Arruda, 2002). Among the

SUC genes were Sucrose synthase (SuSy), Soluble acid invertase (SAI) and Pyruvate

orthophosphate dikinase (PPDK).

SuSy

Sugarcane is unique because it stores its food not in the form of glucose but in the

unstable form sucrose. Sucrose Synthase (SuSy) plays a central role in carbohydrate metabolism

in general, and sucrose accumulation in particular in all plant species. It belongs to a family of

invertases which are enzymes specialized in hydrolyzing sucrose into glucose and fructose.

8

SuSy catalyzes the reversible cleavage of sucrose and UDP (Uracil diphosphate) to UDP-glucose

and fructose (Schafer et al., 2004; Winter and Huber, 2000). The cleaved product, UDP-

glucose, acts as the substrate for cellulose and callose synthesis (Amor et al., 1995) or can enter

the ATP-generating glycolysis process (Schafer et al., 2004). It is thought that in

monocotyledonous species like sugarcane, SuSy is encoded by two differentially expressed

nonallelic loci Sus1 and Sus2 (Winter and Huber, 2000). In sugarcane, SuSy is associated with

vascular bundles (Nolte and Koch, 1993; Tomlinson et al., 1991; Yang and Russell, 1990). it

was earlier speculated that SuSy activity in mature sugarcane tissues is associated exclusively

with vascular bundles where sucrose accumulation/degradation activity occurs (Buczynski et al.,

1993). This was later clarified to be due to the presence of different forms of SuSy (i.e. SuSy

isoforms) between mature and young culms (Schafer et al., 2004). Needless to say, the definitive

physiological role of SuSy in sucrose accumulation/degradation is not yet clear. SuSy protein

was observed to undergo reversible phosphorylation in a localized membrane and in a soluble

form interacting with the actin cytoskeleton (Winter and Huber, 2000), suggesting a crucial role

in plant metabolism. In fact, SuSy activity was thought to be associated with starch synthesis

(Dejardin et al., 1997), cell wall synthesis (Chourey et al., 1998; Nakai et al., 1999) and overall

sink strength (Sun et al., 1992; Zrenner et al., 1995).

SAI

SAI (Soluble Acid Invertase) activity occurs mostly in the vacuoles of storage

parenchyma cells, and a few in the apoplastic cell wall space either as a soluble enzyme or bound

to the cell wall fraction (Hawker et al., 1991). SAI activity is regarded to have an inverse

relationship with sucrose accumulation in sugarcane, that is, SUC accumulation in the whole

stalk and within individual sugarcane internodes was correlated with the down-regulation of

soluble acid invertase (Zhu et al., 1997). SAI concentration is usually high in tissues that are fast

9

growing, such as cell and tissue cultures, root apices, and immature stem internodes but

decreases rapidly during internode growth and development (Zhu et al., 2000). It was found that

sugarcane clones that are of low sucrose level retain high levels of SAI in the mature stem, and

vice versa. Specifically, major differences in SUC accumulation among the population were

ascribed to differences between activities of SAI and SPS (Sucrose phosphate synthase),

provided SAI is below the critical threshold concentration (Zhu et al., 1997).

PPDK

PPDK (Pyruvate orthophosphate dikinase) is an important rate-limiting enzyme in plant

photosynthesis more pronounced in C4 plants such as sugarcane than in C3 plants. The C4

pathway consists of three key steps: the initial fixation of CO2 in the mesophyll cell cytosol by

phosphoenolpyruvate (PEP) carboxylase (PEPC) to form a C4 acid, decarboxylation of a C4 acid

in the bundle sheath cells to release CO2, and regeneration of the primary CO2 acceptor PEP in

the mesophyll cell chloroplasts by pyruvate orthophosphate dikinase (Hatch, 1987). Whereas

PPDK and its regulatory proteins are found in the chloroplast of C4 plants, PPDK is only present,

and at low concentrations, in the cytoplasm of C3 plants. Despite C3 PPDK being highly

homologous to its C4 counterpart, C3 PPDK is not believed to function in photosynthesis

(Minorsky, 2002). Even though the reasons for this observation largely remain unknown,

extensive attempts are being made to transfer C4-ppdk gene to C3 plants particularly in rice and

wheat (Fukayama et al., 2001).

In C4 plants like sugarcane, PPDK catalyzes the ATP- and Pi-dependent formation of

phosphoenolpyruvate (PEP) from pyruvate. The activity of PPDK is rapidly modulated in

response to changes in light intensity by reversible protein phosphorylation, which is mediated

by a bifunctional regulatory protein (Burnell and Hatch, 1985). Genes for PPDK involved in the

C4 pathway have a dual promoter system to express two different transcripts for the chloroplastic

10

and cytoplasmic forms of PPDK, with the former being specifically expressed at high levels in

green leaves (Fukayama et al., 2001; Glackin and Grula, 1990; Sheen, 1991).

CT Gene Family

Among the cold tolerance genes used in this study were C-repeats/Dehydration

Responsive Element (CRT/DRE), COR15a, Mischantus-PPDK and Calcium-dependent protein

kinases (CDPK). Sugarcane is essentially a tropical crop. Louisiana is the most northern latitude

at which sugarcane is grown such that cold tolerance is of unique importance to our breeding

program, hence our interest in genes involved in regulating cold tolerance. In the Louisiana

State University breeding program parents are not selected based on their status of cold tolerance

but cold tolerance plays an important role during seedling and clonal selection (Gravois, Personal

Communication)

CRT/DRE

CRT (C-repeats)/ DRE (Dehydration Responsive Element) are cis-element found in the

promoter regions of many cold and dehydration genes. A family of transcription factors known

as CBFs or DREB1s binds to this element and activates transcription of the downstream cold and

dehydration responsive genes (Liu et al., 1998; Stockinger et al., 1997). Interestingly, the

CBF/DREB1 genes are themselves induced by low temperatures. This induction is transient and

precedes that of the downstream genes with the CRT/DRE cis-element (Thomashow, 1999).

Therefore, there is a transcriptional cascade leading to the expression of the DRE/CRT class of

genes under cold stress (Chinnusamy et al., 2004). It was definitively demonstrated that in

higher plants induction of the CRT (C-repeats)/ DRE (Dehydration Responsive Element)-regulon

increases the freezing as well as drought tolerance of Arabidopsis thaliana plants (Liu et al.,

1998) providing strong support for the notion that a fundamental role of cold-inducible genes is

to protect plant cells against cellular dehydration (Shinwari et al., 1998).

11

COR15a

Cold acclimation in plants is associated with the expression of COR (cold-regulated)

genes. Artus et al. (1996), working with COR15a, provided the first direct evidence for a cold-

induced gene having a role in freezing tolerance. COR15a encodes a 15-kDa polypeptide that is

targeted to the chloroplasts. Upon import into the organelle, COR15a is processed to a 9.4-kDa

polypeptide designated COR15am. Artus et al. (1996) demonstrated that constitutive expression

of COR15a in nonacclimated transgenic Arabidopsis thaliana plants increases the freezing

tolerance of both chloroplasts frozen in situ and isolated leaf protoplasts frozen in vitro by 1 to

2°C over the temperature range of -4 to -8 °C. At first, it appeared that expression of COR15a

might also have a slight negative effect on freezing tolerance of protoplasts over the temperature

range of-2 to-4 °C but subsequent experiments showed this not to be true (Steponkus et al.,

1998). It was originally assumed that the protoplasts isolated from the leaves of nonacclimated

transgenic plants would have the same intracellular osmolality as those isolated from wild-type

plants. Protoplast survival tests indicate that expression of COR15a has only a positive effect on

freezing tolerance over the temperature range of -4 to -8 °C (Thomashow, 1999).

Mischanthus-specific PPDK

Possessing the efficient C4 photosynthetic pathway yet tolerant of cool temperate

climates, Miscanthus (Miscanthus x giganteus) is potentially an ideal energy crop and has found

use as a bioenergy product in most of Europe (Bioenergy Information Network, 1999) . The

rhizomatous perennial grass Miscanthus x giganteus is from the same taxonomic group as

sugarcane, sorghum (Sorghum bicolor), and maize (Zea mays) and uses the same C4

photosynthetic pathway (Naidu and Long, 2004). It was postulated that the low-temperature

tolerance in Mischantus x giganteus, in addition to high efficiency in photosynthetic rate,

correspond to its maintenance of high levels of total soluble protein, particularly PPDK and

12

Rubisco (Naidu et al., 2003). The gene sequence used here is a Mischantus x giganteus-specific

PPDK reported by Naidu et al. (2003).

CDPK

CDPK (calcium-dependent protein kinases) sequence used here are from a report using

Saccharum officinarum EST database (Casu et al., 2004). CDPK, a large superfamily of kinases

are thought to function in signal transduction pathways that utilize changes in cellular Ca++

concentration to couple cellular responses to extracellular stimuli (Harmon et al., 2001). CDPK

phosphorylate and regulate the activity of PEP carboxylase, an enzyme important in C4

metabolism and is also involved in photosynthesis as well as stress like cold tolerance (Winter

and Huber, 2000).

TRICH Gene Family

Phenotypic variability for pubescence (trichomes) among sugarcane clones range from no

pubescence to very pubescent. Sugarcane breeders do not pay much attention to phenotypic

variability for hairiness during selection, although pubescence has been implicated in insect

resistance in other crops such as cotton and tomato (Kennedy, 2003; Lahtinen et al., 2004;

Wright et al., 1999). We included trichome development as one of the characters because, unlike

sucrose and cold tolerance, it is a trait that normally does not undergo intentional selection. The

trichome genes are all from Arabidopsis thaliana as trichome development has been studied

extensively in this model plant. The trichome development genes used were Glabrous 1 (GL1),

Glabrous 2 (GL2) and Transparent testa glabra (TTG1).

GL1

The glabrous1 mutant (gl1), which lacks trichomes on most surfaces, was used in early

gene mapping studies (Marks, 1997). The GL1 gene encodes a protein with two myb

transcription factor repeats and a carboxy-terminal domain of approximately 120 amino acids.

13

Myb are a large family of transcription factors encoding proteins that are crucial to the control of

proliferation and differentiation in a number of cell types. The carboxy-terminal region does

contain several clusters of acidic amino acids that may function as transcriptional activators. In

the weak allele gl1-2 the molecular lesion is a small deletion that results in the loss of the

terminal 27 amino acids. The missing region contains one of the acidic clusters. It has been

found that the gl1-1 allele contains a deletion removing the complete GL1 coding region as well

as flanking promoter elements, and the only phenotype is a loss of trichomes (Marks, 1997).

Thus, it is likely that the function of GL1 is restricted to controlling trichome development.

GL2

In situ hybridization analysis indicates that GL2 mRNA is expressed strongly in

developing trichomes. Immunolocalization of the GL2 protein and the analysis of plants

containing a GL2 promoter GUS reporter gene construct (GL2GUS) indicate a more complex

pattern of expression. By genetic analysis, GL2 function downstream of GL1, but GL1 does not

control the nontrichome expression pattern of GL2. Although the GL1 protein does not regulate

the early expression pattern of GL2, it or another myb protein could influence the expression of

GL2 in developing trichomes by binding to the myb binding site (Marks, 1997).

TTG1

It has been found that plants doubly heterozygous for both weak Transparent testa glabra

1 (ttg1), along with gl1 mutant allele, have greater than normal numbers of clustered trichomes;

that is, lateral inhibition appears to be reduced (Marks, 1997). Larkin et al. (1994) found that

plants heterozygous for TTG (TTG/ttg) and one or two copies of the 35SGL1 construct have a

greater number of leaf trichomes than plants that have one or two copies of 35SGL1 in a

homozygous TTG background. Approximately, 30% of the trichomes on 35SGL1/-TTG/ttg

14

were present in clusters. This suggests that the levels of TTG1, together with GL1 expression,

are important in controlling lateral inhibition of cells towards developing into trichomes.

Objectives

1.) To determine the level and pattern of genetic diversity among commercially important

sugarcane cultivars of Louisiana and some of their parents using the Target Region

Amplification Polymorphism (TRAP) technique.

2.) To compare the results from genetic diversity generated by the TRAP marker

technique with that calculated using coefficient of parentage.

15

MATERIALS AND METHODS

Plant Materials

A total of 63 commercial sugarcane clones were used in the study. These entries were

selected such that they included most of the commercial sugarcane cultivars released since 1960s

(Table 1). The parents of these commercial cultivars, which are often released cultivars

themselves, were included whenever possible. Also included were several foreign cultivars and

a S. officinarum and S. spontaneum clone which are either of worldwide significance or local

interest. Examples include: POJ2878, CO312 and NCo310 which are among the early modern

sugarcane clones to gain widespread commercial success and are believed to be implicated in the

ancestry of modern sugarcane; LA (Louisiana) PURPLE, also known as Black Cheribon (Deren,

1995), is a S. officinarum credited to carry the highest sucrose content among the original

foundation clones; US56-15-8, an important S. spontaneum clone implicated in the ancestry of

LCP84-384, the dominant cultivar in Louisiana.

Leaf tissue samples were collected from three breeding stations. Most of the leaf samples

were collected from the crossing rack and parental nursery at the St. Gabriel Research Station,

St. Gabriel, Louisiana, and those entries that were no longer available at St. Gabriel were

collected from the historical nursery of the USDA sugarcane research unit, Houma, Louisiana

through the kindness of Dr. Thomas Tew; and the rest were kindly provided by Dr. Ray Schnell

of the USDA-ARS, Miami, Florida.

DNA Isolation

The leaf samples were manually ground into a powdery consistency in liquid nitrogen

using mortar and pestle. The leaf midrib was removed whenever possible to maximize DNA

yield recovery in this high-lignin grass species. The powdery samples were then placed inside a

50-ml centrifuge tube and stored at -800C freezer until DNA extraction.

16

Table 1. List of 63 clones used within the study and their breeding station origin.

Variety Breeding Station Origin Description Variety Breeding Station Origin Description CO421 Foreign HoCP92-631 Houma, LA via Canal Point, FL

CP44-154 Houma, LA via Canal Point, FL HoCP92-678 Houma, LA via Canal Point, FL

CP48-103 Houma, LA via Canal Point, FL HoCP96-540 Houma, LA via Canal Point, FL

CP52-68 Houma, LA via Canal Point, FL HoCP98-776 Houma, LA via Canal Point, FL

CP57-614 Houma, LA via Canal Point, FL L00-266 LSU, LA

CP62-258 Houma, LA via Canal Point, FL L01-281 LSU, LA

CP65-357 Houma, LA via Canal Point, FL L01-283 LSU, LA

CP70-1133 USDA, FL L01-292 LSU, LA

CP70-321 Houma, LA via Canal Point, FL L01-296 LSU, LA

CP70-330 Houma, LA via Canal Point, FL L01-299 LSU, LA

CP72-370 Houma, LA via Canal Point, FL L62-96 LSU, LA

CP72-356 Houma, LA via Canal Point, FL L65-069 LSU, LA

CP73-351 Houma, LA via Canal Point, FL L93-365 LSU, LA

CP74-383 Houma, LA via Canal Point, FL L93-386 LSU, LA

CP76-331 Houma, LA via Canal Point, FL L97-128 LSU, LA

CP77-310 Houma, LA via Canal Point, FL L98-209 LSU, LA

CP77-405 Houma, LA via Canal Point, FL LAPURPLE Foreign

CP77-407 Houma, LA via Canal Point, FL LCP81-010 LSU, LA via Canal Pont, FL

CP79-318 Houma, LA via Canal Point, FL LCP82-89 LSU, LA via Canal Pont, FL

CP85-830 Houma, LA via Canal Point, FL LCP85-384 LSU, LA via Canal Pont, FL

CP86-916 Houma, LA via Canal Point, FL LCP86-429 LSU, LA via Canal Pont, FL

HoCP00-927 Houma, LA via Canal Point, FL LCP86-454 LSU, LA via Canal Pont, FL

HoCP00-930 Houma, LA via Canal Point, FL LHo83-153 LSU, LA via Houma, LA

HoCP01-544 Houma, LA via Canal Point, FL POJ2878 Foreign

HoCP01-553 Houma, LA via Canal Point, FL TucCP77-42 Bred at Houma Selected at Tucumen,

HoCP03-741 Houma, LA via Canal Point, FL US01-39 Houma, LA RSB

HoCP03-760 Houma, LA via Canal Point, FL US01-40 Houma, LA RSB

HoCP85-845 Houma, LA via Canal Point, FL US56-15-8 Foreign Spontanuem

HoCP89-846 Houma, LA via Canal Point, FL US93-15 Houma, LA RSB

HoCP91-552 Houma, LA via Canal Point, FL US93-16 Houma, LA

HoCP91-555 Houma, LA via Canal Point, FL DWARF1 Unkown

NCo310 Foreign

LSU = Louisiana State University; USDA = U.S. Department of Agriculture; FL = Florida; LA= Louisiana; RSB = Recurrent selection for borer resistance.

17

Total genomic DNA was isolated from the ground leaf tissue using the DNeasy® Plant

Minikit (Valencia, CA) following the manufacturer’s protocol. The kit contains the spin

columns, buffers, reagents and collection tubes needed to carry-out a faster, less toxic and high

quality DNA extraction procedure. The kit elutes a volume of no more than 200µl of DNA

suspended in ddH2O which was enough for the purpose of the PCR reaction. The extracted

DNA were stored at -200C until use.

DNA yield and concentration was assessed by comparing it to a Low DNA mass ladder

(Invitrogen, Carlsbad, CA) in a 1% agarose gel.

TRAP - Fixed Primer Design

The fixed primer is designed from a gene or an EST sequence that is of interest to the

researcher. The arbitrary primer is designed to target either the exonic (GC rich) or intronic (AT

rich) region of the genome (Li and Quiros, 2001).

A total of 10 putative genes that are believed to be part of the metabolic pathway directly

or indirectly involved in the regulation of SUC, CT and TRICH were used in the study. Table 2

lists some characteristics of the genes used in the study.

The design of fixed primers was based on the method described by Hu and Vick (2003)

and can be briefly outlined as follows:

1. Identify genes of interest by reviewing published literature especially the biochemical

and physiological basis of how these genes are interrelated in trait expression. Genes

that are substantially established to play a distinct and critical role in a specific trait

expression are the first choice. We used several academic database search engines for

this: PubMed®, Web of Knowledge® and CabDirect®. The genes we used are shown in

Table 2.

18

Table 2. Gene family/genes with gene family and characteristics of genes included in the study.

Gene family/ genes Gene Size NCBI accession

no Primer sequence

Suc Metabolism

SuSy 2717 bp AF263384 GGAGGAGCTGAGTGTTTC

SAI 1808 bp

(partial) AF062735 AGGACGAGACCACACTCT

PPDK 3172 bp AF194026 CGTAAAGATTGCTGTGGA

Cold Tolerance

CRT/DRE 908 bp NM_118680 GCTCCGATTACGAGTCTT

Mischanthus-PPDK 3043 bp AY262272 GAAGCTTGAGCACATGAA

Cor15a 1921 bp U01377 AACATCCTCGATGACCTC

CDPK 552 bp CF572977 ACAGAACCACCAAAGGAG

Trichome Development

GL1 2951 bp AF263718 AAGGTGTTTCCTCCAAAA

GL2 2161 bp AY133700 ACGGCTCAGATCATTTCT

TTG1 1561 bp NM_180738 CCGCTCTTGAAAAATCC

2. Identify the nucleotide sequences of the genes of interest. This is commonly done by an

online search for a name and description match or close homology on anyone of the

publicly available EST and/or genomic libraries online. We used the National Center

for Biotechnology Information (NCBI) website (http://www.ncbi.nlm.nih.gov/) to obtain

EST nucleotide sequences for 10 genes involved in SUC, CT and TRICH expression.

As a way of confirmation, we used the nucleotide sequences to do BLAST (Basic

Alignment Search Tool) and compared sequence matches across a wider library and

hence a wider taxa.

3. Design an 18-nucleotide primer using a primer designer software such as Primer3

(http://www-genome.wi.mit.edu/cgi-bin/primer/primer3.cgi) and Oligoperfect™

19

designer (http://www.invitrogen.com/) with the following conditions: optimum size =

18bp; minimum Tm = 50ºC, optimum Tm = 53ºC and maximum Tm = 55ºC. Several

putative primers are given by the software. We choose the one closest to the 5’ end of

the gene.

TRAP - Arbitrary Primer Design

The arbitrary primers were paired with the anchored fixed primer to intentionally target

either the GC-rich exons or the AT-rich introns. The arbitrary primers were 18 nucleotides long

and were designed based on the principles originally conceived by Li and Quiros (2001) and

adapted by Hu and Vick (2003). Briefly, an 18-nucleotide must have the following:

1. 10-11 nucleotides as “filler” sequences of no specific constitution beginning at the 5’

end.

2. A core composed of 4-6 nucleotides of GC or AT rich region.

3. 3-4 selective nucleotides at the 3’ end.

The arbitrary primer was also designed to adhere to the basic rules of 40-60% GC content

and inability to form hairpins or secondary loops. The arbitrary primer sequences used in this

study were derived from Li and Quiros (2001) and are shown in Table 3. The four arbitrary

TRAP primers were synthesized commercially by MWG Biotech AG (Ebersberg, Germany) and

came labelled with IR dye 700 or IR dye 800 at the 3’ end.

Table 3. Arbitrary 18-mer primer sequences used in this study derived from Li and Quiros (2001). Name Sequence

arbit1 5′-GACTGCGTACGAATTAAT-3′

arbit2 5′-GACTGCGTACGAATTTGC-3′

arbit3 5′-GACTGCGTACGAATTGAC-3′

arbit4 5′-GACTGCGTACGAATTTGA-3′

20

TRAP - PCR Protocol

The LICOR 4300 NEN Global DNA sequencer used in the sugarcane genetics laboratory

is able to simultaneously read two independent fluorescent-labelled marker profiles (IR700 and

IR800). A single fixed primer can be paired with two arbitrary primers differentially labelled

with Infrared (IR®) dye. For a single reaction, two primer combinations can be simultaneously

prepared, loaded and visualized.

Each reaction was carried out in total volume of 10µl containing the components shown

in Table 4. Note that the two arbitrary primers, tagged with different IR® labels, were added

simultaneously in a single reaction. The conditions for PCR are shown in Table 5. All of the

PCR reactions were carried out in i-cycler™ (BioRad Labs, Hercules, CA).

Upon completion of the PCR cycles, 7ul of stop/loading buffer was added to the reaction

mixture. A 0.2ul aliquot was loaded into the 6.5 polyacrylamide gel in the LICOR 4300 Global

DNA sequencer. Electrophoresis was conducted at 1500v for 3.5 h. Digital tiff images were

derived from the software included with the LICOR 4300 Global DNA sequencer. The images

obtained from each PAGE gel was scored manually as present (1) and absent (0).

Table 4. TRAP protocol. The protocol is designed to run two separate reactions simultaneously. Component Stock concentration final concentration per reaction (ul)

PCR Buffer 10X 1x 1 BSA 10mg/ml 0.1mg/ml 0.1 dNTP 10mM 0.2mM 0.2 MgCl2 25 mM 2.5mM 1 fixed primer 1uM 1pM 0.5 arbit primer I (IR700 labelled) 1uM 1pM 0.5 arbit primer II (IR800 labelled) 1uM 1pM 0.5 Taq polymerase 5U 0.5U 0.2 DNA ~100ng 1 ddH2O 6 Total volume 10

21

Table 5. TRAP PCR conditions used in the study (Hu and Vick, 2003). Temperature Duration No. of Cycles Process

940C 4 minutes 1 cycle Denature

940C 45 sec Denature

350C 45 sec Annealing

720C 1 min

1 cycle Extension

940C 45 sec Denature

350C 45 sec Annealing

720C 1 min

35 cycles

Extension

720C 7 min 1 cycle Extension

Statistical Analyses

The binary data, consisting of 0s and 1s, were entered into Microsoft Excel™ spreadsheet

program from which genetic similarity was computed. In using molecular marker data,

amplified fragments are considered alleles. Thus, the degree of dis/similarity between two

genotypes, is a direct description of allelic variation (Nei and Li, 1979). Genetic similarity (GS)

can be loosely defined as the proportion of molecular markers common between the two

individuals being compared. Computation of GS is directly translated to comparing the number

of rows of marker fragments of (apparently) similar size separated by electrophoresis.

Genetic distance (GD) is the complement of GS (i.e. GD = 1 – GS).

The choice of the method to translate the marker data to a data matrix for analysis is

critical and is described in detail by other authors (Felsenstein, 1984). In genetic diversity

studies, the two most common GS coefficient are Jaccard’s (Jaccard, 1908) and Nei and Li’s

(Nei and Li, 1979). Both coefficients gave identical rankings among pair of inbred lines

(Mohammadi and Prasanna, 2003) but might differ when analyzing heterozygous loci (Link et

al., 1995). For this study, Jaccard’s coefficient was used as it is deemed most appropriate when

22

using a dominant marker like TRAP. The pairwise Jaccard index is calculated by the following

equation:

GSij = Nij /[Ni + Nj+ Nij ]

where Nij is the total number of bands common to lines i and j, and Ni and Nj are the number of

bands only present in i and j, respectively.

Jaccard’s method looks at positions on the gel as opposed to Nei and Li’s method which

evaluates individual bands. The numerator is the number of gel positions at which both

individuals have a band. The denominator is the total number of gel positions, which is the

number of bands of different size. Since the method can take into account the presence of null

alleles, it works well with data generated from dominant markers.

Marker Information

Two measures of marker information were computed in this study: Percent

Polymorphism and Polymorphism Information Content (PIC). Percent polymorphism reflects the

capacity of the marker to distinguish among a given set of genotypes based on the total

polymorphic alleles. Thus, it is computed by taking the proportion of polymorphic rows over the

total number of rows generated for each marker. PIC, on the other hand, takes into account the

number of alleles at a locus along with their relative frequencies in a given population.

(Vuylsteke et al., 2000).

The PIC is calculated as:

PIC = 1- Σ fi2

where fi is the frequency of the ith allele .

Bootstrap Analysis

The ability of TRAP markers to provide precise estimates of pairwise distances among

clones (Tivang et al., 1994) was estimated using the bootstrap method (Efron 1981). We used S-

23

Plus® (Insightful Corp, 2003), a statistical package, to submit 1,000 resamplings with

replacement of polymorphic markers in increments of 100. The average, the variance and the

coefficient of variation (CV) were estimated for each one of these combinations until the total

number of markers was reached. The corresponding mean CV and number of TRAP markers

were plotted to assess the relationship between them.

Cluster Analysis

The GS matrix generated from the 40 primer combinations was used to perform cluster

analysis using the UPGMA clustering method (Sneath and Sokal, 1973) following the Sequential

Agglomerative Hierarchical Nested (SAHN) cluster analysis module of NTSYS (Rohlf, 2000).

Cluster analysis is a common exploratory classification method employed in most diversity

analyses. It is particularly useful in discovering natural groupings among entries or items without

assumption on the number of groups or group structure (Johnson and Wichern, 2002). The

groupings are visualized as sub-clusters and clusters connected by branches and are called

dendrograms, phenograms or simply trees. Unweighted Pair Group Method with Arithmetic

Mean (UPGMA), the most straightforward method for tree construction, was used to visualize

the cluster pattern. UPGMA uses a sequential agglomerative clustering algorithm where

individuals, technically referred to as operational taxonomic units (OTUs) (Weir, 1996), are

clustered in order of similarity (using Jaccard’s coefficient), and the tree is built in a stepwise

manner. It starts by identifying two OTUs that are most similar to each other and then merging

these as a new single composite OTU. The pair with the highest similarity is subsequently

identified and clustered from among the new group of OTUs (composite and simple). This

continues until only two OTUs are left. The algorithm assumes that the two most closely related

OTUs are more similar to each other than they are to any other.

24

As cluster analysis will always form clusters, we tested the goodness-of-fit of the

dendrograms to the original GS matrix by computing the co-phenetic value (rcoph) using the

COPH (cophenetic) and MXCOMP (matrix comparison) modules of NTSYSPC version 2.11L

software (Exeter software, N.Y.; (Rohlf, 2000)). The COPH module computes a symmetrical

matrix of cophenetic (ultrametric) similarity or dissimilarity values in the form of a tree matrix

from the set of nested clusters produced by the SAHN clustering module. The MXCOMP

module tests the goodness of fit of the cluster analysis and the original GS matrix by computing

the product-moment correlation, r, and the Mantel test statistic, Z, to measure the degree of

relationship between the two matrices. An rcoph ≥ 80% is generally considered a good fit (Rohlf,

2000).

To assess if there are patterns of diversity for each gene family, separate cluster analyses

were done for each GS matrix generated by each gene family. To compare across the three

clusters, the abscissa (x axis) in the dendrograms were normalized so that it would fit the range

of the GS values generated from the four matrices (i.e. the matrix generated by the SUC, CT and

TRICH gene family and from the gene families taken together).

Mantel Test

The correlations between the overall, SUC, CT and TRICH GS matrices were assessed

using the Smouse-Long-Sokal 3-way Mantel test.

Mantel’s (Mantel, 1967) test evaluates the degree of independence between two sets of

objects by regression. However, unlike the traditional regression method, Mantel’s test proceeds

from a similarity (or dissimilarity) matrix and thus accommodates categorical data variables (e.g.

presence-absence data) often encountered in ecological and taxonomical studies. It involves

measuring the association between the elements in two matrices with a normalized simple Z

cross-product statistic, r. It then assesses the significance of this statistic by comparing it with the

25

distribution found by randomly reallocating the order of the elements in one of the matrices

(Urban, 2003). The Mantel three-way test, also referred to as Smouse-Long-Sokal 3-way Mantel

test (Rohlf, 2000), compares two GS matrices while holding the effects of a third matrix constant

and is described in detail by Smouse et al. (1986). The goal is to test the correlation between

matrices A and B while controlling the effect of a third matrix C in order to remove spurious

correlations.

Multidimensional Scaling (MDS)

Multidimensional Scaling (MDS) and Principal Coordinate Analysis (PCA) are often

used to supplement cluster analysis. Both can be used to represent the often complex relationship

among a set of points into a low dimensional space while retaining maximum information about

the relationship of the underlying data points. In this study, non-metric MDS was preferred over

PCA because it is not based on the general linear model assumed by PCA. This affords MDS a

relatively much better fit into fewer dimensions compared to PCA. Further, MDS is based on

distances between points which suites the data at hand, while PCA is based on the angles

between vectors. Nonmetric MDS, as opposed to metric MDS, may not use the actual pairwise

distances because it aims to preserve the order of distances among the individuals (Johnson and

Wichern, 2002).

The 63x63 GS matrix generated by the TRAP marker was fitted into two-dimensional

coordinates by MDS using the MDScale module of NTSYS-pc (Rohlf, 2000). The following

were the steps in computing MDS as outlined in the software manual (Rohlf, 2000): results of

PCA were used as an initial configuration so as to reduce the number of necessary iterations.

From this initial configuration, the GS between all pairs of points are computed and compared to

the original GS data. A monotone function was fitted to these two variables and the deviations

of the points from this monotone function were computed as a normalized sum of squared

26

deviations from the monotone relation. A coefficient of goodness of fit, called Stress, was then

computed to see how well the data behaved when projected into n dimensional space. Stress

ranges from from 0 to 1, with a lower value indicating a better fit. Normally if Stress is not

sufficiently small then the positions of the points in this configuration space are changed slightly

so as to reduce Stress. The Stress is recomputed, and this process is repeated until either the

maximum number of iterations specified is exceeded or a sufficiently good fit is obtained.

Finally, PCA was performed on the MDS result so as to align the axes with the major axes of

variation.

Minimum Spanning Tree (MST)

Minimum Spanning Tree (MST) analysis, which computes a minimum-length spanning

tree from the original GS matrix, was also done. This provides ease in detecting distortions

associated in projecting high dimensional data into low dimensional space. Specifically, MDS

detects pairs of points which look close together in a 2D plot but actually are far apart if other

dimensions are taken into account.

Mixed Model Analysis on MDS Axes

The MDS coordinates are quantitative variables derived from a weighted combination

from each of the TRAP markers used. As such, analysis of variance techniques can be used as

an exploratory tool to assess among-group variation as was done on other studies (Casler et al.,

2003; Curley and Jung, 2004). The 63 clones were grouped as old and new clones arbitrarily

assigned as pre- and post 1980, respectively. Thus, mixed model analysis was carried out on the

means of the two MDS coordinates to investigate if significant differences exist among era of

release (i.e. pre- vs. post- 1980). Era of release (i.e. old and new) was considered fixed effects

while clones within era of release were considered random effects. Statistical analysis were

carried out using Statistical Analysis Software, SAS, version 9 (SAS Institute Inc., 2002).

27

Analysis of Molecular Variance (AMOVA)

Analysis of Molecular Variance (AMOVA) (Schneider et al., 2000) is a statistical tool

originally developed for population genetics and is used to investigate the partitioning of the

total variation into intra-population and/or inter-population level (Excoffier et al., 1992). The

AMOVA component of the freely-available Arlequin ver. 2.00 software (Excoffier, 1996;

Schneider et al., 2000) was used to investigate variation among and within era of release.

AMOVA was also used to assess the variation among and within gene families and to calculate

the interpopulation distances (phi-statistic) stated by Huff (1997). Using the interpopulation

distance matrix, an ordination plot using MDS was then constructed to establish the trend of

association of the 10 genes used as fixed primers in this study.

Coefficient of Parentage

Modern sugarcane is an interspecific hybridThe pedigree history data for a particular

breeding population is often used by sugarcane breeders to determine genetic diversity. The

coefficient of parentage ( f ) (Kempthorne, 1957) or coancestry or coefficient of kinship

(Falconer, 1989) calculates the probability between any two clones in a population to have

identical alleles at the same locus by descent.

The calculation of coefficient of parentage (COP) was carried out using the Proc Inbreed

of SAS ver 9.0 (SAS Institute Inc., 2002). The generated pairwise f values among the 63 entries

were then used to construct a coefficient of parentage (COP) matrix. The cluster analysis was

consequently constructed based on this matrix.

Mantel test of matrix correlation was performed to assess the degree of relationship

between the COP and TRAP GS matrices.

Moreover, the pedigree data was also used to trace the foundation clones. Foundation

clones are at the beginning of a pedigree tree and is believed to be the few participants in the

28

genesis of the modern sugarcane (Saccharum spp.; 2n = 80-120) during nobilization. Foundation

clones are not a product of a cross, are unrelated to each other and in the case of sugarcane, can

be of different species. As the large saccharum genus represents a wide genetic diversity range

(GRIN, 2004) and modern sugarcane is an interspecific hybrid of several Saccharum species, it

is useful to look at the proportion of foundation clones in the ancestry of a breeding population.

29

RESULTS AND DISCUSSION

TRAP Marker Polymorphism

Genetic similarity was assessed among the 63 sugarcane clones comprised of

commercially important cultivars and some of their parents using TRAP markers. Forty TRAP

primer combinations using ten fixed forward primers (i.e. genes) and four arbitrary reverse

primers, gave a total of 3,170 bands of which 2,684 were polymorphic (85%) (Table 6). The

number of observable bands ranged from 25 in SuSy-TRAP arbit1 to 168 in SAI-TRAP arbit3

with an average of 79. There were large numbers of amplified bands, generally greater than 50

per gel. Occurrence of a highly polymorphic band profile in sugarcane can be attributed to the

fact that sugarcane is a highly self-incompatible, cross-pollinating, complex polyploid grass

species with homologous and homeologous chromosomes (D'hont et al., 1995; Ming et al.,

2001). It is expected that individual clones are highly heterozygous. The calculated

polymorphism was high, ranging from 64% (16/25) to 100% (168/168) with a mean of 84%

(Table 6). This points to the relative strength of this dominant marker system, compared to

AFLP® which was already known to give high degree of polymorphism, to discriminate among

sugarcane clones (Besse et al., 1998; Hoarau et al., 2001). Moreover, the observed band sizes

ranged from 30 to greater than 700bp, with a majority of polymorphic bands in the 30 to 200bp

range (Figure 1). These results are similar to those from AFLP® profiles generated in our

laboratory. Thus, TRAP markers are able to amplify small to large band sizes simultaneously.

These observations are consistent with the reports of Hu and Vick (2003) and Alwala et

al.(2003).

The numbers of bands per gene family were 1,051 for SUC, 1,316 for CT and 803 for

TRICH. Mean polymorphism rate was highest for CT which was 87% (1147/1,316); followed

by SUC which was 82% (901/1,051) and then TRICH which was 78% (637/803) (Table 6).

30

Table 6. TRAP marker information generated on 60 sugarcane clones using 40 primer combinations. Gene family/ Fixed primer Arbitrary primer No of rows polymorphic rows %Polymorphic PIC

Sucrose metabolism SuSy TRAP arbit 1 25 16 0.64 0.16

TRAP arbit 2 144 120 0.83 0.23

TRAP arbit 3 80 65 0.81 0.2

TRAP arbit 4 56 45 0.80 0.23

Average 76.25 61 0.77 0.205

Subtotal 305 245

SAI TRAP arbit 1 90 79 0.88 0.33

TRAP arbit 2 103 85 0.83 0.28

TRAP arbit 3 168 168 1.00 0.50

TRAP arbit 4 120 108 0.90 0.41

Average 120.25 110 0.90 0.4125

Subtotal l 481 441

PPDK TRAP arbit 1 71 64 0.90 0.34

TRAP arbit 2 100 80 0.80 0.31

TRAP arbit 3 28 19 0.68 0.16

TRAP arbit 4 66 52 0.79 0.27

Average 66.25 54 0.79 0.27

Total 265 215

Mean (Gene family) 0.82 0.30

Subtotal (Gene family) 1,051 901

Cold Tolerance CRT/DRE TRAP arbit 1 127 119 0.94 0.38

TRAP arbit 2 109 84 0.77 0.25

TRAP arbit 3 94 86 0.91 0.29

TRAP arbit 4 102 97 0.95 0.38

Average 108 96 0.89 0.325

Total 432 386

Mischanthus-PPDK TRAP arbit 1 67 66 0.99 0.41

TRAP arbit 2 31 29 0.94 0.27

TRAP arbit 3 52 39 0.75 0.21

TRAP arbit 4 72 60 0.83 0.29

Average 55.5 49 0.88 0.295

Total 222 194

31

Table 6. continued. Gene family/ Fixed primer

Arbitrary primer No of rows polymorphic rows

%Polymorphic PIC

Cor15a TRAP arbit 1 98 85 0.87 0.28 TRAP arbit 2 49 36 0.73 0.12 TRAP arbit 3 96 75 0.78 0.2 TRAP arbit 4 41 35 0.85 0.2 Average 71 58 0.81 0.2 Total 284 231

CDPK TRAP arbit 1 113 106 0.94 0.43 TRAP arbit 2 129 107 0.83 0.21 TRAP arbit 3 105 95 0.9 0.37 TRAP arbit 4 31 28 0.9 0.34 Average 94.5 84 0.89 0.3375 Total 378 336

Mean (Gene family) 0.87 0.29 Sutotal (Gene family) 1,316 1,147

Trichomes GL1 TRAP arbit 1 67 61 0.91 0.35

TRAP arbit 2 96 73 0.76 0.3 TRAP arbit 3 25 19 0.76 0.15 TRAP arbit 4 67 46 0.69 0.22 Average 63.75 50 0.78 0.255 Total 255 199

GL2 TRAP arbit 1 114 103 0.9 0.35 TRAP arbit 2 72 43 0.6 0.17 TRAP arbit 3 68 39 0.57 0.16 TRAP arbit 4 38 35 0.92 0.28 Average 73 55 0.75 0.24 Total 292 220

TTG1 TRAP arbit 1 63 54 0.86 0.4 TRAP arbit 2 96 94 0.98 0.26 TRAP arbit 3 34 25 0.74 0.23 TRAP arbit 4 63 44 0.7 0.17 Average 64 54 0.82 0.265 Total 256 218

Mean (Gene family) 0.78 0.25 Sutotal (Gene family) 803 631

Grand Ave 79.25 67 0.83 0.28 Grand total 3170 2684

32

Figure 1. An example of TRAP amplification pattern in sugarcane.

Boxplot (Tukey, 1977) representation of the datapoints per gene family showed the

relative data dispersion. The minimum and maximum (called whiskers) value were observed to

be approximately 55% and 100%, respectively. No other datapoints was observed outside the

whiskers thus there were no apparent outliers. The interquantile range (i.e. the box itself where

50% of the datapoints are contained) showed a taller box, hence a wider dispersion of datapoints,

for TRICH, followed by CT, then SUC. The median, the horizontal line inside the box, indicates

skewness when located off the middle of the box. Thus both SUC’s and TRICH’s distribution

were slightly skewed to the left while CT’s distribution was skewed to the right. In general,

however, the data distribution for each gene family did not depart from normality thus permitting

analysis assumed with normal distribution. A test for percent polymorphic mean (represented by

33

the horizontal line intersecting the boxplot) differences revealed that there were no significant

differences among the three gene families (data not shown).

Figure 2. Box plot representation of mean and standard deviation for SUC, CT and TRICH gene families. * Mean comparison used t-test at alpha = 0.05; Means with the same letter are not significantly different.

Polymorphism Information Content (PIC) Values

The PIC value per gene family ranged from 0.12 in COR15a_Arbit2 to 0.5 in SAI_Arbit3

with an average value of 0.28 (Table 6). PIC values per gene family averaged across the four

arbitrary primers were 0.29, 0.30 and 0.25 for SUC, CT and TRICH, respectively (Table 6). The

PIC range (difference of the minimum and maximum values) was observed highest in SUC,

followed by CT and then TRICH (Figure 3). No apparent outliers were observed. The box, which

represents middle 50% of the data, was relatively taller and hence indicated a wider dispersion

for TRICH and CT compared to SUC. Normality was assumed in analysing the PIC values

because the median (the line inside the box) was observed to be approximately in the middle of

the box. However, when t-test was performed on the mean PIC differences per gene family,

represented by the horizontal line intersecting the boxplot, no significant differences was

observed (Figure 3).

34

Figure 3. Box plot representation of PIC mean and standard deviation for SUC, CT and TRICH gene family. * Mean comparison used t-test at alpha = 0.05; Means with the same letter are not significantly different.

The PIC values obtained here are typical of dominant markers (e.g. RAPD, AFLP® )

which are comparatively lower than co-dominant marker systems (e.g. SSR and RFLP) (Powell

et al., 1996; Vuylsteke et al., 2000). Typically, dominant markers assume only two alleles (i.e.

states) – presence (1) and absence (0) - for a particular locus (i.e. band or row) and thus by

definition the highest frequency for a locus is 0.5. A PIC value of 0.28/0.5 indicates moderate

level of utility for TRAP markers used in this study which does not differ from the findings using

AFLP® (but using fewer AFLP markers) markers in our laboratory (Alwala et al., 2003).

Curiously, the percent polymorphism reported was high which suggests, contrary to PIC

finding, a high degree of genetic diversity among the clones. This apparent discrepancy is

resolved and better understood when we take into consideration how percent polymorphism is

calculated. It is calculated by counting bands that are polymorphic divided by the total number

of bands per TRAP gel. As the number of clones increases, each band has a greater chance of

being polymorphic. Thus, the probability of finding differences between two individuals

increases with an increase in the total number of individuals. Such observation is especially true

for a complex polyploid/anueploid like sugarcane where it is uncertain if each band positions

35

represents multiple copies of alleles from homologous or homeologous chromosome or

legitimate insertion-deletion mutations. In fact, for these reasons the codominant markers like

Simple Sequence Repeats (SSR) are routinely scored as dominant marker (0, 1) in sugarcane

(Cordeiro et al., 2003). Thus, it is conceivable that a high value in percent polymorphism is

observed although in reality only one or two individuals are consistently divergent among a set

of genetically similar individuals. Therefore, as an index of marker informativeness, percent

polymorphism is best seen as a qualitative measure of how useful a marker is, at least in a

polyploid crop like sugarcane. The specific number of percent polymorphism to nominate a

marker as “useful” or “not useful” is best left to experience and to the nature of the marker being

used. For this study, the high rate of percent polymorphism suggests that the TRAP markers

were useful in estimating GS.

Bootstrap Analysis

The bootstrap analysis showed that the accuracy of the GS estimates, measured by the

mean coefficient of variation (CV), increased according to the number of polymorphic loci

analyzed. There was an inverse relationship between the variance and the sample size of

markers used to estimate GS. The dispersion plot (Figure 4) shows a proportional asymptotic

decrease in the mean CV with an increasing sample size of markers. Various authors have

recommended a CV value of 10% to be sufficient in providing a good estimate of GS (Halldén et

al., 1994; Santos et al., 1994; Thormann et al., 1994) using the same technique as first expounded

by Tivang et al. (1994). In this study, about 600 polymorphic bands would be necessary to give

a CV value of 10.0%, (Figure 4). However, using all the 3,170 bands reduced the CV value to

approximately 4.80%. The number of markers used in this study provided a high degree of

reliability in estimating GS. In contrast, in assessing genetic diversity in 78 Brazilian sugarcane

clones, Lima et al. (2002) found a steeper slope which revealed that only about 250 AFLP®

36

markers were needed to give a CV of 10% and about 800 AFLP® markers were needed to give a

CV of 5%. When using TRAP markers, more bands or more markers, about twice that of

AFLP®, is needed to achieve similar precision in estimating GS. This is likely due to the fact

that TRAP is a gene-targeted marker system and hence is less likely to show high degree of

diversity on the highly conserved coding regions than AFLP®, which is designed to scan the

entire genome. When the objective of the study is to accurately assess genetic diversity, it seems

that AFLP® is more robust owing to its power to sample a wider portion of the genome.

However, if the objective is to assess genetic diversity in relation to a specific trait variability,

functional markers (Andersen and Lubberstedt, 2003) like TRAP is more appropriate, though its

bands are yet to be extensively verified if indeed they are allelic.

Figure 4. Plot of the coefficient of variation with its standard error (bars) estimated by bootstrap analysis using different number of TRAP bands.

When markers specific to each gene family were extracted from the original 3,170

markers and submitted for bootstrap analysis, about 800 polymorphic markers were found to

37

give a mean CV of 10% or less. Of note is the SUC’s steeper slope, giving a relatively more

precise GS estimates than CT and TRICH with a mean CV of 10% being achieved with 400

polymorphic bands (Figure 5).

Figure 5. Plot of the coefficient of variation estimated by bootstrap analysis using different number of resampled TRAP bands derived from SUC, CT and TRICH gene families.

Cluster Analysis

Figure 6 shows the dendrogram based on the GS matrix generated from the 3,170 TRAP

markers for all of the forty primers (cophenetic value = 0.85). GSjaccard for the 63 entries was

high, ranging from 0.78 to 0.94 with a mean of 0.86. Cultivars LHo83-153 and CP76-331 were

the most similar (GSjaccard = 0.94) followed by L97-128 and L98-209 (GSjaccard = 0.93). On the

38

other hand, US56-15-8 and CP79-318 were the most distant pair (GSjaccard = 0.854) followed by

HoCP01-553 and LCP86-454 (GSjaccard = 0.858). Figure 6 shows that the clusters were separated

by small GSjaccard differences (0.16). It is not surprising that the greatest distance was found

between a cultivar and a S. spontaneum clone (US56-15-8). This implies that this set of entries

share a rather large genetic similarity and hence exhibit low diversity among them. Upon further

scrutiny, it can be noted that most of the entries are descended from just a few cultivars which

are themselves related. For example, cultivars CP65-357, L65-069 and LCP85-384 are parents to

more than half of the entries in the study (Table 7). Thus, this pattern clearly shows that

diversification of the parents used in the crossing program should be of importance because the

releases are genetically similar (Deren, 1995). Similar observations were reported by Nair et al.

(2002) who investigated genetic diversity among 29 Indian commercial sugarcane cultivars using

Random Amplification Polymorphism (RAPD), also a dominant marker. Their cluster analysis

showed an alarmingly narrow genetic diversity among the cultivars (genetic distance = 0.28)

regardless of the location or era in which the cultivars were released.

Surprisingly, Figure 6 shows that the overall cluster patterns were not reflective of the

pedigree relationships of the entries as listed in Table 7. Most entries of the same parentage were

grouped into different clusters. LCP85-384, currently the most important clone in Louisiana,

clustered separately from its parental cultivars, CP77-310 and CP77-407. Further, the progenies

of LCP85-384 (L98-209, L97-128, L01-299, L01-296, L01-292, L01-283, L01-281, HoCP98-

776, HoCP96-540, HoCP03-760, HoCP03-741, HoCP01-553, HoCP01-544, HoCP00-927) were

also grouped into different clusters. Moreover, TucCP77-42, also a product of S. spontaneum

introgression program, belonged to a different cluster.

Within the narrow GS window of this population, US56-15-8 clustered together with

CP79-318 at a GSjaccard level of 0.78 as an outgroup from the rest of the entries. Based on

39

pedigree records, US56-15-8, a S. spontaneum species from Thailand, can be traced as the most

common source of wild (S. spontaneum) background in the Louisiana cultivars as well as the

Hawaiian cultivars (Heinz, personal communication). It is not unusual, therefore, that US56-15-

8 showed a high degree of GS index with these group of cultivars despite being a different

species (i.e. S. spontaneum). However, CP79-318 was found to be divergent and distinct from

the rest of the released cultivars as it clustered along with US56-15-8. Unless this is a

mislabelled clone, it is unusual since its parents, CP65-357 and L65-69, had three other progeny

clones in different clusters. More unusual is the fact that CP73-351, CP73-383 and CP76-331 are

full sibling but did not cluster together. With few exceptions, the same pattern of relationship

where half-siblings and even full-siblings failed to group in the same cluster was common in this

study.

The cluster of an “old” clone: CP52-68 and a “new” clone: CP85-830 at GS = 0.89,

depicts an important consideration in dealing with genetic structure in sugarcane. It turns out the

relatively “new” clone, CP85-830, is a fourth-generation S. spontaneum (US56-15-8) backcross

and is, therefore, genetically closer to the “old” CP52-68 which is just three generations away

from the first interspecific crosses. This instance demonstrates the lack of structure in sugarcane

populations wherein breeding generations overlap. This makes characterization of diversity

with the goal of forming heterotic groups in sugarcane particularly difficult (Lu et al., 1994).

Cluster Analysis for Each Gene Family

Cluster analysis for each gene family showed discrete grouping pattern but with no clear

discernable clusters. Figure 7 is the resulting dendrogram based on 1,051 TRAP markers

generated by the SUC gene family (SuSy, PPDK and SAI). The SUC GSjaccard index ranged

from 0.66 to 0.92 with a mean of 0.79 and a cophenetic value of 0.81. In terms of pairwise

distances, CP89-846 and L65-69 were the most similar (GSjaccard = 0.92). CP85-830 and CP52-68

40

Table 7. Pedigree relationship of the 63 clones. Clone Female Parent Male Parent Description Detail US56-15-8 Common LSU S. spontanuem parent

Dwarf1 Dwarf gene

CP77-310 CP52-68 L65-69 Parental Female parent of LCP85-384

NCO310 CO421 CO312 Foreign Commercial South Africa released cultivar

TucCP77-42 CP71-321 US72-19 Commercial Crossed in Houma, released in Tucumen, Argentina

CP77-407 CP71-421 CP66-315 Parental male parent of LCP85-384 L62-96 CP52-68 CP44-154 Commercial Released 1969 L65-69 CP52-001 CP48-103 Commercial Released 1972 CP65-357 CP52-68 CP53-17 Commercial Released 1973 CP70-321 CP61-39 CP57-614 Commercial Released 1978 CP70-330 CP61-39 CP57-614 Commercial Released 1978 CP72-370 CP61-037 CP52-68 Commercial Released 1980 CP72-356 CP63-361 CP62-258 Commercial Released 1980 CP73-351 CP65-357 L65-69 Commercial Released 1981 CP74-383 CP65-357 L65-69 Commercial Released 1982 CP76-331 CP65-357 L65-69 Commercial Released 1984 CP79-318 CP65-357 L65-69 Commercial Released 1987 LCP82-89 CP52-68 CP72-370 Commercial Released 1990 LCP85-384 CP77-310 CP77-407 Commercial Released 1993 LHo83-153 CP77-405 CP74-339 Commercial Released 1991 HoCP85-845 CP70-370 CP77-403 Commercial Released 1993 LCP86-454 CP77-310 CP69-380 Commercial Released 1994 HoCP91-555 CP83-644 LCP82-094 Commercial Released 1999 L97-128 LCP81-010 LCP85-384 Commercial Released 2004 La Purple Foundation clone High sucrose HoCP00-927 CP89-831 LCP85-384 Experimental HoCP01-544 LCP85-384 LCP86-454 Experimental HoCP01-553 LCP85-384 LCP86-454 Experimental HoCP03-741 LCP85-384 HoCP92-631 Experimental HoCP03-760 LCP86-454 HoCP92-631 Experimental HoCP91-552 CP81-010 CP72-356 Experimental HoCP96-540 LCP86-454 LCP85-384 Commercial Released 2003 HoCP98-776 LCP85-384 CP70-1133 Experimental L00-266 HoCP89-846 L93-386 Experimental L01-281 LCP86-429 LCP85-384 Experimental L01-283 L93-365 LCP85-384 Experimental L01-292 CP65-357 LCP85-384 Experimental L01-296 CP65-357 LCP85-384 Experimental L01-299 L93-365 LCP85-384 Experimental L98-209 LCP86-454 LCP85-384 Experimental LCP81-010 CP74-328 CP70-1133 Experimental HoCP93-775 CP86-916 CP85-830 Experimental (Borer resis.) US01-39 HoCP92-678 US93-15 Experimental (Borer resis.) US01-40 HoCP93-775 US93-16 Experimental (Borer resis.) POJ2878 Foreign Commercial Bred at Java, Indonesia Note: entries in grey were clones used in the study.

41

and L01-281 and DWARF1 (GSjaccard = 0.85) were the most distant pairs.

Figure 8 is the CT dendrogram based on 1,316 TRAP markers generated by the CT gene

family composed of CDPK, Mischanthus, COR15a and CRT/DRE. The CT GSjaccard index

ranged from 0.79 to 0.95 with a mean of 0.87 and a cophenetic value of 0.85. In terms of

pairwise distances, LHo83-153 and CP76-331 were the most similar (GSjaccard = 0.95); CP79-318

and US56-15-8 were the most distant pair (GSjaccard = 0.85).

Finally, Figure 9 is the dendrogram of the 63 entries based on the 803 TRAP markers

generated from the TRICH gene family (GL1, GL2 and TTG1). The CT GSjaccard index ranged

from 0.86 to 0.98 with a mean of 0.92 with a cophenetic value of 0.80. In terms of pairwise

distances, CP76-331 and L62-96 were the most similar (GSjaccard = 0.98). The most distant pair

was HoCP01-553 and LCP86-454 (GSjaccard = 0.89).

For the range of the GS values for the three gene families, we found SUC > CT > TRICH

which were similar to the PIC values for the gene families. This suggests the relative power of

PIC compared to percent polymorphism in determining the measurement of genetic diversity.

The high degree of cophenetic value, rcoph, for each of the clusters indicates that the

clusters were in high agreement to the underlying GS value from which they were derived.

US56-15-8, a S. spontaneum clone, was observed to be an out-group only in the SUC-

based cluster but not in the other two gene family clusters. US56-15-8 was markedly disticnt

from the rest at GS = 0.66 in SUC-based cluster (Figure 7) while it clustered with clones in the

CT- or TRICH-based clusters (Figs. 8 and 9). Moreover, the GS index where US56-15-8

grouped was comparatively lower in CT at GS = 0.87 (Figure 6) and TRICH at GS = 0.93

(Figure 7). Since S. spontaneum clones are known to confer anything but the high sucrose

content trait, one may speculate that the SUC based cluster might be reflective of “sugar” gene

pattern. In addition, LA PURPLE, a S. officinarum clone considered to be a foundation clone

42

and known to be of high sugar content was observed to have a greater similarity with the

cultivars in the SUC dendrogram (Figure 7). In particular, LA PURPLE clustered with CP77-

310 at GS = 0.91; an interesting result considering that CP77-310 is the male parent of the

popular and high yielding LCP85-384. The S. spontaneum ancestry of LCP85-384, being a

fourth generation S. spontaneum backcross, came from its female parent lineage (CP77-407).

Thus, one can speculate that the above average performance of LCP85-384 is due to the

appropriate “blend” of the high-sugar-content genes with a hardy, S. spontaneum background.

The “LAPURPLE- CP77-310” cluster was also observed in CT but not in the TRICH

dendrogram.

LCP85-384 was an out-group in the CT-based cluster and grouped closer to US56-15-8

than to the other clones (Figure 8). US56-15-8, collected from Thailand, is one of the most

important introduced accessions to have successfully conferred wide-adaptability traits to the

germplasm diversity in Louisiana. LCP85-384 is a BC4 clone derived from S. spontaneum

(US56-15-8) introgression. This may explain the observed hardiness of LCP85-384. It was

reported that S. spontaneum and Miscanthus spp. species have better adaptability to a wide range

of conditions including cold tolerance.

Comparison of GS Trends Among Gene Families

The Smouse-Long-Sokal 3-way Mantel test (Smouse et al., 1986) showed a moderate

correlation between the three gene family GS (Figure 10). The SUC and CT GS matrices gave r

= 0.47; CT and TRICH gave r = 0.53 while SUC and TRICH had an r = 0.18. All the three

correlation indices were significant at 5% error level. As mentioned earlier, none of the three

clusters looked alike but the results from the Mantel test manifested itself in the sense that there

were instances where two clones grouped similarly in the CT and TRICH clusters relative to the

SUC and TRICH clusters. Whether these results are associated with pleiotropism can only be

43

speculated since the GS measure is based on fragment size and not sequences. Only sequence

comparison would suffice to infer about the relationship between these genes. It should be noted

that similar arbitrary primer sequences were used. It should also be noted (and discussed later)

that there were overlaps between the gene function of genes used in this study, for example

PPDK is involved in both SUC and CT pathways.

AMOVA on Gene Family Structure

The amount of variation among gene families and among genes within gene families was

assessed using Analysis of Molecular Variance (AMOVA). A distance matrix using pairwise

difference was generated from the 2,828 usable TRAP markers computed by treating the 63

clones as the independent variables.

There were no significant differences among the three gene families SUC, CT and

TRICH and variation between genes families accounted for only about 2.04% of the total

variation (Table 9). The genes nested within each gene family were significantly different from

each other although the amount of variation it accounted for was low (3.9%). The greatest part

of the TRAP variation (94%) was found among the markers comprised of the 40 fixed-arbitrary

primer pair. Therefore, taking genetic variation and polymorphism as synonymous, it seems that

increasing the number of arbitrary primers would detect more polymorphism than would be

achieved by increasing the number of fixed primers for a constant number of arbitrary primers.

The fixed primer is anchored in the same portion on the gene, but the arbitrary primers

would sample a different part of the genome independent from each other. But ultimately, the

only way of sampling variation for a trait using TRAP markers would be to include the

appropriate combination of fixed and arbitrary primers which must include most of the genes

(fixed primers) involved in the expression of the trait.

44

Figure 6. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph = 0.85) using TRAP markers from all the gene families (SUC, CT, TRICH; see Materials and Methods).

45

Figure 7. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph = 0.82) using the SUC Trap markers; see Materials and Methods.

46

Figure 8. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph = 0.86) using the CT TRAP markers; see Materials and Methods

47

Figure 9. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GSJaccard estimates (rcoph = 0.85) using TRICH; see Materials and Methods.

48

Figure 10. Smouse-Long-Sokal three way mantel test from SUC, CT, TRICH GS matrices. note: all matrix are significantly different at p = 0.05

Using MDS and minimum spanning tree to project the pair-wise distances generated by

AMOVA, we can visualize how the 10 genes are associated with each other (Figure 11).

Generally, the results mimic those from the Mantel test (Figure 10). The CT and TRICH genes

seemed to cluster into a loose group away from the SUC genes. When we looked at the

individual genes, we found that SAI, which is classified as part of the SUC gene family (sucrose

49

accumulation), was actually closer to the cold-tolerant gene CDPK. This is not unexpected as

CDPK (and PPDK) are part of a larger superfamily that is involved in several biochemical

pathways mostly in response to stress (Cheng et al., 2002). Interestingly, stress is used in

sugarcane to boost sucrose accumulation through the application of ripeners. Assuming that

some of the sucrose accumulation and cold tolerance genes respond to stress, some of the genes

among our nominated gene family groups may share some similarity if they diverged not too

long ago. By the same token, it can be said that the significant low correlation (r = 0.18)

between SUC and TRICH (Figure 10) and the lack of association between SUC and TRICH

genes (Figure 11) is due to the fact that the genes overlapped less in terms of regulatory function

and their sequences. As earlier mentioned these results are speculative and can be substantiated

only through sequence comparisons.

Table 8. Analysis of Molecular Variance partitioning genetic variation for gene family, genes within gene family and genes. Source Sum of Variance Percentage Variation d.f. Squares Component of Variation Among Gene Family 2 494.755 0.16656 ns 2.04 Among Gene Within Gene Family 7 675.508 0.32199 ** 3.95 Within Genes (markers) 2792 21422.802 7.67292 ** 94.01 Total 2801 22593.065 8.16147

Multi-dimensional Scaling (MDS) Analysis

The computed genetic similarities between the 63 entries, derived from the 3,170 TRAP

markers, were graphically represented with a multi-dimensional scaling (MDS) plot (Figure 12)

using the MDScale module of NTSYS (Rohlf, 2000). The STRESS of the plot was 0.242,

50

suggesting a relatively fair fit of the interobject MDS projection of the GS matrix. Up to 77% of

variance in genetic distance data was explained by the two MDS coordinates as opposed to the

18% for the first two vectors when explored with Principal Component Analysis (Data not

shown).

Figure 11. Bi-plot of the first two MDS scales among the ten genes derived from pairwise distance generated by AMOVA. Note: Minimum spanning tree are the lines that connects the two nearest points.

Most of the entries form a tight group whereas the older clones (i.e. those bred before

1980) and clones derived from recent S. spontaneum backcross were generally in the periphery:

CP79-318, POJ2878, CP52-68 are old clones while LCP85-384, CP85-830 and HoCP01-553 are

products from S. spontaneum backcrossing (Figure 12). This trend was not apparent in the

cluster analysis. A large part of the remaining entries are comprised of full siblings, half siblings

and their parents; the observed tight grouping describes the high genetic similarity that exists

among such clones.

Clones bred and selected at LSU (the L clones), in particular, composed the observed

tight grouping because most of the entries were closely related (i.e. full siblings and half siblings,

refer to Table 7). Further, the tight grouping also revealed LSU and Houma bred clones to be

51

Figure 12. Bi-plot of the first two MDS scales among the 63 clones using GSJaccard derived from TRAP markers. Note: Minimum spanning tree are the lines that connects the two nearest points.

closely related probably because both stations embarked on a basic breeding program that

involves backcrossing to S. spontaneum and both share germplasm with accordance to the

“Three-way Agreement”(St. Gabriel/Sugar Research Station, 2004). For example, US56-15-8 is

one of the wild accessions collected from expeditions to sugarcane’s centers of diversity that

proved successful in introgressing the desired trait of disease resistance to the LSU and Houma

active germplasm pool (Kimbeng, personal communication). Today, three of the five

commercially recommended clones in Louisiana, LCP 85-384 (Louisiana’s leading commercial

clone), HoCP85-845, and HoCP96-540, have US56-15-8 in their pedigree.

52

For MDS to show patterns of grouping within the narrow GS window attest to the

validity and importance of supplementing cluster analysis with ordination analysis, like PCA or

MDS, in exploratory studies like this. It may allow the detection of subtle but significant

divergence among clones.

However, assessing differences based on breeding stations was not pursued because there

was a continuous active exchange of breeding materials among the LSU, Florida and Houma

breeding stations. These breeding stations exchange parental clones and do joint evaluation of

clones in the advance selection stages. Noteworthy to mention are the CP clones used here which

were actually bred and selected for LSU. Previous nomenclature in assigning variety name use

only “CP” as prefix. Distinction among clones bred from different breeding stations is

determined by the unique clone number range assigned to each station. Except for CP70-1133,

the CP clones used here are bred and selected for Louisiana (Gravois, personal communication).

It was only later that clones selected at LSU and Houma had the prefixes L and Ho, respectively.

The new nomenclature was modified to also accommodate the active exchange of breeding

materials. A prefix of LCP means that the clone was bred in Florida (CP clones) and selected in

LSU (L clones). The same is true for LHo, HoCP and HCP clones. A quick evaluation of the

parents of these commercially important entries (Table 7) and the general cluster analysis results

previously discussed (Figure 12) attest to the active exchange of materials among the stations.

One particular breeding station would have any of the clones bred and selected by the other

stations as parental clones. For example, L clones would include the L, CP, Ho and various

combinations of L, CP and H prefixes in their parental profile.

Analysis of Molecular Variance (AMOVA)

AMOVA showed no significant difference among old and new clones. Further, such

grouping accounts for a very small portion (0.08%) of the total variation (Table 9). Instead,

53

almost all of the observed genetic variation (99.92%) can be ascribed to the differences between

clones within era of release. Despite the low genetic diversity among these clones, each clone

possesses unique genetic variation. This reinforces the same intriguing pattern in cluster analysis

where related clones and even fullsibs and half siblings failed to group together. This results,

although not entirely expected, reveals a high level of heterozygosity and variation among the

individual sugarcane clones. High molecular variation for individual clones within populations

is common to clonally propagated, out-crossing polyploid species and has been reported in

buffalo grass (Buchloe dactyloides) (Huff et al., 1993) ; blue grama (Bouteloua gracilis) (Phan

and Smith, 2000) and crested wheatgrass (Agropyron spp.) (Mellish et al., 2002).

Table 9. AMOVA on 63 clones based on era of release (i.e. old or pre 1980 vs. new or post-1980).

Source Sum of Variance Percentage Variation d.f. Squares Component of Variation

Among Era of release 1 64.747 0.05330 ns 0.08 Within Era of release 61 3727.253 63.17379 99.92 Total 62 3792.000 63.22709

When the MDS-derived x and y coordinates of the 63 entries were analyzed by Mixed

Model Analysis based on the era of release (Figure 13), significant differences were found in X-

but not in the Y-axis (Table 10). The significant difference at the x-axis can be attributed to the

few clones that were mostly classified as old clones that were located at the periphery. In

general, it was observed that the minimum spanning tree connected old and new clones at the

center. Very few of the clones bred before 1980, arbitrarily named here as “old” clones, seem to

be more divergent from those bred post-1980s, arbitrarily named “new” clones. This is because

54

the breeding generations in sugarcane overlap as a result of sugarcane breeders’ bias to

repeatedly use a few parental clones with proven cross status each crossing cycle.

Figure 13. Bi-plot of the first two MDS scales among the 63 clones classified into old and new using GSJaccard derived from TRAP markers.

Table 10. Least square means and t test of the MDS axes for Era of cultivar release. No. of X coordinate Y coordinate

ERA Clones Least Sq Mean Least Sq Mean New 38 0.145187 a 0.0240854 a Old 25 -0.1989599 b -0.0175758 a

Means connected by the same letter are not significantly different at alpha = 0.05.

MDS With Mixed Model Analysis On Era Per Gene Families

SUC

A mixed model analysis based on SUC markers looking at differences among era of

release (Figure 14) showed that old and new clones were not significantly different from each

other both in the x and y MDS axis (Table 11). Such results reflect overlapping generations that

sugarcane pedigree is noted for. As was earlier mentioned, this was brought about by the bias of

55

using proven parents repeatedly during crossing. Except for some L and HoCP clones, most of

the new clones have old clones as parents (Table 7). The L and HoCP clones, however, were

noted to have high occurrence of LCP85-384 as either female or male parent. This is the

consequence of a concerted breeding effort to maximize the general combining ability of the

LCP85-384 in the hopes of replicating its exceptional field performance. However, to

definitively conclude progress has been made in terms of total sucrose yield from old to new

releases, yield performance is needed to supplement the data.

Figure 14. Bi-plot of the first two MDS scales among the 63 clones classified into old (Pre-1980) and new (post-1980) using GSjaccard derived from TRAP markers of the SUC gene family.

Table 11. Least square means and t test of the MDS axes for Era of cultivar release based on SUC GS.

No. of X coordinate Y coordinate ERA

Clones Least Sq Mean Least Sq Mean

New

38 0.0644866a 0.0158014 a

Old

25 -0.0883706 a -0.0216537 a

Means connected by the same letter are not signficantly different at alpha = 0.05

56

CT

When old and new clones were compared based on the CT gene family (Figure 15), no

significant differences were found (Table 12). Other than noting potential parents showing rapid

recovery from winter season from field observation data, breeding for cold tolerance is not

formally integrated in the breeding and selection program of LSU despite it being a highly

desired trait. Hence, it was expected that grouping pattern based on era of release, or breeding

stations, would shown significant differences for such a small activity as was indeed shown here.

Figure 15. Bi-plot of the first two MDS scales among the 63 clones classified into old (pre-1980) and new (post-1980) using GSJaccard derived from TRAP markers using CT genes.

Table 12. Least square means and t -test of the MDS axes for Era of cultivar release based on CT GS.

No. of X coordinate Y coordinate Level Clones Least Sq Mean Least Sq Mean

New 38 0.0158014 a 0.0844352 a Old 25 -0.0216537 a -0.1157075 a

means connected by the same letter are not signficantly different at alpha = 0.05

57

TRICH

Again, when old and new clones were compared using TRICH gene family (Figure 16),

no significant differences were observed (Table 13). This was expected as trichomes, at least in

sugarcane breeding, are not considered during selection.

Figure 16. Bi-plot of the first two MDS scales among the 63 clones classified into old (pre-1980) and new (post-1980) using GSJaccard derived from TRICH markers.

Table 13. Least square means and t test of the MDS axes for Era of cultivar release based on TRICH GS.

No. of X coordinate Y coordinate Level Clones Least Sq Mean Least Sq Mean New 38 0.1598493 a 0.0816873 a Old 25 -0.2190528 a -0.1119419 a

means connected by the same letter are not significantly different at alpha = 0.05

Pedigree Analysis

To compare COP and TRAP marker, 60 clones with complete pedigree records were used.

Three entries with incomplete, unknown or doubtful pedigree records were taken out from the

58

original list of 63 entries. These were Louisiana Purple, US56-15-8 and DWARF1.

Tracing the lineage of the 60 clones back to the foundation clones allows a unique

perspective in appreciating the diverse genetic background (or lack thereof) of this set of clones.

Foundation clones can be defined as a progenitor on the very top of the pedigree tree and is

believed to be one of the few participants in the genesis of modern sugarcane (Saccharum spp.;

2n = 80-120). Foundation clones are not the product of a cross, are unrelated to each other and

in the case of sugarcane, can be of different species. Tracing back the pedigree records for each

of the 60 clones, 18 foundation clones were identified. The 60 clones can be traced back to ten

S. officinarum, three S. spontaneum, two Barberi spp., one Saccharum x Sinensi hybrid and one

Sinensi spp. foundation clones (Table 14). Closer inspection showed that 13 foundation clones

were found to be prevalent, above 80% (Table 14), among this set of clones indicative of the

importance and influence of these clones to the U.S. sugarcane breeding program.

Table 14. Foundation clones for the 63 entries. Foundation Clone Species Description Chunnee S. barberi 2n = 92 Kansar S. barberi 2n = 92 28 NG 251 S. robustum Ashy Mauritius S. officinarum Banjermasin Hitam S. officinarum Black Cheribon S. officinarum Crystalina S. officinarum EK28 S. officinarum 2n = 80 FX1 S. officinarum Java Officinarum S. officinarum Loethers S. officinarum Rose Bamboo S. officinarum Saccharum Officinarum S. officinarum Striped Mauritius S. officinarum Okinawa Tekcha S. sinensi Indian Spontanuem S. spontanuem Java Spontanuem S. spontanuem US56-15-8 S. spontanuem

59

This list of foundation clones slightly differs with the 18 foundation clones identified by

Deren (1995). Red Fiji (S. officinarum), Yellow Caledonia (S. officinarum), Vallai (S.

officinarum) and Hawaiian Uba ( S. sinense) were among the foundation clones mentioned by

Deren (1975) not found in the lineage of the 60 clone in this study. On the other hand, EK28 (S.

officinarum), FX1 (S. officinarum) and US56-15-8 (S. spontaneum) are listed as foundation

clones here but were not included in Deren’s (1995) list of foundation clones. Except for US56-

15-8 which is a relatively recent wild germplasm to be used in the base broadening efforts, this

apparent discrepancy might be due to the different clones evaluated in the two studies and

spurious and conflicting pedigree records. Figure 17 shows the S. officinarum foundation clones

which account for a large part of the genetic make-up of the 60 clones and their relative

frequencies. Ashy Mauritius, Banjermasin Hitam, Black Cheribon, EK28, Striped Mauritius, S.

officinarum (Java) and S. officinarum occurred in high frequency. Rose Bamboo and Crystalina

were identified as synonymous clones (Deren, 1995). The influence of Barberi spp. is mostly

through the clone Chunnee. S. sinensi ancestry is through Okinawa Tekcha. Loethers is unique

among the foundation clones in that it is an interspecific hybrid S. officinarum x S. sinensi.

Noteworthy is the fact that the S. spontaneum influence can be traced back to just two clones:

Indian S. spontaneum and Java S. spontaneum (Deren, 1995).

The composition of foundation clones of other sugarcane breeding stations worldwide

will not be far from the one described here. It is universally accepted that only a few clones were

used in the development of the modern sugarcane. Knowing the frequency and composition of

the foundation clone for a specific germplasm becomes a handy tool in creating diversity and

broadening the genetic base. It is noted that when frequency of presence of these foundation

clones were evaluated per breeding station there was no appreciable difference (data no shown).

60

0.000.100.200.300.400.500.600.700.800.901.00

CH

UN

NEE

KAN

SAR

28 N

G 2

51

ASH

Y M

AUR

ITIU

S

BAN

JER

MAS

IN H

ITAM

BLAC

K C

HER

IBO

N

CR

YSTA

LIN

A

EK28 FX

1

JAVA

OFF

ICIN

ARU

M

LOET

HER

S

RO

SE B

AMBO

O

SAC

CH

ARU

M O

FFIC

INAR

UM

STR

IPED

MAU

RIT

IUS

OKI

NAW

A TE

KCH

A

IND

IAN

SPO

NTA

NU

EM

JAVA

SPO

NTA

NU

EM

US5

6-15

-8

Foundation clones

Perc

ent P

rese

nce

Figure 17. Percent presence of the foundation clones in the 63 clones.

COP vs. TRAP (Cluster and MDS Analyses)

The generated 3,600 pairwise COP data points was converted to a 60 x 60 matrix (GScop)

that was imported to NTYSys for Cluster and MDS analysis. Figure 18 shows the resulting

dendrogram based on COP.

The highest GScop was 0.40 and the lowest GScop was 0.06, with an average value of

0.23. Since the highest GS for a highly heterozygous crop like sugarcane can be 0.5 (i.e.

progeny of selfed clones would have a GS = 0.5 among each other), the dendrogram presents a

wide range of GS index (0.40 – 0.06 = 0.34). The most distant cluster that separated at GScop =

0.06 was observed to be composed of “old” (ie. Pre 1980s) foreign clones: Co421, POJ2878 and

NCo310. This is followed by the cluster of full siblings CP70-330 and CP70-321 and their male

61

parent, CP57-614 at GScop greater than 0.06. The most similar entries, on the other hand, were

LCP82-89 and its female parent, CP52-68 (GScop = 0.40). This is followed by a cluster

composed of the parental entries LCP85-384 and LCP86-454, its progeny HoCP01-544 and

HoCP01-533 and also reciprocal progenies HoCP01-544 and HoCP01-533 at GScop = 0.38 (refer

to Table 7 for the pedigree of the entries). Not too far were clusters composed of fullsibs and

their parents: L01-292 and L01-296 and their parent, CP65-357 (GScop ≤ 0.38) and full siblings

CP73-351, CP74-383, CP76-331 and CP79-318 (GScop = 0.31).

COP

GS (f)0.00 0.10 0.20 0.30 0.40 0.50

CP701133MW

CO421 POJ2878 NCO310 CP44154 L6296 HCP89846 L00266 L93386 CP48103 CP86916 L65069 CP52068 LCP8289 CP720370 CP65357 L01292 L01296 CP73351 CP74383 CP76331 CP79318 CP77310 HCP01544 LCP85384 HCP01553 HCP96540 L98209 LCP86454 HCP00927 HCP00930 US9316 CP77407 CP85830 L01283 L01299 L93365 HCP91555 HCP03741 HCP92631 HCP03760 HCP85845 HCP92678 US0140 US0139 US9315 CP77405 LHO83153 L01281 LCP86429 CP62258 TUCP7742 CP72356 HCP91552 CP701133 HCP98776 L97128 LCP81010 CP57614 CP70321 CP70330

Figure 18. Dendrogram of the 63 clones using UPGMA cluster analysis method produced from GS estimates using Coefficient of Parentage (COP).

62

Figure 19. Mantel test on COP vs. SUC, CT, TRICH GS matrices. note: all matrices are significantly different at p = 0.05.

However, the mantel test for matrix correlation (Mantel, 1967) revealed a low correlation

between COP-derived and TRAP-derived GS. Figure 19 shows the respective correlation

between COP vs. overall TRAP to be -0.008, COP vs. SUC to be -0.104, COP vs. CT to -0.132

and COP and TRICH to be -0.023. Similar results were observed in roughly the same set of

entries when COP-derived GS was compared with AFLP®-derived GS in our laboratory. Lima

et al. (2002) compared COP and AFLP® for 78 sugarcane clones and found a moderate

correlation, r = 0.37 which was higher than the AFLP®-COP correlation coefficient of 0.17

r = -0.02398COP vs TRICH

cop.NTS-0.04 -0.01 0.02 0.06 0.09

trich_dist.NTS

-0.11

-0.06

-0.02

0.02

0.06r = -0.13200

CT

cop.NTS-0.04 -0.01 0.02 0.06 0.09

ct_dist.NTS

-0.11

-0.07

-0.02

0.02

0.06

r = 0.10484COP vs Suc

cop.NTS-0.04 -0.01 0.02 0.06 0.09

suc_dist.NTS

-0.08

-0.04

-0.00

0.03

0.07

63

reported in our laboratory. Differences in the germplasm used may account for this disparity.

Several factors may be responsible for the low correlation between marker information

and COP in a complex polyploidy/ aneuploid like sugarcane. Aneuploidy would lead to spurious

relationships among half- and even full-siblings when using markers as opposed to COP in

computing GS. Unequal parental contribution of chromosomes during meiosis (Deren, 1975)

and the effect of selection and drift are all factors that are not accounted for when computing

GScop. Also, the assumption that ancestors are not related is an inherent bias in calculating GScop.

Molecular markers scored as presence or absence of a band between two individuals only

informs us about the state not the sequence of the band which may also bias GSjaccard estimates.

Moreover, if TRAP markers reveals mostly trait related polymorphisms, then it is plausible that

selection, genetic drift and unequal contribution of parents through selection could alter genetic

relationship among siblings for a particular trait-gene.

Coefficient of parentage (COP) provides plant breeders an important tool in assessing

genetic diversity. Most of the published studies related to COP are dealing with its relative

strength compared to molecular markers in assessing genetic diversity. When a moderate to high

correlation with molecular marker is found, COP is a cheaper and faster way of assessing

diversity. Information derived from COP, at least in sugarcane, should be used as a guide to

supplement the information generated from molecular markers. Given that accurate pedigree

records are kept, COP analysis of ancestry should be important when base broadening is the

main objective. But when the main objective is to make heterotic crosses for specific and clearly

defined traits then TRAP marker analysis may offer a better alternative to select divergent

parents. Experiments to confirm that TRAP bands are allelic and to test if TRAP markers can be

useful in parental selection are underway.

64

SUMMARY AND CONCLUSION

Sugarcane is an important crop in the world but in the US it enjoys a less significant

status. Consequently, sugarcane lags behind other crops in utilizing molecular biology tools to

unravel its complexity of its genome and to effect genetic improvement. The formation of a

comprehensive sugarcane EST database (SUCEST) alongside the construction of some genomic

libraries from diverse tissues (Ma et al., 2004), are among the few promising steps that cast some

light of optimism into the future of sugarcane improvement using the novel genetic tools

available to other crops. The development of a PCR-based marker system that uses primers

designed from gene sequences and/or expressed sequence tags (ESTs) offers a new perspective

towards the now generic and ubiquitous genetic diversity studies.

Target Region Amplification Polymorphism (TRAP) (Hu and Vick, 2003) is a marker

that takes advantage of gene sequence related information. The TRAP was conceived to show

gene polymorphism in a targeted region between a gene- or EST-anchored fixed primer

(designed from the gene or EST sequence) and an intronic- or exonic-targeting arbitrary primer.

The degree and pattern of genetic diversity among 63 sugarcane clones was studied using TRAP

markers. Most of these clones are cultivars released since the start of the sugarcane breeding

program in the 1960s. Also included in the study were several foreign cultivars and progenitor

species of sugarcane that are either of worldwide or local significance and interest. We explored

diversity among the clones and the relationship of this diversity to era of release (pre- and post-

1980). Most of these clones are still actively being used as parents in the various US breeding

stations. We used three gene families, that is, genes believed to be involved in sucrose

accumulation, cold tolerance and trichome development. We paired these three genes with four

arbitrary primers giving a total of 40 primer combinations.

65

Target Region Amplification Polymorphism (TRAP) revealed high amounts of

polymorphism comparable to polymorphism revealed by AFLP® in sugarcane but cluster

analysis revealed a very narrow genetic diversity among the 63 clones. Pedigree records

indicated that the clones are highly related including full and half siblings. Cluster analysis did

not show the clones grouping based on their pedigree records as in most cases full and half

siblings failed to cluster together. When cluster analysis was confined to individual gene

families (i.e. a series of genes related in the expression of a trait), only the cluster derived from

sucrose genes gave some semblance of a meaningful grouping. For example, US56-15-8, a S.

spontaneum species, was observed to be an outgroup only in the SUC-based cluster but not in the

other two gene family clusters. Since S. spontaneum is known to confer anything but the high

sucrose content trait, one may speculate that the SUC based cluster might be reflective of “sugar”

gene pattern. In addition, LA PURPLE, a S. officinarum clone considered to be a foundation

clone and known to be of high sugar content was observed to have a greater similarity with the

cultivars in the SUC dendrogram. In particular, LA PURPLE clustered with CP77-310 at GS =

0.91; an interesting result considering that CP77-310 is the male parent of the popular and high

yielding LCP85-384.

Within this backdrop of the narrow genetic diversity window and the lack of discernable

groupings from cluster analyses, we explored MDS to either confirm or provide additional

insights into the data. MDS revealed some subtle trends which may be useful to the plant

breeder. The general MDS biplot (all three genes families) showed a tight grouping as was

revealed by cluster analysis but a few clones considered as an outliers were found dispersed

around the periphery. They included some of the old clones, the one S. spontaneum clone

(US56-15-8) and some clones derived from recent backcrosses to US56-15-8. This was not

apparent in the cluster analysis.

66

The X and Y coordinates from the MDS analysis were further subjected to mixed model

analysis (a novel way of exploring associations). When used in conjunction with MDS, mixed

model analysis provides significance test that can be attached to the results from MDS analysis

which in turn makes the interpretation more meaningful. The MDS and Mixed Model Analysis

for the old (pre 1980) and new (post 1980) clones revealed no significant difference among the

clones. This was attributed to the overlapping generations in sugarcane breeding where both old

and new clones are used for breeding. In some cases the new clones are derived (recycled) from

the same parents as the old ones.

Whereas cluster and MDS analysis are exploratory tools designed to discover

associations among objects, AMOVA is primarily aimed at partitioning variation and attributing

it factors so as to describe the population structure. We explored our grouping of era of release

and gene families using AMOVA. The AMOVA showed that among the observed genetic

variation, only a small portion (0.59%) could be attributed to era of release with most of the

observed variation (99.41%) residing among clones within era. This could be attributed to the

low amount of genetic diversity among the clones and further explains why cluster analyses

could not reveal any discernable groupings and why the trend revealed by MDS and Mixed

Model Analysis was subtle and could be ascribed only to a handful of clones.

The AMOVA for gene families, among genes within gene families and within genes

revealed that most of the observed variations (94.01%) resided within genes with only 3.95% of

the variation among genes within gene families and 2.04% among the three gene families.

Therefore, polymorphism derived from a gene (fixed primer) paired with two different arbitrary

primers would provide more discriminatory power than using different fixed primers paired with

a single arbitrary primer. However, we still have to increase the number of fixed primers

associated with the expression of the associated trait to better capture a good portion of the trait

67

polymorphism hope for. The small amount of variation ascribed to among genes within gene

families and among gene families suggest that one would have to exhaust all the genes

responsible (in the pathway) for the expression a trait to paint a better picture of trait related

polymorphism.

We also employed the AMOVA to test for our perceived notion of gene families using

the pair-wise distances. Only the SUC genes grouped as a family. One of the cold tolerance

genes, CDPK grouped with the sucrose genes and could actually be classified as a SUC gene.

Not surprising CDPK is part of the larger super gene family that is involved in several

biochemical pathways mostly in response to stress. The CT and TRICH genes were commingled

into a separate group which is reflective of the relatively high correlation (r = 0.5) between CT

and TRICH compared to SUC and TRICH (r = 0.18) from the mantel test.

The low but significant correlation between coefficient of parentage (COP) and TRAP-

based GS obtained here corresponds to the general findings reported by other researchers using

other markers and other crops. The complex inheritance pattern of sugarcane, and the

assumptions inherent to the COP formula potentially contributed to this disparity. However,

COP remains an invaluable tool especially for an interspecific hybrid such as sugarcane.

Knowing the composition of each specific Saccharum species is very important because plant

breeders are better able to relate traits unique to a particular Saccharum species to the traits

expressed by a clone. Knowledge of pedigree history would enhance interpretation of the

information derived from molecular markers. But when the main objective is to make heterotic

crosses for specific and clearly defined traits, TRAP marker analysis may offer a better

alternative to select divergent parents.

68

BIBLIOGRAPHY

Al-Janabi, S.M., B.W.S. Sobral, C. Petersen, and M. McClelland. 1994. Phylogenetic analysis of organellar DNA sequences in the Andropogoneae: Saccharinae. Theor Appl Genet 88:933-944.

Alwala, S., S. Andru, J. Arro, and C.A. Kimbeng. 2003. Target Region Amplification Polymorphism (TRAP) markers for Sugarcane Genotyping (Abstract), p. 105 Journal of the American Society of Sugar Cane Technologists, Vol. 24.

Amor, Y., C.H. Haigler, S. Johnson, M. Wainscott, and D.P. Delmer. 1995. A membrane-associated form of sucrose synthase and its potential role in synthesis of cellulose and callose in plants. Proc Natl Acad Sci U S A 92:9353-9357.

Andersen, J.R., and T. Lubberstedt. 2003. Functional markers in plants. Trends Plant Sci 8:554-560.

Arcenueaux, G. 1965. Cultivated Sugarcane of the world and their botanical derivation, p. 844-854 Proc Int Soc Sug Tech, Vol. 12.

Artus, N.N., M. Uemura, P.L. Steponkus, S.J. Gilmour, C.T. Lin, and M.F. Thomashow. 1996. Constitutive expression of the cold-regulated Arabidopsis thaliana COR15a gene affects both chloroplast and protoplast freezing tolerance. Proc Natl Acad Sci U S A 93:13404-13409.

Besse, P., G. Taylor, B. Carroll, N. Berding, D. Burner, and C.L. McIntyre. 1998. Assessing genetic diversity in a sugarcane germplasm collection using an automated AFLP analysis. Genetica 104:143-153.

Bioenergy Information Network.1999. Mischanthus Factoid [Online] http://bioenergy.ornl.gov/papers/miscanthus/miscanthus.html.

Braithwaite, K., B. Croft, and S. Brumbley. 2004. Genetic diversity within collections of the sugarcane orange rust fungus., p. 82, In D. M. Hogarth, ed. 2004 Conference of the Australian Society of Sugar Cane Technologists held at Brisbane, Queensland, Australia, 4-7 May 2004.

Brar, D.S. 2002. Molecular Marker Assisted Breeding, p. 55-83, In B. D. S. Jain S.M., Ahloowalia, B.S., ed. Molecular Techniques in Crop Improvement. Kluwer Academic Publishers.

Buczynski, S.R., M. Thom, P. Chourey, and A. Maretzki. 1993. Tissue distribution and characterization of sucrose synthase isozymes in sugarcane. J Plant Physiol 142:641-646.

69

Burnell, J.N., and M.D. Hatch. 1985. Light dark modulation of leaf pyruvate, P(I) dikinase. Trends Biochem Sci 10:288-291.

Butterfield, M.K., R.S. Rutherford, D.L. Carson, and B.I. Huckett. 2004. Application of gene discovery to varietal improvement in sugarcane. S Afr J Bot 70:167.

Casler, M.D., Y. Rangel, J.C. Stier, and G.W. Jung. 2003. RAPD marker diversity among creeping bentgrass clones. Crop Sci 43:688-693.

Casu, R.E., C.M. Dimmock, S.C. Chapman, C.P.L. Grof, C.L. McIntyre, G.D. Bonnett, and J.M. Manners. 2004. Identification of differentially expressed transcripts from maturing stem of sugarcane by in silico analysis of stem expressed sequence tags and gene expression profiling. Plant Mol Biol 54:503-517.

Cheng, S., M. Willmann, H. Chen, and J. Sheen. 2002. Calcium Signaling through protein kinases. The Arabidopsis calcium-dependent protein kinase gene family. Plant Physiol 129:469-485.

Chinnusamy, V., K. Schumaker, and J.K. Zhu. 2004. Molecular genetic perspectives on cross-talk and specificity in abiotic stress signalling in plants. J Exp Bot 55:225-236.

Chourey, P.S., E.W. Taliercio, S.J. Carlson, and Y.L. Ruan. 1998. Genetic evidence that the two isozymes of sucrose synthase present in developing maize endosperm are critical, one for cell wall integrity and the other for starch biosynthesis. Mol Gen Genet 259:88-96.

Cordeiro, G.M., Y.B. Pan, and R.J. Henry. 2003. Sugarcane microsatellites for the assessment of genetic diversity in sugarcane germplasm. Plant Science 165:181-189.

Coto, O., M.T. Cornide, D. Calvo, E. Canales, A. D'Hont, and F. de Prada. 2002. Genetic diversity among wild sugarcane germplasm from Laos revealed with markers. Euphytica 123:121-130.

Cuadrado, A., R. Acevedo, S.M.D. de la Espina, N. Jouve, and C. de la Torre. 2004. Genome remodelling in three modern S. officinarum x S. spontaneum sugarcane cultivars. J Exp Bot 55:847.

Curley, J., and G. Jung. 2004. RAPD-based genetic relationships in kentucky bluegrass: Comparison of cultivars, interspecific hybrids, and plant introductions. Crop Sci 44:1299-1306.

Daniels, J., and B.T. Roach. 1987. Germplasm collection, maintenance and use, p. 25, In H. DJ, ed. Sugarcane Improvement Through Breeding, Vol. 11. Elsevier, New York.

70

Dejardin, A., C. Rochat, S. Wuilleme, and J.P. Boutin. 1997. Contribution of sucrose synthase, ADP-glucose pyrophosphorylase and starch synthase to starch synthesis in developing pea seeds. Plant Cell Environ 20:1421-1430.

Deren, C.W. 1995. Genetic Base of U.S. Mainland Sugarcane. Crop Sci 35:1195-1199.

D'hont, A., L. Grivet, P. Feldmann, S. Rao, N. Berding, and J.C. Glaszmann. 1996. Characterisation of the double genome structure of modern sugarcane cultivars (Saccharum spp) by molecular cytogenetics. Mol Gen Genet 250:405-413.

D'hont, A., Y.H. Lu, D.G. Deleon, L. Grivet, P. Feldmann, C. Lanaud, and J.C. Glaszmann. 1994. A Molecular approach to unraveling the genetics of sugarcane, a complex polyploid of the Andropogoneae Tribe. Genome 37:222-230.

D'hont, A., P.S. Rao, P. Feldmann, L. Grivet, N. Islamfaridi, P. Taylor, and J.C. Glaszmann. 1995. Identification and characterization of sugarcane intergeneric hybrids, Saccharum officinarum x Erianthus Arundinaceus, with molecular markers and DNA in-situ hybridization. Theor Appl Genet 91:320-326.

Diaz, M., E.L. Peralta, A. Iglesia, V. Pazos, O. Carvajal, M.P. Perez, E.A. Giglioti, P.R. Gagliardi, A. Wendland, and L.E.A. Camargo. 2001. Xanthomonas albilineans haplotype B responsible for a recent sugarcane leaf scald disease outbreak in Cuba. Plant Dis 85:334.

Excoffier, L. 1996. Arlequin, A software for population genetics data analysis [Online]. Available by Department of Anthropology & Ecology of the University of Geneva http://lgb.unige.ch/arlequin/ (verified Sept 19, 2004).

Excoffier, L., P.E. Smouse, and J.M. Quattro. 1992. Analysis of molecular variance inferred from metric distances among DNA Haplotypes - application to human mitochondrial-DNA restriction data. Genetics 131:479-491.

Falconer, D.S. 1989. Introduction to Quantitative Genetics. 3rd ed. Longman Scientific & Technical Wiley, New York.

Felsenstein, J. 1984. Distance methods for inferring phylogenies: A justification. Evolution 38:16–24.

Fukayama, H., H. Tsuchida, S. Agarie, M. Nomura, H. Onodera, K. Ono, B.H. Lee, S. Hirose, S. Toki, M.S.B. Ku, A. Makino, M. Matsuoka, and M. Miyao. 2001. Significant accumulation of C-4-specific pyruvate,orthophosphate dikinase in a C-3 plant, rice. Plant Physiol 127:1136-1146.

71

Glackin, C.A., and J.W. Grula. 1990. Organ-specific transcripts of different size and abundance derive from the same pyruvate, orthophosphate dikinase gene in maize. Proc Natl Acad Sci U S A 87:3004-3008.

Glaszmann, J.C., Y.K. Lu, and C. Lanaud. 1990. Variation of nuclear ribosomal DNA in sugarcane. Journal of Genetics and Breeding 44:191-198.

Glaszmann, J.C., A. Fautret, J.L. Noyer, P. Feldmann, and C. Lanaud. 1989. Biochemical Genetic-Markers in Sugarcane. Theor Appl Genet 78:537-543.

GRIN. 2004. The Germplasm Resources Information Network (GRIN) [Online] http://www.ars-grin.gov/cgi-bin/npgs/html/genform.pl.

Grivet, L., and P. Arruda. 2002. Sugarcane genomics: depicting the complex genome of an important tropical crop. Curr Opin Plant Biol 5:122-127.

Grof, C. 2001. Molecular manipulation of sucrose metabolism, p. 586-587 Proc Int Soc Sugarcane Tech, Vol. 24.

Ha, S., P.H. Moore, D. Heinz, S. Kato, N. Ohmido, and K. Fukui. 1999. Quantitative chromosome map of the polyploid Saccharum spontaneum by multicolor fluorescence in situ hybridization and imaging methods. Plant Mol Biol 39:1165-1173.

Halldén, C., N.O. Nilsson, I.M. Rading, and T. Sill. 1994. Evaluation of RFLP and RAPD markers in a comparison of Brassica napus breeding lines. Theor Appl Genet 88:123–128.

Harmon, A.C., M. Gribskov, E. Gubrium, and J.F. Harper. 2001. The CDPK superfamily of protein kinases. New Phytol 151:175-183.

Hatch, M.D. 1987. C-4 Photosynthesis - a Unique Blend of Modified Biochemistry, Anatomy and Ultrastructure. Biochimica Et Biophysica Acta 895:81-106.

Hawker, J.S., C.F. Jenner, and C.M. Niemietz. 1991. Sugar metabolism and compartmentation. Aust J Plant Physiol 18:227-237.

Heinz, D.J., and T.L. Tew. 1987. Hybridization Procedures, In D. J. Heinz, ed. Sugarcane Improvement Through Breeding, Vol. 11. Elsevier, New York.

Hoarau, J.Y., B. Offmann, A. D'Hont, A.M. Risterucci, D. Roques, J.C. Glaszmann, and L. Grivet. 2001. Genetic dissection of a modern sugarcane cultivar (Saccharum spp.). I. Genome mapping with AFLP markers. Theor Appl Genet 103:84-97.

72

Hoarau, J.Y., L. Grivet, B. Offmann, L.M. Raboin, J.P. Diorflar, J. Payet, M. Hellmann, A. D'Hont, and J.C. Glaszmann. 2002. Genetic dissection of a modern sugarcane cultivar (Saccharum spp.).II. Detection of QTLs for yield components. Theor Appl Genet 105:1027-1037.

Hu, J.G., and B.A. Vick. 2003. Target Region Amplification Polymorphism: A novel marker technique for plant genotyping. Plant Molecular Biology Reporter 21:289-294.

Huff, D.R. 1997. RAPD characterization of heterogenous perennial ryegrass cultivars. Crop Sci 37:783–791.

Huff, D.R., R. Peakall, and P.E. Smouse. 1993. RAPD variation within and among natural-populations of outcrossing buffalograss Buchloe-Dactyloides (Nutt) Engelm. Theor Appl Genet 86:927-934.

Insightful Corp. 2003. S-PLUS® 6.2 for Windows Professional Edition.

Jaccard, P. 1908. Nouvelles rescherches sur la distribution floral. Bull. Soc. Vaud. Sci. Nat. 44:223-270.

Jannoo, N., L. Grivet, M. Seguin, F. Paulet, R. Domaingue, P.S. Rao, A. Dookun, A. D'Hont, and J.C. Glaszmann. 1999. Molecular investigation of the genetic base of sugarcane cultivars. Theor Appl Genet 99:171-184.

Jenkin, M.J., S.M. Reader, K.A. Purdie, and T.E. Miller. 1995. Detection of rDNA sites in sugarcane by FISH. Chromosome Res 3:444.

Johnson, R.A., and D.W. Wichern. 2002. Applied Multivariate Statistical Analysis. 5th ed. Prentice Hall, Upper Saddle River, NJ, USA.

Kempthorne, O. 1957. An introduction to Genetic Statistics Wiley and Sons, New York.

Kennedy, G.G. 2003. Tomato, Pests, parasitoids, and predators: Tritrophic interactions involving the genus Lycopersicon. Annu Rev Entomol 48:51-72.

Kimbeng, C.A., C. LaBorde, K. Bischoff, and K.A. Gravois. 2004. Genetic diversity and relationship among parents in the LSU Agcenter sugarcane crossing program. LSU Agcenter.

Lahtinen, M., J.P. Salminen, L. Kapari, K. Lempa, V. Ossipov, J. Sinkkonen, E. Valkama, E. Haukioja, and K. Pihlaja. 2004. Defensive effect of surface flavonoid aglycones of Betula pubescens leaves against first instar Epirrita autumnata larvae. J Chem Ecol 30:2257-2268.

73

Larkin, J.C., D.G. Oppenheimer, A.M. Lloyd, E.T. Paparozzi, and M.D. Marks. 1994. Roles of the glabrous1 and Transparent testa glabra genes in Arabidopsis trichome development. Plant Cell 6:1065-1076.

Legendre, B.L., and K.A. Gravois. 2004. The 2003 Louisiana sugarcane variety survey. LSU Agcenter Annual Report.

Li, G., and C.F. Quiros. 2001. Sequence-related amplified polymorphism (SRAP), a new marker system based on a simple PCR reaction: its application to mapping and gene tagging in Brassica. Theor Appl Genet 103:455-461.

Lima, M.L.A., A.A.F. Garcia, K.M. Oliveira, S. Matsuoka, H. Arizono, C.L. de Souza, and A.P. de Souza. 2002. Analysis of genetic similarity detected by AFLP and coefficient of parentage among genotypes of sugar cane (Saccharum spp.). Theor Appl Genet 104:30-38.

Link, W., C. Dixkens, M. Singh, M. Schwall, and A.E. Melchinger. 1995. Genetic diversity in european and mediterranean faba bean germplasm revealed by RAPD Markers. Theor Appl Genet 90:27-32.

Liu, Q., M. Kasuga, Y. Sakuma, H. Abe, S. Miura, K. Yamaguchi-Shinozaki, and K. Shinozaki. 1998. Two transcription factors, DREB1 and DREB2, with an EREBP/AP2 DNA binding domain separate two cellular signal transduction pathways in drought- and low-temperature-responsive gene expression, respectively, in Arabidopsis. Plant Cell 10:1391-1406.

LSU AgCenter. 2001. Louisiana Sugarcane: Production and Processing [Online] http://www.agctr.lsu.edu/subjects/sugarcane/index.asp (verified August 19, 2004).

Lu, Y.H., A. D'hont, F. Paulet, L. Grivet, M. Arnaud, and J.C. Glaszmann. 1994. Molecular diversity and genome structure in modern sugarcane varieties. Euphytica 78:217-226.

Ma, H.M., S. Schulze, S. Lee, M. Yang, E. Mirkov, J. Irvine, P. Moore, and A. Paterson. 2004. An EST survey of the sugarcane transcriptome. Theor Appl Genet 108:851.

Mangelsdorf, A. 1983. Cytoplasmic diversity in relation to pests and pathogens. International Society of Sugarcane Technologists Newsletter 45:45-49.

Mantel, N. 1967. Detection of disease clustering and generalized regression approach. Cancer Res 27:209-220.

Marks, M.D. 1997. Molecular genetic analysis of trichome development in arabidopsis. Annu Rev Plant Physiol Plant Mol Biol 48:137-163.

74

Mellish, A., B. Coulman, and Y. Ferdinandez. 2002. Genetic relationships among selected crested wheatgrass cultivars and species determined on the basis of AFLP markers. Crop Sci 42:1662-1668.

Ming, R., S.C. Liu, P.H. Moore, J.E. Irvine, and A.H. Paterson. 2001. QTL analysis in a complex autopolyploid: Genetic control of sugar content in sugarcane. Genome Res 11:2075-2084.

Minorsky, P. 2002. Pharmacology of calcium spiking induced by Nod factors. Plant Physiol 128(1163-1164.

Mohammadi, S.A., and B.M. Prasanna. 2003. Analysis of genetic diversity in crop plants - salient statistical tools and considerations. Crop Sci 43:1235-1248.

Naidu, K.M., and T.V. Sreenivasan. 1987. Conservation of the sugarcane germplasm, p. 33-53 Copesucar International Sugarcane Breeding Workshop, Copersucar, Sao Paulo, Brazil.

Naidu, S.L., and S.P. Long. 2004. Potential mechanisms of low-temperature tolerance of C4 photosynthesis in Miscanthus x giganteus: an in vivo analysis. Planta 220:145-155.

Naidu, S.L., S.P. Moose, A.K. Al-Shoaibi, C.A. Raines, and S.P. Long. 2003. Cold tolerance of C-4 photosynthesis in Miscanthus x giganteus: Adaptation in amounts and sequence of C4 photosynthetic enzymes. Plant Physiol 132:1688-1697.

Nair, N.V., A. Selvi, T.V. Sreenivasan, and K.N. Pushpalatha. 2002. Molecular diversity in Indian sugarcane cultivars as revealed by randomly amplified DNA polymorphisms. Euphytica 127:219.

Nakai, T., N. Tonouchi, T. Konishi, Y. Kojima, T. Tsuchida, F. Yoshinaga, F. Sakai, and T. Hayashi. 1999. Enhancement of cellulose production by expression of sucrose synthase in Acetobacter xylinum. Proc Natl Acad Sci U S A 96:14-18.

Nei, M., and W.H. Li. 1979. Mathematical-Model for studying genetic-variation in terms of restriction endonucleases. Proc Natl Acad Sci U S A 76:5269-5273.

Nolte, K.D., and K.E. Koch. 1993. Companion-cell specific localization of sucrose synthase in zones of phloem loading and unloading. Plant Physiol 101:899-905.

Pan, Y.B., G.M. Cordeiro, E.P. Richard, and R.J. Henry. 2003. Molecular genotyping of sugarcane clones with microsatellite DNA markers. Maydica 48:319-329.

Phan, A.T., and S.R. Smith. 2000. Seed yield variation in blue grama and little bluestem plant collections in southern Manitoba, Canada. Crop Sci 40:555-561.

75

Piperidis, G., M.J. Christopher, B.J. Carroll, N. Berding, and A. D'Hont. 2000. Molecular contribution to selection of intergeneric hybrids between sugarcane and the wild species Erianthus arundinaceus. Genome 43:1033-1037.

Powell, W., M. Morgante, C. Andre, M. Hanafey, J. Vogel, S. Tingey, and A. Rafalski. 1996. The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Mol Breed 2:225-238.

Reeves, J., J. Law, P. Donin, M.R. Koebner, and R. Cooke. 1999. Changes over time in the genetic diverisity of UK cereal crops [Online]. Available by National Institute of Agricultural Botany, Cambridge http://apps3.fao.org/wiews/Prague/Paper12.htm.

Rohlf, F. 2000. NTSYS-pc Numerical Taxonomy and Multivariate Analysis System Ver 2.11L Manual. Applied Biostatistics, New York.

Santos, J.B., J. Nienhuis, P. Skroch, J. Tivang, and M.K. Slocum. 1994. Comparison of RAPD and RFLP genetic markers in determining genetic similarity among Brassica oleraceae L. genotypes. Theoretical and Applied Genetics 87:909–915.

SAS Institute Inc. 2002. SAS - Statistical Analysis Software for Windows, 9.0 ed, Cary, NC, USA.

Schafer, W.E., J.M. Rohwer, and F.C. Botha. 2004. Protein-level expression and localization of sucrose synthase in the sugarcane culm. Physiol Plant 121:187-195.

Schneider, D., D. Roessli, and L. Excoffier. 2000. Arlequin ver 2.00: A software for population genetics data analysis, Genetics and Biometry Laboratory, University of Geneva, Switzerland.

Sheen, J. 1991. Molecular Mechanisms Underlying the Differential Expression of Maize Pyruvate, Orthophosphate Dikinase Genes. Plant Cell 3:225-245.

Shinwari, Z.K., K. Nakashima, S. Miura, M. Kasuga, M. Seki, K. Yamaguchi-Shinozaki, and K. Shinozaki. 1998. An Arabidopsis gene family encoding DRE/CRT binding proteins involved in low-temperature-responsive gene expression. Biochem Biophys Res Commun 250:161-170.

Smouse, P.E., J.C. Long, and R.R. Sokal. 1986. Multiple regression and correlation extensions of the Mantel test of matrix correspondence. Syst Zool 35:627-632.

Sreenivasan, T.V., B.S. Ahlowalia, and D. Heinz. 1987. Germplasm collection, maintenance and use, p. 211, In H. DJ, ed. Sugarcane Improvement Through Breeding, Vol. 11. Elsevier, New York.

76

St. Gabriel/Sugar Research Station. 2004. Sugarcane Research Annual Progress Report, Vol. 2004. LSU-Agcenter, St Gabriel, Baton Rouge.

Steponkus, P.L., M. Uemura, R.A. Joseph, S.J. Gilmour, and M.F. Thomashow. 1998. Mode of action of the COR15a gene on the freezing tolerance of Arabidopsis thaliana. Proc Natl Acad Sci U S A 95:14570-14575.

Stockinger, E.J., S.J. Gilmour, and M.F. Thomashow. 1997. Arabidopsis thaliana CBF1 encodes an AP2 domain-containing transcriptional activator that binds to the C-repeat/DRE, a cis-acting DNA regulatory element that stimulates transcription in response to low temperature and water deficit. Proc Natl Acad Sci U S A 94:1035-1040.

Sun, J.D., T. Loboda, S.J.S. Sung, and C.C. Black. 1992. Sucrose Synthase in Wild Tomato, Lycopersicon-Chmielewskii, and Tomato Fruit Sink Strength. Plant Physiol 98:1163-1169.

Swanson, T. 1996. Global values of biological diversity: the public interest in the conservation of plant genetic resources for agriculture. Plant Genetic Resources Newsletter 105:1-7.

Tew, T.L. 1987. New Varieties, In D. J. Heinz, ed. Sugarcane Improvement Through Breeding, Vol. 11. Elsevier, New York.

Thomashow, M.F. 1999. Plant cold acclimation: Freezing tolerance genes and regulatory mechanisms. Annu Rev Plant Physiol Plant Mol Biol 50:571-599.

Thormann, C.E., M.E. Ferreira, L.E.A. Camargo, J.G. Tivang, and T.C. Osborn. 1994. Comparison of RFLP and RAPD markers to estimating genetic relationships within and among cruciferous species. Theor Appl Genet 88:973–980.

Tivang, J.G., J. Nienhuis, and O.S. Smith. 1994. Estimation of sampling variance of molecular marker data using the bootstrap procedure. Theor Appl Genet 89:259-264.

Tomlinson, P.T., E.R. Duke, K.D. Nolte, and K.E. Koch. 1991. Sucrose synthase and invertase in isolated vascular bundles. Plant Physiol 97:1249-1252.

Urban, D.L. 2003. Spatial Analysis in Ecology : Mantel's Test [Online] www.env.duke.edu/landscape/classes/env352/mantel.pdf.

Vuylsteke, M., R. Mank, B. Brugmans, P. Stam, and M. Kuiper. 2000. Further characterization of AFLP® data as a tool in genetic diversity assessments among maize (Zea mays L.) inbred lines. Mol Breed 6:265-276.

Weir, B.S. 1996. Genetic Data Analysis II Sinauer Associates, Inc., Sunderland, MA.

77

Winter, H., and S.C. Huber. 2000. Regulation of sucrose metabolism in higher plants: Localization and regulation of activity of key enzymes. Crit Rev Biochem Mol Biol 35:253-289.

Wright, R.J., P.M. Thaxton, K.H. El-Zik, and A.H. Paterson. 1999. Molecular mapping of genes affecting pubescence of cotton. J Hered 90:215-219.

Yang, N.S., and D. Russell. 1990. Maize sucrose synthase-1 promoter directs phloem cell-specific expression of gus gene in transgenic tobacco plants. Proc Natl Acad Sci U S A 87:4144-4148.

Zhu, Y.J., E. Komor, and P.H. Moore. 1997. Sucrose accumulation in the sugarcane stem is regulated by the difference between the activities of soluble acid invertase and sucrose phosphate synthase. Plant Physiol 115:609-616.

Zhu, Y.J., H.H. Albert, and P.H. Moore. 2000. Differential expression of soluble acid invertase genes in the shoots of high-sucrose and low-sucrose species of Saccharum and their hybrids. Aust J Plant Physiol 27:193-199.

Zrenner, R., M. Salanoubat, L. Willmitzer, and U. Sonnewald. 1995. Evidence of the crucial role of sucrose synthase for sink strength using transgenic potato plants (Solanum tuberosum L). Plant J 7:97-107.

78

VITA

Jie A. Arro works at the Philippine Sugar Research Institute, Inc (PHILSURIN), Bacolod

City, Philippines, as a sugarcane plant breeder (1997-present). His job is to plan and supervise

the crossing activity, handle the first two stages of field evaluation and manage the sugarcane

germplasm collection. He holds the same job function from a previous employer (1995-1997),

Victorias Milling Company, Inc., which relinquished its mandate for sugarcane variety

development in order to streamline its operation.

He obtained a Fulbright-Philippine Agricultural Scholarship Program grant for a master’s

degree program in 2002. He is currently finishing his thesis research for the completion of his

degree of Master of Science in agronomy and environmental management at Louisiana State

University, Baton Rouge, Louisiana. His thesis research is under the supervision of Dr. Collins

Kimbeng and involves utilization of molecular markers in assessing genetic diversity in

sugarcane.

He has attended and presented a poster at the 11th and 12th International Plant and Animal

Genome Conference in San Diego, California. He is also a member of Crop Science Society of

America (2004). His research progress at work is consolidated with PHILSURIN’s presentation

to the annual conference of Philippine Sugar Technologists (PHILSUTECH), a local

organization promoting the enhancement of sugarcane technology through research and

development, where he is a member since 1998.

He received his Bachelor of Science in agriculture degree at the University of the

Philippines, Los Baños, Philippines, in April, 1995. Born in April 8, 1973, he is the eldest among

three siblings. He had his firstborn son during the completion of this thesis.