Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

48
Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando Master’s thesis University of Helsinki Department of Applied Biology Plant Breeding August, 2009

Transcript of Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

Page 1: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

Genetic characterization of Theobroma cacao L. in Nicaragua

Erwin Manuel Aragon ObandoMaster’s thesisUniversity of HelsinkiDepartment of Applied BiologyPlant BreedingAugust, 2009

Page 2: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

HELSINGIN YLIOPISTO HELSINGFORS UNIVERSITET UNIVERSITY OF HELSINKI

Tiedekunta/Osasto Fakultet/Sektion Faculty

Faculty of Agriculture and Forestry

Laitos Institution DepartmentDepartment of Applied Biology

Tekijä Författare Author

Erwin Manuel Aragon Obando

Työn nimi Arbetets titel TitleGenetic characterization of Theobroma cacao L Nicaraguan

Oppiaine Läroämne SubjectPlant breeding

Työn laji Arbetets art Level

Master Sciences Thesis

Aika Datum Month and yearAugust 2009

Sivumäärä Sidoantal Number of pages

50Tiivistelmä Referat Abstract

Genetic composition of Theobroma cacao L., including 60 Nicaraguan farmersaccessions, was investigated using nine microsatellite (SSR) markers. Fourteenbreeders accessions from Experimental Center “El Recreo”, INTA, Nicaragua, twoCriollos accession from CATIE, Costa Rica, and two accessions from Ecuadorwere included as reference material. The average PIC value (0.78) indicated ahigh power of discrimination for the nine loci used. A total of 155 alleles weredetected at the nine loci. The number of alleles per marker ranged from 10 to 22with a means of 17.22 alleles per locus. A heterozygosity deficiency (HExp < HObs)was registered for all microsatellite loci. The average expected heterozygositywas=0.68 among Nicaraguan farmers accessions. The Analysis of molecularvariance (AMOVA) showed a low level of differentiation among populations. Thegenetic distances determined for the groups of farmers accessions RAAS andPacifico Sur are closely similar, while a great genetic distance was observedbetween RAAN and RAAS groups. The cluster analysis presented a stronggenetic relation between the Criollo 13 from CATIE and farmers accessionMAT0404. The principal component analysis showed that 7 farmers accessionfrom Nicaragua are genetically related with the accessions Criollo 13 and Yucatanfrom the international accessions. The present study suggested a good possibilityto select farmers accessions to be included in breeding programs, especiallythose accessions related to know Criollo accessions.

Avainsanat Nyckelord Keywords

Genetic differentiation, Genetic diversity, microsatellites, Farmers and BreedersaccessionSäilytyspaikka Förvaringsställe Where deposited

Muita tietoja Övriga uppgifter Further informationSupervised by : Helena Korpelainen –- [email protected]

Page 3: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

CONTENTS

1. Introduction...............................................................................................4 1.1 Origin and domestication..................................................................4 1.2 Morphology and varieties of Theobroma cacao L...........................6 1.3 Importance of production and use of Theobroma cacao ...............8

2. Objectives..................................................................................................93. Materials and methods...........................................................................10 3.1 Plant materials ..................................................................................10

3.2 DNA extraction ..................................................................................16 3.3 Microsatellite analysis ......................................................................16 3.4 Capillary electrophoresis and data collection................................17 3.5 Data Analysis.....................................................................................18 3.5.1 Genetic diversity ......................................................................18 3.5.2 Genetic differentiation .............................................................19 3.5.3 Genetic distances ....................................................................20 3.5.4 Cluster analysis........................................................................20 3.5.5 Principal component analysis.................................................21

4. Results.....................................................................................................21 4.1 Genetic diversity of Nicaraguan farmers accessions....................22 4.2 Genetic diversity of Nicaraguan farmers accessions at group level.........................................................................................22

4.3 Genetic distances.............................................................................24 4.4 Cluster analysis.................................................................................25 4.5 Principal component analysis..........................................................27

5. Discussion...............................................................................................295.1 Genetic diversity and population structure....................................29

5.2 Differentiation among groups and genetic proximities ................326. Conclusions ............................................................................................347. Acknowledgements ................................................................................358. List of references....................................................................................369. Appendices .............................................................................................44

Page 4: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

4

1. Introduction

1.1 Origin and domestication

Sterculiaceae family has 70 genera and 1,500 species of tropical tree, which the

most famous and important species is Theobroma cacao L. (2n = 20) due to its

economic importance. Theobroma cacao L. is native to the tropical rainforest

and considered exclusive of the neotropical area, with a natural distribution in

rainforests, extended from the Amazon basin to southern Mexico (Dias 2001).

However, the origin and domestication have been controversial and widely

discussed due to wide geographic distribution of Theobroma cacao L.

The first propose about origin was done by Vavilov, who has indicated as

Mexico and Andes as centers of diversity, but it was rejected by F.J Pound in

1937/1938, who found an extensive variability in the wild cocoa trees of

Ecuador. According to Chessman (1944), based on Vavilov theory and the

variability of cacao reported by Pound argued that center of origin of

Theobroma cacao was located in 400 km radius of the Amazon River, in the

vicinity of the rivers Napo, Caqueta and Putumayo. Chessman suggested the

possible migration two groups of subspecies of cacao, one towards Venezuela

and other towards Central America, which the isthmus of Panama was a barrier

that helped to evolved independently the differentiation among both groups. The

Venezuelan group was called as Forastero and Central America group was

called Criollo.

More recently, through to biotechnology tools, Motamayor et al (2002)

compared the genetic diversity of cacao populations from Mesoamerica (Mexico

and Central America) and South America population. They concluded that

Criollo populations found in Lacadamia, Mexico, would consider as near to wild

population found in Peru, established as the center of origin. However, Ogata et

al. (2006) suggested include more samples from Oaxaca, southern of Veracruz,

(Mexico), Central America, and South America in molecular studies in order to

definite the real center origin due to previous (De la Cruz et al. 1995; Whitkus et

al. 1998) study have discovered unique cultivars in Yucatan, which suggested

that origin of Theobroma cacao L still a mystery.

Theobroma cacao L. is a relatively domesticated species. According to

Motamayor et al. (2002), McNeil (2006), and Bartley (2005), it has been

Page 5: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

5

cultivated in Central America already since pre-Columbian times. Cheesman

(1944) suggested Theobroma cacao L in Central America and South America

have evolved independently. Dias (2001), consider that Criollo populations were

originated and cultivated in central America, from Mexico to the south of Costa

Rica, and Forastero existed in the wild state in South America. Among Criollo,

some populations are classified on geographic basis, such as those from

Central America (Mexican and Nicaraguan Criollos) and those from northern

South America (Colombian and Venezuelan Criollos)

There are many evidences that support Mesoamerica domestication for

Theobroma cacao L. Firstly, the cacao word is a Spanish corruption of Nahuatl

(native language) used for the plant and product, but Indigenous peoples called

to the cacao tree as “Cacahuatl” and “Kakawa”.

Theobroma cacao L was more important in Mesoamerican culture (Maya

and Aztec culture) than Southern American culture; due to Theobroma cacao L

was used in religious ceremony and weeding as special beverage (McNeil,

2006). Also, the preparations, use, and consume of cacao, in Mesoamerica,

was different than Southern America. In Mesoamerica the husk, pulp, and seed

of cacao was prepared and consumed of different method by native peoples,

nonetheless, southern America only consumed the pulp; also cacao was used

as currency in many places as Nicaragua.

Bartley (2005) suggested that Theobroma cacao L had less importance in

southern America due to existences of many species that were used as

stimulant (e.g., Erythroxylum coca), nonetheless in Mesoamerica was the

principal sources used as stimulant.

In Nicaragua, there are archeological evidences about importance and

cultivation of Theobroma cacao L. during pre-Columbian period, which has

been located to North of Chinandega, Leon, Granada, and Rivas as centers of

cacao production in pre-Columbian time. In addition, the North of Chinandega

was presumably the existences of sporadic tree in areas inhabited by native

peoples; nevertheless, in department of Rivas were cultivated and managed a

precious tree in fertile lands (Stainbrenner, 2006).

Page 6: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

6

1.2 The Morphology and varieties of Theobroma cacao L.

The cacao tree seldom exceeds a height of 7.5 m in cultivations and may go up

to 12.5 m or more in wild growing. The main stem is divided in four to six lateral

branches called “Jorqueta” or “fan” with plagiotropic growth. An axillaries bud is

develops with an orthotropic shoot below the Jorqueta, which will again form a

Jorqueta a few feet higher up. Other Jorqueta will develop again, this process

should repeat during four times, producing a leafy canopy.

The mature leaves are dark green, about 37 cm long and 7.5 cm broad,

oblong oval or elliptic oval with prominent veins and veinletss, the short petiole

is provided with two articulations (Kochhar, 1986). The flush (young and

developing leaves) exhibit a range of colors from lightest green to red shades.

The flower is a complex organ and possesses some features that make it

unique and distinctive. The flowers are borne on the bark of the trunk and older

branches, and never on young shoot. Few of many thousand flowers will

develop in fruit. Flowering and fruiting continues through one year.

Theobroma cacao L. flowers are bisexual, regular and pentamerous. The

flower has the variation with regard to their shape and pigmentation. The sepals

are prominent, leathery and fused at the base. The corolla consists of five

petals, smaller than sepals, each having a broad curved, saccate basal and a

much narrower terminal portion expanding terminally into a cup of shaped a

pouch. Ten stamens are arranged in two whorls of five, the outer being

represented by infertile, narrow, ciliate, erect pointed structure forming a fence

around style, while the stamens of the inner whorl have curve filaments. The

stamens and staminoide are fused in the basal region forming a short tube. In

general; apparently there is a relation between color buds and fruit colors

(Kochhar, 1986).

Bartley (2005), the maturity fruit is a berry but commonly called a pod, which

is composed of a fairly thick, leathery, smooth or corrugated pericarp or husk.

The fruit varied in shape from spherical to very narrow elongate, and their colors

ranging from green (immaturity), yellow (mature), and red (immaturity) to purple

(maturity). Generally, a pod has 20 to 40 flat or round seeds, or cacao beans

embedded in a white, pink or brown, aromatic, mucilaginous, sweet or faintly

acid pulp. Each cacao tree produces only one or two pounds of beans every

year (Kochhar, 1986).

Page 7: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

7

The period between fertilization of a flower and the initiation of maturity is

around 135 to 180 days. Self-incompatibility is a series problem that some

genotype presents. The discovery of self-incompatibility in cacao was

determinate as consequence of genetic mechanisms. The mechanism by self-

incompatibility takes place in the ovary of the flower during fertilization, which is

describes as a rejection by the egg nucleus of a male gamete carrying the same

incompatibility alleles as the female gamete. Bartley & Cope (1973) showed that

a series of alleles were involved of which one had a neutral effect, in which case

the rejection mechanism did not function in the egg cells when the gametes are

identical.

The first classification of variety of cacao was proposed by Chessman

(1944), and it was according to morpho geographical groups designated such

as Criollo, Forastero and Trinitario. According to the geographic origin of the

clones, the Forastero group is subdivided into the lower Forastero and upper

Forastero type. The lower Forastero type was first cultivated in the Amazon

basin and the first to be introduced to Africa, also. The upper Forastero type is

highly diverse and often used in plant breeding (Laurent et al, 1994). The

Forastero group has a green pod when immature, turning yellow at maturity.

They are smooth, melon-shaped with rounded ends and relatively thick walled.

Forastero group are hardier, robust, highly productive and resistance to

diseases. The seed colors are purple due to low Theobromine concentration.

The Criollo group, found from Mexico to Colombia and Venezuela, has

narrow genetic bases (Motamayor, et al. 2002). The Criollo group has a yellow

or a red pod when ripe, usually deeply ten-furrowed and warty with long pointed

end. Enclosed within the thin shell is numerous seed. Criollo group has been

characterized by low productivity, susceptibility to diseases, but pleasant aroma

and white cotyledons as consequence of high Theobromine concentration.

According to Bartley (2005), Forastero and Criollo are words derived from

Spanish. Criollo is translated as “Native” and the word was used to defined to

the first variety cultivated in a geographical unit outside to indigenous range of

species to distinguish it from later introduction. In Nicaragua has reported two

kind of cacao Criollo, Dias (2001) reported the Nicaraguan Criollo, which were

referred as the group of Criollo cacao from Nicaragua, and not to native cacao

of Nicaragua. Bartley (2006) cited to Hart (1893), who described two kind of

Page 8: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

8

genotype from Nicaragua, which one of them was called “Alligator cacao” due to

husk characteristics, the another one was called “Nicaraguan Criollo”.

1.3 Importance of production and use of Theobroma cacao L.

According to Lass (2004), cacao cultivation has played an important role in the

transformations of lowland tropical forest landscapes in Latin America, Africa

and Asia over the past centuries and continues to do so today. Cacao is now

grown in some 50 tropical countries, which the most important production take

place in Cameroon (179,239 ton in 2007), Brazil (201,651 ton in 2007), Côte d’

Ivory (1,384,000 ton in 2007), Indonesia (740,006 ton in 2007), Ecuador and

Nigeria (85,891 and 500.000 in 2007). The average per hectare varied between

398.3 kg ha-1 and 800.9 kg ha-1, which Nigeria presented the highest yield ha-1

than others countries.

Currently, in Central America, cacao is produced in Guatemala, Costa Rica,

Honduras and Nicaragua. In the 1990s, Nicaragua had 6,500 hectares

cultivated. However, the Caribbean coast presents larger potential for cacao

cultivars, with its almost 350.000 hectares available to cacao cultivations (INTA,

2005). The yield average is ranged between 100.0 kg ha-1 to 276.1 kg ha-1,

which Nicaragua presented the highest value. However, Costa Rica has more

area harvested in 2007, with 4543 ha (FAOSTAT, 2009). The demand for

chocolate is growing and the question now is whether the world supply of a

product that comes from a strictly tropical, rainforest-inhabiting tree will continue

to meet the demand.

In Nicaragua, the cacao crop is one of the most important crops in the

national politics of agriculture, because the country has a big potential and the

international demands are huge, and it is possible that the national production

does not satisfy the national demand. Therefore the government is promoting

the increase of areas where cacao is grown. This should offer good incomes to

poor farmers and avoiding deforestation in critical areas.

The National Institute of Agriculture and Technology (Instituto Nicaraguense

de Tecnologia Agropecuaria –INTA- in Spanish) has been developing a cacao

breeding program through traditional methods, which is a slow and long

process. The Experimental Center “El Recreo”, situated 285 km from the capital

is part of the INTA. This center has been dedicated to the breeding of cacao

Page 9: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

9

tree. There are 180 clones introduced in the 1970s and 1980s from Colombia,

Costa Rica and Mexico.

In addition, in 2003, the INTA began a national collection of cacao Criollo,

including material from the Caribbean and Pacific coast. The principal objective

has been to include them in the breeding program of cacao beans. Currently,

those samples are growing in the Experimental center “El Recreo”.

“El Recreo” has been doing conventional breeding between Trinitario

cultivars; however, the material does not have molecular support for their

genetic characteristics, as those materials focuses on phenotype

characteristics. This method has resulted in a lot of variability in the new

materials. However, with the use of modern techniques it is possible to improve

some traits, which should be associated such as diseases and poor and the low

quality of the yield. Using molecular markers it should be possible to select

material from the national collection of cacao for diseases resistance and quality

of seed or for breeding before the phenotypic expression. For example, by

identifying a molecular marker associated with pod rot, one could identify

materials for breeding without having to wait until the disease appears in the

fruits. Such measures should greatly reduce the cycle time for the selection of

cacao genotypes.

Currently, breeding programs in cacao are based on crosses between

parents belonging to different groups. However, the classic breeding requires a

long generation time. A more modern technique using genetic markers may be

particularly helpful when used to improve breeding strategies and increase our

knowledge of the genetically important agronomic and quality traits (Lanaud et

al, 1995).

2. Objectives

The present study has the aims to investigate the genetic diversity of the

farmers accessions of Theobroma cacao L. in order to reveal the genetic

relationship among 60 farmers accession and to discover genotypes for

breeding programs.

Page 10: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

10

3. Materials and methods

3.1 Plant materials

A total of 60 accessions collected from Nicaraguan’s farms, 14 breeders

accession, and 2 samples from Ecuador, those samples are available in the

national collections of the Experimental Center “El Recreo”, located at 120 07'

North, 820 24' W in Autonomous Region of the Southern Atlantic (“Región

Autónoma del Atlántico Sur –RAAS -” in Spanish), Nicaragua. Also, two

samples considered as Criollo from CATIE germplasm were included in this

study.

Farmers accessions: The accessions were collected from different farmers

(see figure 1) by National Institute of Agriculture and Technology (Instituto

Nicaraguense de Tecnologia Agropecuaria –INTA- in Spanish). Those sampling

(table 1), it was conducted between 2003 to 2008 in farms around the country

(see figure1). During those collections were considered the parameters related

to cacao Criollo as a trunk with a single plagiotropic branch, leaf with pale green

flush, flowers with white staminodes, an elongate fruits (Pod) with long acute or

attenuate, rough and a ridged surface with a thin or soft husk, and large size

seeds with white cotyledons (Figure 2).

International accessions: In order to use samples with knowing genetic

structure, two samples of international germplasm from CATIE, Costa Rica

were used. Criollo 13 was collected in Panama and Yucatan in Mexico, both

samples considered as pure Criollo due to morphologic characteristic. The

samples of Ecuador were introduced in 2003 and were established in

Experimental Center El Recreo.

Breeders samples: In 1978, INTA introduced a total of 189 clones from

Costa Rica, South America, Africa, Jamaica, and Indonesia. Those clones were

evaluated during 25 years, under tropical rain condition. In 1998, INTA selected

clones with a high yield and tolerance to specific diseases such as frosty pod

and black pod. The clones selected were used as parents in the breeding

program. A total of 14 hybrids (Table 2) were produced and distributed widely

throughout the whole country.

Page 11: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

11

Figure 1: Map of Nicaragua with sites of farmer’s collections: 1. RAAS (Región

Autónoma del Atlántico Sur), 2. RAAN (Región Autónoma del Atlántico Norte),

3. RSJ (Rio San Juan), 4. MAT (Matagalpa), 5. Pacifico sur (5.a Masaya, 5.b

Granada and 5.c Rivas), 6. Chi (Chinandega)

Page 12: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

12

Figure 2: Characteristics of farmers accessions collected on Nicaraguan farm.

a) Tree habit on farm. b) Pale green flush. c) Long and thin leaf. d) White flower.

e) Fruit shape. f) Long and white seed.

Page 13: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

13

Table 1: Description of Nicaraguan farmers accessions. (See Figure 1 for the

location in the map)

*RAAS (Región Autonoma del Atlántico Sur), 2. RAAN (Region Autonoma delAtlántico Norte), 3. RSJ (Rio San Juan), 4.MAT (Matagalpa), 5. Pacifico sur(Masaya, Granada and Rivas), 6 Chi (Chinandega).

Name Location GroupRAAS 0401 Region Autonoma Atlantico Sur RAASRAAS 0402 Region Autonoma Atlantico Sur RAASRAAS 0403 Region Autonoma Atlantico Sur RAASRAAS 0404 Region Autonoma Atlantico Sur RAASRAAS 0405 Region Autonoma Atlantico Sur RAASRAAS 0406 Region Autonoma Atlantico Sur RAASRAAS0407 Region Autonoma Atlantico Sur RAASRAAS0408 Region Autonoma Atlantico Sur RAASRAAS 0409 Region Autonoma Atlantico Sur RAASRAAS 0410 Region Autonoma Atlantico Sur RAASRAAS 0411 Region Autonoma Atlantico Sur RAASRAAS 0412 Region Autonoma Atlantico Sur RAASRAAS0413 Region Autonoma Atlantico Norte RAANRAAS 0414 Region Autonoma Atlantico Norte RAANRAAN 0401 Region Autonoma Atlantico Norte RAANRAAN 0403 Region Autonoma Atlantico Norte RAANRAAN 0404 Region Autonoma Atlantico Norte RAANRSJ 0401 Rio San Juan RSJRSJ 0402 Rio San Juan RSJRSJ 0403 Rio San Juan RSJRSJ 0404 Rio San Juan RSJRSJ 0405 Rio San Juan RSJRSJ 0406 Rio San Juan RSJRSJ 0504 Rio San Juan RSJRSJ 0501 Rio San Juan RSJRSJ 0503 Rio San Juan RSJRSJ 0508 Rio San Juan RSJMAT 0401 Matagalpa MatagalpaMAT 0402 Matagalpa MatagalpaMAT 0404 Matagalpa Matagalpa

Page 14: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

14

Tabla 1: continuation...

* 5. Pacifico sur (Masaya, Granada and Rivas), 6 Chi (Chinandega).

Name Location GroupMAS 0401 Masaya Pacifico surMAS 0402 Masaya Pacifico sur

PACAYITA 0108 Masaya Pacifico surSJC 0108 Masaya Pacifico sur

CATARINA 0108 Masaya Pacifico surMENIER 0108 Granada Pacifico sur

EC 0108 Granada Pacifico surMOMBACHO 0108 Granada Pacifico surMOMBACHO 0208 Granada Pacifico surMOMBACHO 0308 Granada Pacifico sur

RIV 0401 Rivas Pacifico surRIV 0405 Rivas Pacifico sur

H-1 Granada Pacifico surH-2 Granada Pacifico surH-3 Granada Pacifico surH-4 Granada Pacifico surH-6 Granada Pacifico surH-5 Granada Pacifico sur

CHI 0306 Chinandega ChiCHI 0308 Chinandega ChiCHI 0304 Chinandega Chi

CHI 0301A Chinandega ChiCHI 0407 Chinandega ChiCHI 0409 Chinandega ChiCHI 0410 Chinandega ChiCHI 0411 Chinandega ChiCHI 0412 Chinandega ChiCHI 0414 Chinandega ChiCHI 0415 Chinandega ChiCHI 0405 Chinandega Chi

Page 15: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

15

Table 2: Description of breeders accessions and international accessions of

Theobroma cacao L.

* Sample provided by CATIE – Centro Agronómico tropical de investigación y

enseñanza, Costa Rica-

Name Type location

Group

UF 650 xPOUND 12

Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

UF 667 x Pound12

Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

EET 400 x Pound12

Hybrid (Forastero) ElRecreo

Breeders accession

UF 296 x Pound12

Hybrid x Forastero ElRecreo

Breeders accession

EET 399 x SCA12

Hybrid x Forastero ElRecreo

Breeders accession

IMC 67 x UF 613 Forastero x Trinitario ElRecreo

Breeders accession

UF 613 x Pound12

Trinitario x Forastero ElRecreo

Breeders accession

UF 668 x UF 613 Criollo x Trinitario ElRecreo

Breeders accession

UF 667 x SCA 6 Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

UF 29 x Pound 12 Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

EET 95 x SCA 6 Hybrid x Forastero ElRecreo

Breeders accession

IMC 67 x UF 12 Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

UF 654 x Pound12

Trinitario (Criollo xForastero)

ElRecreo

Breeders accession

UF 676 x Pound12

Hybrid x Forastero ElRecreo

Breeders accession

ER-011 Ecuador ElRecreo

internationalAccession

ER-012 Ecuador ElRecreo

internationalAccession

Criollo 13 Criollo Catie* internationalAccession

Yucatan Criollo Catie internationalAccession

Page 16: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

16

3.2 DNA extraction

Dried and fresh young plant leaves were used for DNA extractions following the

standard protocol for DNA extraction described by Doyle and Doyle (1990). A 1

cm2 piece of leaf was placed into a 1.5 ml Eppendorf tube and 500 µl of the

CTAB buffer (2% CTAB, 1.4 mM Tris pH 8, 1 % PVP, 0.2 % 2- -

mercaptoethanol added before use) and small amount of sterilized sand were

added, after which grinding was conducted with a plastic Kontle Pellet Pestle.

After were incubated at 65 0C for 30 minutes. Then added 500 µl of CIA 24:1

(Chloroform: Isoamylalcohol) and mixed slowly, then was centrifuged at 14,800

g for 5 minutes. After removing supernatant into new 1.5 ml Eppendorf tube and

500 CIA 24:1 was added, and again centrifuged at 14,800 g for 5 minutes. Then

supernatant was removed into new 1.5 ml Eppendorf tube. To precipitate the

DNA, 2/3 of final volume of cold isopropanol was added and it was kept at -20ºC for 10 minutes. After centrifuging at 14,800 g for 5 minutes was decanted

liquid without disturbed and the pellet was washed added 100 µl of 70 %

ethanol and centrifuged for 5 minutes at 14,800 g for 2 minutes. Then the pellet

was dried. The pellet was re-suspended by adding 50 µl of low salt TE (100 mM

Tris-Cl and 1 mM EDTA pH 8). After was added 0.3 µl of RNAse (100mg/ml),

and then was incubated at 37 ºC for 1 hour. The DNA was electrophoresed on

0.8 % agarose gel, stained with ethidium bromide and visualized under UV

trans-illuminator for quality DNA assessment. DNA quality was estimated from

the clearness of the band, and the amount of DNA was estimate from bands

intensity in comparison with GeneRuler marker (50 bp DNA leader).

3.3 Microsatellite Analysis

DNA polymorphisms were detected by polymerase chain reaction (PCR), using

9 microsatellites markers (Table 3 describes microsatellites used in this study

developed for cacao by Lanaud, 1999). PCR reaction was performed in a

volume of 20 µl at testing state and 10 µl in the final analysis. The 10 µl reaction

mixture contained 0.5 µl DNA (2-3 ng of total genomic DNA per 1 µl of reaction

volume) as a template, 6 µl of MQ water, 1 µl 10x reaction buffer, 0.2 µl dNTP

mix, 0.3 µl (0.6 U) DynazymeTM II DNA Polymerase, 1 µl (5mM) forward primer

Page 17: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

17

and 1 µl (5mM) reverse primer (the forward primers were fluorescently labeled

at 5’ end).

The DNA amplification were performed in a thermocycler (Bio-Rad,

C1000TM Thermal Cycler), with the following programmed; a denaturation step

of 4 minutes at 94 ºC followed by 34 cycles followed by denaturation for 40

second at 94 ºC, annealing for 40 second at 46 ºC or 51 ºC depending on

microsatellite, extension at 72 ºC for 1 minute, and a final extension at 72 ºC for

8 minutes.

3.4 Capillary electrophoresis and data collection

The PCR product was diluted 1/19, using 1µl of PCR product and 19 µl of Mille

Q water. From this solution was used 0.5 µl of each template and mixed with 20

µl of HiDi-formamide and 0.15 µl of size standard (GeneScan 500 ROX).

Before the run the samples were briefly denatured (5 min, 95 0C). Finally the

samples were loaded with Applied Biosystems 3730 DNA Analyzer into 96-well

plates. Fragment sizes were determined by Peak Scanner™ Software v1.0.

Page 18: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

18

Table 3: Description of the microsatellite loci developed by Lanaud et al. (1999),with primer sequences used. The fluorescent labels used in this study areshown.

3.5 Data Analysis

3.5.1 Genetic diversity

For farmers selection, the following genetic diversity parameters were estimated

using POPGENE software v 3.2. (Yeh and Boyle, 1997)

1. Number of allele per locus. It was accounted for each amplified locus.

Effective number of allele. It was estimated with GENALEX v.6.2 software

program.

2. The mean of observed and expected heterozygosity (HObs and HExp). The

Heterozygosity was calculated at each locus using the formula proposed by

Nei, 1987.

For observed heterozygosity:

HObs= 1- Pij, (1)

SSR EMBLAccession

Linkagegroup

5’Forwardprimer

3’ Reverseprimer

AnnealingTemp 0C

Label

Size ofcloned

allele (bp)

Repeat

MtcCIR1 Y16883 8 GCAGGGCAGGCTCAGTGAAGCA

TGGGCAACCAGAAAACGAT

51 HEX 143 (CT)14

MtcCIR7 Y16981 7 ATGCGAATGACAACTGGT

GCTTTCAGTCCTTTGCTT

51 HEX 160 (GA)11

MtcCIR8 Y16982 9 CTAGTTTCCCATTTACCA

TCCTCAGCATTTTCTTTC

46 TET 301 (TC)5TT(CT)17TTT(C

T)4MtcCIR

10Y16984 5 CCGAATT

GACAGATGGCCTA

CCCAAGCAAGCCTCATACTC

46 FAM 208 (TG)13

MtcCIR11

Y16985 2 CATTGCGGATTACGGTTTTT

TGATTAAGCACACGAGCACTG

46 FAM 298 (TC)13

MtcCIR12

Y16986 4 TTTCTGACCCCAAACCTGTAA

TTCCAGTTAAAGCACATGAGGA

46 FAM 188 (CATA)4N18(T

G)6

MtcCIR15

Y16988 1 CAGCCGCCTCTTGTTAG

TATTTGGGATTCTTGATG

46 TET 254 (TC)19

MtcCIR18

Y16991 4 GCTAAGGGGATTGAGGAAGC

TGGGTTGCAGTCAATGTCTC

46 FAM 345 (GA)12

MtcCIR21

Y16995 3 GTCGTTGTTGATGTCGGT

GGTGAGTGTGTGTGTTTGTCT

46 TET 157 (TC)11N5(CA)

12

Page 19: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

19

Where Pij= wiPij at each locus, which wi is a relative size of sub population and

wi=1 and Pij is a genotype frequency

For expected heterozygosity:

HExp =1- p2j (2)

Where p2j= wip2

ij, Pij is the frequency of the jth allele at ith locus summed at

across all alleles of the locus.

The polymorphism information content value is a value used to measure the

polymorphism for a marker locus (Botstein et al. 1980). It depends only on the

number of alleles and the frequency of each allele at the marker locus (Guo and

Elston, 1998).

The PIC estimator was calculated for each locus using the formula proposed by

Anderson et al., 1993:

PICi= 1- p2ij (3)

Where: pij is the frequency of the jth microsatellite allele.

The Shannon index is once the most commonly measurement used to compare

diversity among individuals. It has been widely used in studies as ecological

diversity and diversity of social relationship among adult primates and female

primates (Di Bitetti, 2000). The Shannon diversity index is tipically used to

measure the number of species in a community (the richness) and the relative

frequency of these species.

The Shannon index was calculated for each locus and groups using

software Arlequin ver. 3.1. (Excoffier et al. 2005)

3.5.2 The genetic differentiation

Genetic differentiation was quantified by the analysis of molecular variance

(AMOVA) and F-statistics estimator (FST), as described by Weir and Cockerha

(1984), Excoffier et al. (1992) and Weir (1996) using software Arlequin ver. 3.1.

(Excoffier et al. 2005)

Page 20: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

20

3.5.3 Estimation of Genetic Distance

The genetic distance is estimation between two entities that can be described

by allelic variation (Nei´s, 1973).

The genetic identity (Di) and genetic distance (D) between groups were

calculated based on Nei´s (1987) unbiased genetic distance.

According to Nei´s (1978), the genetic distance is defined as:

D= -ln [GXY GXGY] (4)

Where: GX, GY and GXY are the means of pi2, qj

2 and piqi over all loci in the

genome, respectively.

For genetic identity, GX, GY and GXY, by sample gene identities, JX, JY, and JXY,

which are the means of xi2, yi

2, and xiyi over r loci studied. However, from D

was obtained by substitution the unbiased estimates of GX, GY for JX and JY:

= -ln[ XY X Y] (5)

Where: X and Y are the means of (2nXJX-1)/ (2nX-1) and (2nYJY-1)/ (2nY-1)

over r loci studied, respectively, and XY = JXY.

Nei (1978), suggested the used of unbiased when D bias has a large

magnitude, because bias is negligible. The unbiased estimator of Di and D were

obtained using POPGEN software.

3.5.4 Cluster analysis

From the original microsatellite data, a bootstrap values were computed over

1,000 replications with Seqboot program. After, the Mix program was used to

cluster the accessions according to the Wagner parsimony method (Eck &

Dayhoff, 1996; Kluge & Farris, 1969) for discrete character data. Then, the

Consense program was used to find the consensus tree applying the extended

majority rule. The Seqboot, Mix, and Consense program are from Phylip version

3.68 (Felsentein, 2004). The tree was drawn using the Treeview software

version 1.6.6 (Page, 1996).

Page 21: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

21

3.5.5 The Principal component analysis

In order to know the relationship between different individuals was carried out

the Principal Component Analysis (PCA), using GENALEX v.6.2 software

program.

The PCA allows identifying differences among the individuals and identifying

possible group. PCA derives a 2 or 3-dimensional scatter plot of individuals,

making geometrical distances among individuals in the plot reflect the genetic

distances among them with minimal distortion (Mohammadi & Prasanna, 2003).

This analysis used a variance-covariance matrix, considering absolute changes

between individuals. The farmers accessions were compared again breeders

materials and internationals accessions in order to know the relationship

between them.

4. Results

4.1 Genetic diversity of Nicaraguan farmers accessions

A total of 155 alleles were identified in the 60 farmer’s accessions with a mean

of 17.2 alleles per locus (Table 4). The microsatellites MtcCIR8 detected high

polymorphism with 22 alleles per locus. Moreover, the lowest detection was

identified by MtcCIR1 with 10 alleles per locus. (See appendix 1 for allele

frequency).

The effective number of alleles per locus had a mean of 3.7. However, the

highest effective number of allele per locus was found by MtcCIR15 with 6

alleles per locus and most of rest microsatellites were ranged between 2.9 and

4.2 effective number alleles per locus.

Heterozygote deficiency (HExp < HObs) was registered for all microsatellite

used in this study. The lowest observed Heterozygote valued was presented by

MtcCIR7 with 0.28 and the highest value was presented MtcCIR21 with 0.53,

respectively. The highest value for the heterozygote expected was 0.82 for the

microsatellite MtcCIR15 and 0.61 was the lowest value for the microsatellite

MtcCIR10.

Page 22: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

22

The PIC values presented were generally high for all loci (PIC > 0.5). PIC

values varied between 0.73 and 0.91, with a mean 0.78. The microsatellite

MtcCIR15 shown the highest value with 0.909 and MtcCIR8 presented the

lowest value with 0.73. The Shannon diversity index showed a mean of 1.97.

The microsatellite MtcCIR1 presented lowest value with 1.53 and MtcCIR15 the

highest value with 2.63, respectively.

Table 4: Genetic parameters for microsatellite markers L. based on Farmer’s

accession from Nicaragua using 9 microsatellites : number of alleles per locus

(N), effective number of alleles per locus (Ne), heterozygosity observed (Hobs)

and expected (HExp), PIC (Polymorphism information content), and Shannon

genotypic diversity index, based on Farmer’s accession from Nicaragua.

Microsatellite N* Ne* Hobs Hexp PIC I*MtcCIR1 10 2.9 0.52 0.62 0.75 1.53MtcCIR7 12 2.9 0.28 0.63 0.75 1.69MtcCIR8 22 3.3 0.45 0.61 0.73 2.03

MtcCIR10 18 4.2 0.49 0.64 0.85 2.22MtcCIR11 20 3.4 0.43 0.65 0.75 1.91MtcCIR12 15 2.9 0.54 0.65 0.70 1.54MtcCIR15 20 6.0 0.47 0.82 0.91 2.63MtcCIR18 18 4.2 0.41 0.70 0.81 2.14MtcCIR21 20 3.6 0.53 0.68 0.76 1.97

Mean 17.22 3.7 0.47±0.06 0.67±0.08 0.78±0.07 1.97±0.35*Ne = Effective number of alleles [Kimura & Crow (1964)]

***I = Shannon Index [Lewontin (1972)]

HObs and HExp were computed using Levene (1949)

4.2 Genetic diversity of Nicaraguan farmers accessions at group level

The genetic parameters used were a number of alleles per locus (N), Number of

different alleles per locus (Na), number of effective alleles per locus (Ne), and

heterozygosity observed (Hobs) and expected (HExp), and Shannon information

Index (I) (Table 5). Those parameters were evaluated for farmer accession,

international accession and breeder accessions.

The total means of number alleles per locus in whole groups was 5.7. In

addition, the groups RAAS and Pacifico sur obtained a high number of alleles

per locus with 8.4 and 7.8, respectively. Moreover, Matagalpa and RAAN

presented lowest value for number alleles per locus with 2.8 and 3.4.

Page 23: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

23

On other hand, the heterozygote presented a deficit in all groups (HObs >

HExp). The total means of HObs for whole groups studied was 0.46; the highest

values were presented by Chinandega and Rio San Juan with 0.55 and 0.51,

respectively. Pacifico Sur and RAAS presented lowest values with 0.34 and

0.42. The highest value for HExp was 0.79 for RAAS and the lowest 0.48 for

Matagalpa.

Shannon information Index (I) presented variation between 0.82±0.15

(Matagalpa) and 1.81±0.14 (RAAS).

Table 5: Genetic diversity of Theobroma cacao L. L. based on Farmer’s

accession from Nicaragua, international accession and breeders accessions,

using 9 microsatellite: mean number of alleles per locus (Na), Number of

different alleles per locus, number of effective alleles per locus (Ne), and

heterozygosity observed (Hobs) and expected (HExp), and Shannon information

Index (I).Group HObs HExp IRAAS 0.42 0.79 1.81RAAN 0.44 0.65 1.14

Rio San Juan 0.51 0.69 1.47Pacifico sur 0.34 0.76 1.66Chinandega 0.55 0.64 1.32Matagalpa 0.48 0.48 0.82

Mean for Fameraccessions

0.46±0.17 0.67±0.08 1.37±0.12

Group

Internationalaccessions

0.26±0.06 0.75±0.1 1.26±0.17

Breeder accessions 0.66±0.08 0.78±0.32 1.66±0.12Mean 0.46±0.07 0.77±0.21 1.46±0.15

The Analysis of molecular variance (AMOVA) showed that 6.1 % of genetic

variation was due to variance among groups, whereas 93.9 % was divided

within groups which show statistic different in AMOVA for both components

among groups and within group (Table 6). The mean FST presented for all group

was 0.06.

Page 24: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

24

Table 6: Component of Analysis of molecular variance of Theobroma cacao L.

L. based on Farmer’s accession from Nicaragua using 9 microsatellites

Source ofvariation

d.f Sum ofsquares

Variancecomponents

Percentage ofvariation

Among groups 5 36.76 0.22 Va 6.1Within groups 114 376.77 3.31 Vb 93.9

Total 119 413.53 3.52Mean FST 0.06

The pair comparison between groups (Table 7), shown that group RAAS has

not presented statistic difference between Matagalpa groups, but has statistic

significance with rest groups. RAAN has only statistic difference with

Chinandega and Matagalpa, Rio San Juan has only statistic difference with

Chinandega.

Table 7: Pairwise (FST) comparisons of Theobroma cacao L. for groups of

farmers accession from Nicaragua using 9 microsatellitesRAAS RAAN Rio San

JuanPacifico

SurChinandega Matagalpa

RAAS 0RAAN 0.10490* 0

Rio SanJuan

0.06058* 0.09236 0

Pacifico sur 0.03716* 0.09309 0.02645 0Chinandega 0.08957* 0.14122* 0.04623* 0.04606* 0Matagalpa 0.07369 0.20582* 0.07564 0.07711 0.08247 0

*significant at P< 0.05

4.3 Genetic distances

Nei’s distances and genetic identity matrices are shown in table 8. Those

results were based on analysis of six groups including all individuals. The

ranged presented for Nei’s distances was between 0.1374 for group

Chinandega and 0.9096 for Matagalpa. The genetic distance between RAAN

and Matagalpa was the highest, and Rio San Juan with Pacifico Sur was the

lowest.

Genetic identity values varied between 0.4027 and 0.8716. Pacifico Sur and

Rio San Juan obtained the highest value and Matagalpa with RAAN the lowest

value, respectively.

Page 25: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

25

Table 8: The Genetic identity and genetic distance of Theobroma cacao L.

based on Farmer’s accession from Nicaragua using 9 microsatellitesRAAS RAAN Rio San

JuanPacifico Sur Chinandega Matagalpa

RAAS **** 0.4438 0.7438 0.7992 0.7002 0.7437RAAN 0.8124 **** 0.6066 0.5631 0.5330 0.4027

Rio SanJuan

0.2960 0.4999 **** 0.8716 0.8464 0.7624

Pacifico sur 0.2242 0.5744 0.1374 **** 0.8505 0.7558Chinandega 0.3564 0.6293 0.1667 0.1620 **** 0.7726Matagalpa 0.2961 0.9096 0.2713 0.2800 0.2580 ****

*Nei's genetic identity (above diagonal) and genetic distance (below diagonal).

4.4 Cluster analysis

The cluster analysis classified whole accessions into 9 groups and three distinct

individuals. The distinct individuals were RAAS0405, RAAS0401, and

RAAN0404. The first group was divided into two subgroups and one distinct

individual. The first subgroup was formed for two hybrids (UF676xP12 and

UF29xP12), the second subgroup were included three individuals (ER012,

Chi0304, Chi0411). A hybrid (IMC67xUF613) was clustering as distinct

individual into this group. The second group was formed by four individuals

(RAAS0409, Chi0407, Chi0308, and RAAN0401); all of them presented a

strong clustering relationship (96%)

The third group was formed for one hybrid (UF668xUF613) and one farmer

accession (RSJ0404). In the fourth group were included two hybrids

(UF654xP12 and EET95xSCA6) and two farmers accession (RAAS0407 and

RSJ0503); they were exhibited a strong clustering relationship. The fifth group

present bootstrap values below 80%. However, this group was divided into two

subgroups. The first subgroup presented a strong relationship and was formed

for 5 individuals Pacayita0108, Mombacho0108, UF650x P12, Chi0410 and

Chi0301a), the second subgroup presented a weak clustering relationship

(below 80%).

The sixth group presented a strong relationship (100%) and was formed

from three individuals (RAAS0408, RAAS0404, and Chi0415). The seventh

group presented a weak clustering relationship value (below 80%). However,

into this subgroup was presented a strong bootstrap value. The eighth group

was formed for 25 individuals with a strong clustering relationship (100 %). In

this group was included the CRIOLLO 13, which presented a strong clustering

relationship with Mat0404. (Figure 3)

Page 26: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

26

Raas0405Raas0401Raan0404Imc67xUf613Uf676xP12Uf29xP12Er012Chi0304Chi0411Raas0409Chi0407Chi0308Raan0401Uf668xUf613Rsj0404Rsj0503Raas0407Uf654xP12Eet95xSca6Raas0403Chi0301aChi0410Uf650xP12Mombacho0108Pacayita0108Rsj0504Uf613xP12Eet00xP12Chi0412Chi0306Raas0408Raas0404Chi0415Rsj0406ChI0414Uf667xP12Raas0413YucatanH6Chi0409H1H2H4Chi0405Mat0402Uf296xP12Riv0405Sjc0108Imc67XUf613Mombacho0208Ec0108Catarina0108Mas0402Raas0410Mombacho0308Eet399xSca12Raas0414Riv0401Uf667xSca6Raan0403Rsj0401Raas0402Raas0406H3H5Mas0401Rsj0508Rsj0402Raas0411Mat0404Criollo13Raas0412Er011Menier0108Rsj0501Rsj0405Rsj0403Mat0401

100

100

100

100

100

100

100100

10010093

100

92

100

96

100

100

100

100

100

100

100100

100100

100

100

96

10093

95 100

100100

98

95

82

9999

10098

100

10094

100

96

100

100100

100

100100

96100

9999

100

1

2

3

4

5

6

7

8

Figure 3: Cluster analysis bootstrapped over 1,000 replicates, based on the

genetic distances between 60 farmer accessions from Nicaragua, 4

internationals accessions and 14 improve samples of Theobroma cacao.

Bootstrap values above 80 % were marked.

Page 27: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

27

4.5 Principal component analysis

The principal component analysis distributed all individuals in four different axes

(Figure 4). The axis one was found 24 individuals. In this axis was included

only three improved individuals, which were dominated by the cross Forastero x

Trinitario. Also, three pairs individuals were found with equally genetic alleles:

MAT0402 with CHI0301A, RSJ0504 with RSJ0404, and the last pair were

RAAN0404 with RAAS0409.

A total of 17 individuals were found in axis 2. The Criollo 13 and Yucatan

were included in this axis. In addition, three samples were found related to

Criollo 13 such as SJC0108, Catarina0108 and MAS0402. Also, Yucatan was

related with RAAS0406, RAAS0410. RAAS0414, and Menier0108.

Furthermore, RAAN0403 was found related with ER-012 (Ecuador). In addition,

most materials found in this axis were homozygote.

In the Axis three were found 25 individuals. Moreover, the majority of

improved materials were found in this axis. In additions, two pairs of individuals

were found close related with RAAS0403 with UF296xP12, and RAAS0401 with

EET95xSCA6. The rests of individuals (12) were distributed in axis 4. Further,

H5 and H6 had the same genetic structure.

Page 28: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

28

RAAS0401

RAAS0402

RAAS0403

RAAS0404

RAAS0405

RAAS0406

RAAS0407

RAAS0408

RAAS0409

RAAS0410

RAAS0411

RAAS0412

RAAS0413

RAAS0414

RAAN0401

RAAN0403

RAAN0404

RSJ0401

RSJ0402

RSJ0403

RSJ0404

RSJ0405 RSJ0406

RSJ0501

RSJ0503

RSJ0504

RSJ0508

MAS0401

MAS0402

RIV0401

RIV0405

H1

H2

H3H4

H5 H6

Pacayita0108

SJC0108

Menier0108

EC0108

Mombacho0108

Mombacho0208

Mombacho0308

Catarina0108

CHI0301A

CHI0304

CHI0306

CHI0308

CHI0407

CHI0409

CHI0410

CHI0411

CHI0412

CHI0414

CHI0415

CHI0405

Criollo13

Yucatan

ER-011

ER-012

UF650xP12

UF667xP12

EET400xP12

UF296xP12

EET399xSCA12

IMC67xUF613

UF613xP12

UF668XUF613

UF667xSCA6

UF29xP12

UF654xP12UF676xP12

EET95xSCA6

IMC67xP12

MAT0401MAT0402

MAT0404

Co

ord

. 2

Coord. 1

Principal Coordinates

Pop1

Pop2

Pop3

Pop4

Pop5

Pop6

Pop7

Pop8

Figure 4: Principal component Analysis (PCA) plot for 78 individuals using 9

microsatellites, based on the genetic distances between 60 farmer accessions

from Nicaragua (Pop1= RAAS, Pop2 = RAAN, Pop3 = RSJ, Pop4 = Pacifico

Sur, Pop5 = Chi, Pop7 = Mat), 4 internationals accessions (Pop6) and 14

breeders accessions (Pop7) of Theobroma cacao.

Page 29: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

29

5. Discussion

In the present study were used microsatellites genotyping to examine the

genetic diversity, population structure, and the relationship between farmers

accession, breeders accession and internationals accessions.

5.1 Genetic diversity and population structure

Studies of genetic diversity have been carried out using isozymes, RAPD,

AFLP, RFLP, and SSR (microsatellite) markers in Theobroma cacao L. due to

the importance of this crop in chocolate industry. AFLP has been used to

distinguish cultivars and create a linkage map in cacao (Davey et al. 1998;

Risterucci et al. 2000). In previous studies, Motamayor et al. (2002) and

Risterucci et al. (2000) have shown that microsatellites are more efficient than

RFLP in genetic diversity study for Theobroma cacao L., because

microsatellites have allowed detected more alleles per locus as well as higher

level of polymorphism and are equally powerful tools for estimation of

heterozygosity.

Microsatellites have been widely used to study genetic diversity in

Theobroma cacao (Zhang et al. 2009; Cryer et al. 2006; Efombagn et al. 2006;

Sereno et al. 2006; Takrama et al. 2005; Fossati et al. 2005; Saunders et al.

2004; Motamayor et al. 2003; Motamayor et al. 2002; Marita et al. 2001;). The

main purpose of those analyses has been to identify differences between Criollo

groups and Forastero groups.

The polymorphic information content (PIC) was used to measure the ability

of a marker to detect polymorphism in cacao samples of Nicaraguan farmers

accession. In the present study, the PIC value showed a high power of

discrimination (PIC>0.78), of the loci used in Nicaraguan famers accessions,

which is an indicator of a high genetic diversity for nine loci studied. This

corroborated the use of these microsatellite loci in the estimation of genetic

diversity among each famer accession collected in Nicaragua. This result has

similarity with the result found by Zhang et al. 2006 and Efombagn et al. 2006

(0.58 and 0.69, respectively).

In this study, the overall allelic richness per locus presented was (17.2

overall allelic per locus and 3.7 effective allelic per locus). The level of

Page 30: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

30

polymorphism detected in the present study would be affected by the mutation

rate of microsatellite and the small size population in the present studied.

According to Jin et al. (1996), Thuillet et al. (2002), and Wilkinson (2000),

mutation rate of microsatellite can be originated by replication slippage, which

can create high similarity in allele’s sizes and differences between few repeat

units.

The allelic diversity (relatively low Ne= 3.7), presented in this study was

similar to results presented by Smulder et al. (2009), that reported a low allelic

diversity (Ne= 3.54) for cacao populations from South America, and by Zhang et

al (2006) with an allelic diversity of 3.68 for semi natural population from

Huallaga, Peru, while Alves et al. 2007 presented an average 3.37 of alleles per

locus for a natural population of Theobroma grandiflorun. Previous studies, in

Theobroma cacao, using microsatellite found averages of 5.7 allelic per locus

for managed populations (Zhang et al. 2006). Studies realized separately in

Brazil and French Guiana presented averages of 4.45 and 4.48, respectively.

Those values were considered moderately high for richness allelic (Sereno et

al. 2006; Lachenaud & Zhang, 2008). In additions, low allelic diversity could be

explain by the fact that cacao is an endemic plant from Central America or

genetic drift, which would have reduced allelic diversity or allelic loss in that

accessions.

The heterozygote deficit (HExp<HObs) showed an inbreeding in all Nicaraguan

famers’ accessions analyzed. The results presented in study coincided with

results presented in previous studies by Zhang et al. (2009); Lachenaud &

Zhang, (2008); Cryer et al. (2006); Efombagn et al. (2006); Sereno et al. (2006);

Takrama et al. (2005); Fossati et al. (2005); Saunders et al. 2004; Motamayor et

al. 2003; Motamayor et al. 2002; Marita et al. (2001). The inbreeding could be

related to the biology of the reproductive system in cacao plant and the

selection that famers have been managed with cacao plantation through time.

Nonetheless, the inbreeding was unexpected, as cacao is an outcross species,

which it would be assumed an absence of inbreeding, which indicate that most

individuals into Nicaraguan farmers accessions are homozygosis and have self-

compatibility. In that sense, RAAS and Pacifico Sur presented more inbreeding,

compare to places where that have been considered as places where

indigenous tribes have managed and selected the cacao trees since the pre-

Columbian period. In that way, Dias (2002), have supported the idea that

Page 31: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

31

Indigenous people started the cacao selection based on the white beans, by

doing so, they indirectly selected self-compatibility, promoting inbreeding.

Breeder accession was expected to present high level of heterozygosis

(HObs= 0.66) as they are product of strong human intervention, Chi and RSJ

groups, also presented relative value with breeder accession (HObs = 0.55 and

0.5, respectively), which was expected the presence of high heterozygosis in

those areas where cacao crops has been promoting with breeding accession by

national government and nongovernmental organization.

The gene diversity (HExp= 0.67) presented in this study was similar to

results presented by Sereno et al. (2006) for population form the Brazilian

Amazon (HExp= 0.5), also Motamayor et al. (2002) presented a similar gene

diversity for modern Criollo (HExp= 0.52). Motamayor et al. (2002) defined as

"Modern Criollo" those samples that have presented the morphological trait that

were described by Chessman (1944), but collected on farms. Farmers

accessions presented less gene diversity than breeders accession, and it was

expected, because Breeder accessions are the product of intensive genetic

recombination in hybridization realized. This result coincides with previous

comparison between farmers selection and breeders accession realized in

Cameroon by Efombang et al. (2006), also, Laurent et al (1993) found a high

level of polymorphism among Criollo group and low level in Forastero group.

The study found that RAAS and Pacifico sur of farmers accessions are more

close to International accessions (Criollo group) than the rest of groups studied

(Table 7), and RSJ and Chi are more related to Breeders materials. Several

studies have coincided that during pre-Columbian time, cacao Criollo was

distributed along the Pacific Coast in Nicaragua (Steinbrenner, 2006; Bartley,

2004; Ciferri & Ciferri, 1957; Bergmann, 1969), which result presented in this

study support previous hypothesis about Criollo distribution. As cacao

exchange and the human migration until jungle as consequence of the

conquest, indigenous carry out cacao samples, for this reason samples of

RAAS are similar to Pacifico Sur.

Moreover, RAAS and Pacifico Sur presented heterozygosis similar to

international accession (Criollo reference). Moreover, RSJ and Chi groups were

more close to breeders accession, due to the recent promotion of cacao crops

in Nicaragua, which indicate that farmers have obtained breeders materials and

Page 32: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

32

this action have allowed the genetic introgression between Criollo group and

breeder accession.

5.2 Groups differentiation and genetic proximities

The analysis of molecular variance (AMOVA), shown that most variation was

found within groups (Table 6) while low variation among groups. This was

expected due to the endemic nature of Thebroma cacao in the region (Central

America and South America). According to Silvertown & Charlesworth (2001),

endemic species have a low variation among group as they have a unique

genetic structure.

Additionally, the pairwise comparison (Table 7) shown that RAAS and

Pacifico Sur have low genetic variability (0.037), this result coincided with

genetic distance between both groups, which was more near among the others

groups. It seems that exchange of materials due to human migration, have

reduced the genetic distances between both group (RAAS and Pacifico Sur).

The Cluster and PCA analyses allowed identify the relationship between

individuals. The cluster analysis showed a strong clustering relationship among

individuals (Figure 3). Four undistinguished farmers accessions were indentified

as well as some stronger genetic relation between farmers accessions and

international accessions (Criollo). This was expected, as researcher such as

Dias (2002), Bartley (2005) and Laurent et al. (1993) have considered the

existence of Criollo sample in Nicaragua. Nonetheless, Motamayor (2002)

considered that most cacao samples in Nicaragua are Trinitario type, as

consequence of outcrossing among Forastero and Criollo. The cluster analysis

found many farmers accession and breeders accession with a strong

relationship.

An important finding in the cluster analysis was the strong relationship

among ER011 from Ecuador and Nicaraguan farmers accession (Menier0108,

Criollo 13, RAAS0412, RSJ0405, RSJ0403, and Mat0401). According to Dias

(2002), the Nacional accession from Ecuador is more genetically related to

Criollo than forasteros, also Bartley (2006) reported that first gene flow of

Theobroma cacao from Nicaragua occurred in 1893, when Harts carried out a

total of 150 plants with Criollo characteristic toward Trinidad. This event could

Page 33: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

33

be considered as trigger genetic origin of the relationship between Ecuador's

and Nicaraguan's accession.

In addition, PCA allowed identification of duplicated samples and relation

between Criollo 13 and Yucatan samples (Figure 4). The samples duplicated

were found in Matagalpa and Chinandega (Mat0402 and Chi0301a) and

Pacifico sur (H5 and H6). Also, most Trinitarios group was finding in axis 3,

which were included most breeders accessions that were classified as

Trinitario. In this axis was predominated farmers accession from RAAS.

The PCA also indicated that most samples, genetically related with

Criollo 13 were collected in pacific Sur (SJC0108, Catarina0108 and MAS0402).

Those samples had morphologic characteristic similar to the samples described

by Chessman (1944) and Hart 1893 (in Bartley, 2002) for cacao Criollo.

Additionally, it was found that Nicaraguan farmers accessions (RAAS0406,

RAAS0410. RAAS0414 and Menier0108) were genetically related to Yucatan,

that relationship would indicate that Nicaraguan indigenous was related to

Mexican indigenous, especially indigenous for the groups located in the Pacific

Coast, Nicaragua.

From the PCA included in axis 3 (Figure 4), it can be seen that most

breeders accession (Trinitario), as well as 25% of all farmers accessions were

genetically related to breeder accessions. this result could be caused by the

high promotion of cacao and renovations of old plantations for new breeders

plantations with specific characteristic as diseases resistance to Black pod

(Phytophthora spp) and Frosty pod (Moniliophthora roreri) and high yield.

Page 34: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

34

6. Conclusions

The present study has revealed the existence of a genetic diversity of 60

Nicaraguan farmers accessions based on nine microsatellites. This study

provides a basic knowledge about genetic structure of cacao farmers in

Nicaragua and differences between cacao accessions from different area when

were collected those samples. The values of the genetic parameter obtained in

this study, mainly the values of heterozygosis indicate that Nicaraguan farmers

accessions are inbreed with high genetic diversity. Also, the allelic richness

suggests the presence of potential unique and rare alleles.

The analysis molecular of variance (AMOVA) was highly significant and

showed a low variance among groups but high variance within groups. Based

on this study (cluster and PCA analyses), it has been found that some cacao

farmers accession are genetically related with Criollos genotypes, but most

farmers accessions have Trinitario characteristic as consequence of promoting

improved materials. Also, was found a few farmers accessions and Ecuadorian

accession with the same genetic origin. Those results would suggest a good

possibility for selection and improvement of cacao. Those promising accession

should be used in a breeding program.

In the future, will be necessary collect and increase the collections of

farmers accessions and the molecular data generated will be correlated with

morphologic characterization.

Page 35: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

35

7. Acknowledgements

To Finnish Government –specially Finnish Embassy in Nicaragua and Foreigner

Ministry of Finland-

To University of Helsinki –specially Department of Applied Biology-

To NIFAPRO Staff – Jari Valkonen, Helena Korpelainen, Paula Elomaa, Aldo

Rojas-

INTAs’ Authorities and all cacaos Workers -present and not presents- that have

contributed and rescued the knowledge about cacao.

To 71 years of foundation of Experimental Center “EL Recreo” and 30 of cacao

research

To CATIE –Costa Rica- for facilities the Criollo genotypes

Page 36: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

36

8. List of references

Alves, R.M., Sebbenn, A.M., Artero, A.S., Clement, C. & Figueira, A. 2007. High

levels of genetic divergence and inbreeding in populations of cupsuassu

(Theobroma grandiflorum). Tree Genetics & Genomes 3: 289-298.

Anderson, J.A., Churchill, G.A., Autrique, J.E., Tanksley, S.D. & Sorrells, M.E.

1992. Optimizing parental selection for genetic linkage maps. Genome 36:

181-186.

Bartley B.G.D & Cope F.W (1973). Practical aspects of self incompatibility in

Theobroma cacao L. In: Moav R (ed). Agricultural Genetics Selected

Topics. Wiley and Sons: New- York, pp 109–134.

Bartley BGD, Cope FW (1973). Practical aspects of self-incompatibility in

Theobroma cacao L. In: Moav R (ed). Agricultural Genetics—Selected

Topics. Wiley and Sons: New-York, pp 109–134.

Bergmann, J. F. 1969. The distribution of cacao cultivation in pre-Columbian

America. Annals of the Association of American Geographers 5:185–96.

Botstein, D., White, R. L., Skalnick, M. H., & Davies, R. W. 1980. Construction

of a genetic linkage map in man using restriction fragment length

polymorphism, American Journal of Human Genetics 32: 314-331.

Cheesman, E.E. 1944. Notes on the nomenclature, classification and possible

relationships of cocoa groups. Tropical Agriculture, 21: 144–159.

Ciferri, R., & F. Ciferri. 1957. The evolution of cultivated cacao. Evolution:

International Journal of Organic Evolution 9:381–397

Cryer, N.C., Fenn, M.G.E., Turnbull. C.J. & Wilkinson, M.J. 2006. Allelic size

standards and reference genotypes to unify international cocoa (Theobroma

cacao L.) microsatellite data. Genetic Resources and Crop Evolution

53:1643-1652.

Page 37: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

37

Davey, M.R., Perry, M.D., Power, J.B., Lowe, K.C., J Blich, H.F., Roach, P.S. &

Jones, C. 1998. DNA isolation and AFLPTM genetic fingerprinting of

Theobroma cacao (L). Plant Molecular Biology Reporter 16: 49-59.

De la Cruz, M., Whitkus, R., Gomez-Pompa, A., & Mota-Bravo, L. 1995. Origins

of cacao cultivation. Nature 375: 542-543.

Dias, L.A.S. 2001.Genetic Improvement of Cacao (Melhoramento Genético Do

Cacaueiro) (ed. Dias LAS), FUNAPE-UFG, Brazil. Translated into English

by Abreu-Richart CE (http://ecoport.org)

Di Betetti, M.S. 2000. The distribution of grooming among female primates:

testing hypotheses with Shannon-Wiener diversity index. Behaviour 137

(11): 1517-1540.

Doyle, J. F., & Doyle, J. L.1990. Isolation of plant DNA from fresh tissue. Focus

12: 13–15.

Efombagn, M.I.B., Sounigo, O., Nyasse, S., Manzanares-Dauleux, M., Cilas, C.,

Eskes, M.A.B. & Kolesnikova-Allen. 2006. Genetic diversity in cocoa

germplasm of southern Cameroon revealed by simple sequences repeat

(SSRS) markers. African Journal of Biotechnology 5 (16): 1441-1449.

Excoffier, L. G. Laval, & S. Schneider. 2005. Arlequin ver. 3.0: An integrated

software package for population genetics data analysis. Evolutionary

Bioinformatics Online 1:47-50.

Excoffier, L., Smouse, P, & Quattro, J. 1992. Analysis of Molecular Variance

Inferred from Metric Distances among DNA Haplotypes: Application to

Human Mitochondrial DNA Restriction Data. Genetics 131 (2): 479-491.

FAO. 2009. FAOstat. http://faostat.fao.org/site/567/default.aspx#ancor. Vienna,

Austria: Food and Agriculture Organization of the United Nation; Visited

30.07.2009.

Page 38: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

38

Fossati, T., Zapelli, I., Bisoffi, S., Micheletti, A., Vietto, L., Sala, F., & Lastiglione,

S. 2005. Genetic relationships and clonal identity in a collection of

commercially relevant poplar cultivars assessed by AFLP and SSR. Tree

Genetics & Genomes 1: 11-19.

Garcia, A.A.F., Benchimol, L.L., Barbosa, A.M.M., Geraldi, I.O., Souza Jr, C.L &

De Souza P. 2004. Comparisson of RAPD, RFLP, AFLP, and SSR markers

for diversity studies in tropical maize inbred lines. Genetic Molecular Biology

27: 579-588.

Guo X, & Elston RC. 1999. Linkage Information Content of Polymorphic Genetic

Markers. Human Heredity 49:112-118.

INTA. 2006. El cacao: Riqueza potencial de la tierra nica a la espera de ser

explotada comercialmente en los mercados internacionales. INTA.

Managua, Nicaragua. 7 pp.

Jin, L., Macaubas, C., Hallmayer, J., Kimura, A. & Mignot, E. 1996. Mutation

rate varies among alleles at a microsatellite locus: phylogenetic evidence.

Proceedings of the National Academy of Sciences of the United of States of

America 93: 15285-15288.

Kimura, M., & J. F. Crow. 1964. The number of alleles that can be maintained in

a finite population. Genetics 49:725–738.

Kochhar, S.L. 1986. Tropical crops. A textbook of economic botany. 1st edition.

Macmillan publisher Ltd. 467 pp

Lachenaud, P. & Zhang, D. 2008. Genetic diversity and population structure in

wild stands of cacao trees (Theobroma cacao L.) in French Guiana. Annals

of forest science 65: 310.

Page 39: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

39

Lanaud, C., Risterucci, A. M., Pieretti, I., Falque, M,. Bouet A, & Lagoda PJL.

1999. Isolation and characterization of microsatellites in Theobroma cacao

L. Molecular Ecology. 8: 2141–2152.

Lanaud, C., Risterucci, A.M., N’Goran, A.K.J., Clement, D., Flament, M.H.,

Laurent, V. & Falque, M. 1995. A genetic linkage map of Theobroma cacao

L. Theoretical and Applied Genetics 91: 987-993.

Lass, T. 2004. Balancing cocoa production and consumption. In: Flood J,

Murphy R (eds) Cocoa futures. A source book on some important issues

facing the cocoa industry. Cabi FEDERACAFE. USDA. Chinchina´.

Colombia. pp 8–15.

Laurent V, Risterucci AM, Lanaud C.1994. Genetic diversity in cacao revealed

by cDNA probes. Theoretical and Applied Genetics 88:193-198.

Lerceteau, E., Robert, T., Petiard, V. & Crouzillat, D. 1997. Evaluation of the

extent of genetic variability among Theobroma cacao accessions using

RAPD and RFLP markers. Theoretical and Applied Genetics 95: 10-19.

Levene, H. 1949. On a matching problem arising in genetics. The Annals of

Mathematical Statistics 20:91–94.

Lewontin, R.C. 1972. The apportionment of human diversity. In: Dobzhansky T,

Hecht MK, Steere WC, editors. Evolutionary Biology 6: 381–398.

Madell, S. 2008. Where is the true home of cocoa?.

http://www.chocolatereview.com.au/cocoa_history.ChocolateReview.

Visited 13.07.09.

Marita, J.M., Nienhuis, J., Pires, J.L. & Aitken, W.M. 2001. Analysis of genetic

diversity in Theobroma cacao with emphasis on Witches’ broom disease

resitance. Crop Science 41: 1305-1316.

Page 40: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

40

McNeil, C. 2006. Chocolate in Mesoamerica. A cultural history of cacao, 1th

edition. University Press of Florida.542 pp.

Mohammadi, S. A. & Prasanna, B. M. 2003. Analysis of Genetic Diversity in

Crop Plants--Salient Statistical Tools and Considerations. Crop Science 43:

1235-1248.

Motamayor, J.C., Ristericci, A.M., Heath, M. & Lanaud, C. 2003. Cacao

domestication II: Progenitor Germplasm of the Trinitario cacao cultivar.

Heredity 91: 322-330.

Motamayor, J.C., Risterucci, A.M., Lopez, P.A., Ortiz, C.F., Moreno, A., &

Lanaud, C. 2002. Cacao domestication I: The origin of the cacao cultivated

by the mayas. Heredity 89: 380-386.

Mouzeyar, S., Roeckel-Drevet, P., Gentzbittel, J.P., Tourvieille De Labrouhe,

D.T. & Nicolas, P. 1995. RFLP and RAPD mapping of sunflower pl1 locus

for resistance to plasmopara halstedii race 1. Theoretical and Applied

Genetics 91: 733-737.

N’Goran, J.A.K., Laurent, V., Risterucci, A.M. & Lanaud, C. 1994. Comparative

genetic diversity studies of Theobroma cacao L. using RFLP and RAPD

markers. Heredity 73: 589-597.

Nagaoka, T. & Ogihara, Y. 1997. Applicability of inter-simple sequence repeat

polymorphism in Wheat for use as DNA markers in comparison to RFLP and

RAPD markers. Theoretical and Applied Genetics 94: 597-602.

Nei, M. 1973. Analysis of gene diversity in subdivided groups. Proceedings of

the National Academy of Sciences, USA 70: 3321–3323.

Nei, M. 1978. Estimation of average heterozygosity and genetic distance from a

small number of individuals. Genetics 89: 583–590.

Page 41: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

41

Nei, M. 1987. Molecular Evolutionary Genetics. Columbia University Press, New

York.

Nei, M., & CHESSER, R.K. 1983. Estimation of fixation indices and gene

diversities. Annals of Human Genetics 47: 253–259.

Ogata, N. Gomez-Pompa., & Tabue, K.A. 2006. Chapter 3: The domestication

and distribution of Theobroma cacao L. in the Neotropics. In: Chocolate in

Mesoamerica: A cultural history of cacao. (Ed. by McNeil, C). University of

press of Florida.

Page, R.D.M. 1996. TREEVIEW: An application to display phylogenetic trees on

personal computers. Computer Applications in the Biosciences 12: 357-358.

Peakall, R., & Smouse, P.E., 2006. GENALEX 6: genetic analysis in Excel.

Population genetic software for teaching and research. Molecular Ecology

Notes 6: 288-295.

Risterucci, A.M., Eskes, B., Fargeas, D., Motamayor, J.C. & Lanaud, C. 2000.

Use of microsatellite markers for germplasm identity analysis in cocoa. In:

Proceedings of the international workshop on the new technologies and

cocoa breeding. INGENIC Newsletter: 25-33.

Risterucci, A.M., Grivet, L., N’Goran, J.A.K., Pieretti, I., Flament, M.H. &

Lanaud, C. 2000. A high-density linkage map of Theobroma cacao L.

Theoretical and Applied Genetics 101: 948-955.

Saunders, J.M., Mischke, S., Leamy, E.A. & Hemeida, A.A. 2004. Selection of

international molecular standards for DNA fingerprinting of Theobroma

cacao. . Theoretical and Applied Genetics 110: 41-47.

Sereno, M.L., Alburquerquer, P.S.B., Vencovsky, R. & Figueira, A. 2006.

Genetic diversity and natural population structure of cacao (Theobroma

cacao L.) from the Brazilian Amazon evaluated by microsatellite markers.

Conservation Genetics 7: 13-24.

Page 42: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

42

Shannon, CE. & Weaver, W. 1949. The mathematical theory of communication.

University of Illinois Press, Urbana.

Souza, C.A.S. & Dias, L.A.S. 2001. Chapter 1: Environment improvement and

socio-economy. In: Genetic improvement of cacao. (ed. Dias LAS), FUNAPE-

UFG, Brazil. Translated into English by Abreu-Richart CE (http://ecoport.org).

Stainbrebber, L. 2006.Chapter 12: Cacao in greater Nicoya. Ethnohistory and

unique tradition. In: Chocolate in Mesoamerica: A cultural history of cacao.

(Ed. by McNeil, C). University of press of Florida.

Takrama, J.F., Cervantes-Martinez, C., Brown, J.S., Motamayor, J.C. & Schnelf.

2005. Determination of off-types in cocoa breeding programme using

microsatellite. Ingenic Newslatter 10: 2-8.

Thormann, C.E., Ferreira, M.E., Camargo, L.E.A., Tivang, J.G. & Osborn, T.C.

1994. Comparison of RFLP and RAPD markers to estimating genetic

relationships within and among cruciferous species. Theoretical and Applied

Genetics 88: 973-980.

Thuillet, A.C., Bru, D., David, J., Roumet, P., Santoni, S., Sourdille, P. &

Bataillon, T. 2002. Direct estimation of mutation rate for 10 microsatellite

loci in Durum wheat, Triticum turgidum (L) Thell.ssp durum desf. Molecular

Biology and Evolution 19 (1): 122-125.

WEIR, B. S. 1996. Genetic data analysis II. Sinauer, Sunderland,

Massachusetts, USA.

Weir, BS, & Cockerham, C.C. 1984. Estimating F-Statistics for the analysis of

population structure. Evolution, 38: 1358–1370.

Whitkus, R., De la Cruz, M., Mota-Bravo, A. & Gómez-Pompa, A. 1998. Genetic

diversity and relationships of cacao (Theobroma cacao L.) in southern

Mexico. . Theoretical and Applied Genetics 96: 621-627.

Page 43: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

43

Yeh, F.C. & Boyle, T.J.B. 1997. Population genetic analysis of co-dominant and

dominant markers and quantitative traits. Belgian Journal of Botany 129:

157.

Zhang, D., Arevalo-Gardini, E., Mischke, S., Zuñiga-Cernades, L., Barreto-

Chavez, A. & Adriazola de Aguila, J. 2006. Genetic diversity and structure of

managed and semi-natural populations of cocoa (Theobroma cacao) in the

Huallaga and Ucayali Valleys of Peru. Annals of Botany 98: 647-655.

Zhang, D., Mischke, S., Johnson, E.S., Phillips-Mora, W., Meinhardt, L. 2009.

Molecular characterization of an international cacao collection using

microsatellite markers. Tree Genetics & Genomes 5: 1-10.

Page 44: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

44

9. Appendices

Appendix 1: Allele frequencies of microsatellites loci in groups of Theobroma

cacao L. from Nicaragua

Locus Allele/n RAAS RAAN RSJ P.S Chi I.A B.A MatmTcCIR 1 N 14 3 10 18 12 4 14 3

127 0.107 0.333 0.250 0.333 0.250 0.500 0.179 0.000128 0.357 0.000 0.100 0.083 0.042 0.000 0.107 0.667131 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000133 0.000 0.000 0.000 0.028 0.000 0.000 0.000 0.000134 0.000 0.000 0.000 0.000 0.000 0.000 0.071 0.000135 0.000 0.000 0.000 0.000 0.000 0.125 0.000 0.000139 0.036 0.000 0.000 0.028 0.000 0.000 0.000 0.000140 0.107 0.667 0.250 0.111 0.208 0.250 0.179 0.000141 0.357 0.000 0.400 0.389 0.500 0.125 0.464 0.167150 0.000 0.000 0.000 0.028 0.000 0.000 0.000 0.167

mTcCIR 7 N 14 3 10 18 12 4 14 3149 0.000 0.000 0.000 0.000 0.000 0.000 0.036 0.000150 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000153 0.000 0.000 0.000 0.000 0.000 0.000 0.036 0.000155 0.071 0.333 0.100 0.167 0.042 0.125 0.036 0.000156 0.321 0.500 0.700 0.389 0.375 0.000 0.393 0.667157 0.071 0.000 0.000 0.222 0.250 0.250 0.000 0.333158 0.286 0.000 0.000 0.000 0.000 0.250 0.000 0.000159 0.000 0.167 0.050 0.167 0.000 0.000 0.250 0.000160 0.250 0.000 0.000 0.000 0.125 0.000 0.071 0.000161 0.000 0.000 0.150 0.028 0.208 0.000 0.107 0.000162 0.000 0.000 0.000 0.000 0.000 0.000 0.036 0.000163 0.000 0.000 0.000 0.028 0.000 0.125 0.036 0.000

mTcCIR8 N 14 3 10 17 12 4 14 3250 0.000 0.000 0.000 0.029 0.000 0.000 0.000 0.000251 0.107 0.000 0.000 0.000 0.000 0.000 0.000 0.000257 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000270 0.000 0.167 0.000 0.000 0.000 0.000 0.000 0.000286 0.071 0.000 0.000 0.000 0.000 0.000 0.000 0.000288 0.071 0.000 0.000 0.176 0.000 0.000 0.000 0.000289 0.393 0.333 0.600 0.324 0.708 0.250 0.393 0.833291 0.107 0.000 0.050 0.029 0.000 0.000 0.214 0.000292 0.071 0.000 0.050 0.000 0.000 0.000 0.000 0.000293 0.036 0.000 0.000 0.088 0.000 0.000 0.214 0.000294 0.000 0.000 0.000 0.000 0.042 0.000 0.000 0.000295 0.000 0.000 0.050 0.000 0.000 0.000 0.000 0.000

Page 45: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

45

Appendix 1: Continue…

Locus Allele/n RAAS RAAN RSJ P.S Chi I.A B.A MatmTcCIR8 N 14 3 10 17 12 4 14 3

297 0.000 0.000 0.100 0.000 0.000 0.000 0.143 0.000299 0.000 0.333 0.000 0.000 0.000 0.250 0.000 0.000300 0.000 0.000 0.000 0.088 0.000 0.250 0.000 0.000302 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000303 0.000 0.000 0.000 0.059 0.000 0.000 0.000 0.000304 0.036 0.000 0.000 0.029 0.000 0.000 0.000 0.000305 0.000 0.167 0.150 0.118 0.208 0.000 0.036 0.167307 0.071 0.000 0.000 0.000 0.000 0.000 0.000 0.000308 0.000 0.000 0.000 0.000 0.042 0.000 0.000 0.000389 0.000 0.000 0.000 0.059 0.000 0.000 0.000 0.000

mTcCIR 10 N 14 3 10 18 12 3 14 3194 0.000 0.000 0.000 0.000 0.000 0.000 0.036 0.000212 0.000 0.333 0.000 0.000 0.000 0.000 0.000 0.000213 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000214 0.071 0.167 0.000 0.000 0.250 0.000 0.107 0.000215 0.071 0.000 0.050 0.000 0.000 0.000 0.036 0.000216 0.000 0.000 0.050 0.083 0.000 0.000 0.036 0.000217 0.000 0.000 0.050 0.000 0.000 0.000 0.036 0.000218 0.393 0.000 0.200 0.306 0.250 0.000 0.071 1,000219 0.036 0.500 0.000 0.000 0.042 0.000 0.036 0.000220 0.179 0.000 0.050 0.278 0.167 0.000 0.214 0.000221 0.000 0.000 0.250 0.056 0.083 1,000 0.107 0.000222 0.000 0.000 0.050 0.000 0.000 0.000 0.036 0.000225 0.071 0.000 0.000 0.000 0.000 0.000 0.036 0.000226 0.000 0.000 0.100 0.000 0.000 0.000 0.000 0.000228 0.071 0.000 0.150 0.167 0.125 0.000 0.214 0.000229 0.071 0.000 0.000 0.056 0.083 0.000 0.036 0.000230 0.000 0.000 0.050 0.000 0.000 0.000 0.000 0.000303 0.000 0.000 0.000 0.056 0.000 0.000 0.000 0.000

mTcCIR11 N 14 3 10 18 12 4 14 3196 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000197 0.036 0.000 0.100 0.000 0.000 0.000 0.429 0.000198 0.000 0.000 0.000 0.000 0.000 0.000 0.071 0.000200 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000202 0.000 0.000 0.050 0.000 0.000 0.000 0.000 0.000203 0.000 0.333 0.000 0.000 0.000 0.000 0.036 0.000205 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000209 0.107 0.333 0.050 0.028 0.000 0.000 0.000 0.000

Page 46: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

46

Appendix 1: Continue...

Locus Allele/n RAAS RAAN RSJ P.S Chi I.A B.A MatmTcCIR11 N 14 3 10 18 12 4 14 3

210 0.250 0.167 0.250 0.361 0.083 0.250 0.071 0.167211 0.000 0.000 0.050 0.000 0.000 0.000 0.000 0.000212 0.071 0.000 0.000 0.000 0.000 0.000 0.036 0.000216 0.000 0.000 0.000 0.028 0.042 0.125 0.071 0.000217 0.000 0.000 0.100 0.083 0.000 0.000 0.000 0.000219 0.000 0.167 0.000 0.000 0.000 0.000 0.000 0.000220 0.000 0.000 0.000 0.000 0.083 0.000 0.000 0.000221 0.000 0.000 0.000 0.028 0.000 0.000 0.000 0.000223 0.179 0.000 0.000 0.028 0.000 0.000 0.000 0.167224 0.000 0.000 0.000 0.056 0.000 0.000 0.000 0.000225 0.214 0.000 0.400 0.389 0.792 0.375 0.286 0.667295 0.071 0.000 0.000 0.000 0.000 0.000 0.000 0.000

mTcCIR 12 N 14 3 10 18 12 4 14 3163 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000189 0.179 0.000 0.200 0.306 0.375 0.000 0.036 0.333197 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000202 0.036 0.000 0.050 0.111 0.208 0.000 0.143 0.000204 0.000 0.000 0.000 0.028 0.000 0.125 0.036 0.000205 0.000 0.167 0.050 0.000 0.000 0.000 0.000 0.000206 0.000 0.000 0.050 0.028 0.000 0.000 0.107 0.000207 0.000 0.333 0.000 0.000 0.000 0.000 0.000 0.000210 0.000 0.333 0.000 0.000 0.000 0.000 0.000 0.000213 0.250 0.000 0.000 0.056 0.125 0.125 0.000 0.000214 0.000 0.000 0.000 0.000 0.000 0.125 0.000 0.000215 0.536 0.167 0.650 0.472 0.292 0.000 0.643 0.333223 0.000 0.000 0.000 0.000 0.000 0.125 0.000 0.000224 0.000 0.000 0.000 0.000 0.000 0.000 0.036 0.000238 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.333

mTcCIR 15 N 14 3 10 17 12 4 14 3214 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.167234 0.214 0.000 0.000 0.000 0.042 0.000 0.000 0.000235 0.071 0.167 0.000 0.059 0.083 0.000 0.107 0.000236 0.071 0.000 0.000 0.000 0.000 0.125 0.000 0.333237 0.000 0.000 0.300 0.206 0.125 0.500 0.107 0.167239 0.036 0.000 0.000 0.029 0.000 0.000 0.036 0.000242 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.167245 0.036 0.000 0.000 0.029 0.042 0.000 0.071 0.000

Page 47: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

47

Appendix 1: continue…

Locus Allele/n RAAS RAAN RSJ P.S Chi I.A B.A MatmTcCIR 15 N 14 3 10 17 12 4 14 3

246 0.000 0.000 0.100 0.000 0.000 0.000 0.000 0.000248 0.036 0.000 0.000 0.000 0.000 0.000 0.000 0.000250 0.000 0.000 0.000 0.059 0.125 0.250 0.000 0.000251 0.107 0.000 0.000 0.059 0.042 0.000 0.107 0.000252 0.143 0.000 0.100 0.000 0.083 0.000 0.036 0.000253 0.000 0.000 0.000 0.118 0.208 0.000 0.107 0.000254 0.143 0.333 0.000 0.176 0.083 0.125 0.000 0.000255 0.071 0.333 0.200 0.265 0.125 0.000 0.321 0.000256 0.036 0.000 0.150 0.000 0.042 0.000 0.036 0.167257 0.036 0.000 0.000 0.000 0.000 0.000 0.036 0.000258 0.000 0.000 0.100 0.000 0.000 0.000 0.000 0.000259 0.000 0.167 0.050 0.000 0.000 0.000 0.036 0.000

mTcCIR18 N 13 3 10 18 12 3 14 3192 0.038 0.000 0.000 0.000 0.000 0.000 0.000 0.000199 0.000 0.000 0.100 0.111 0.000 0.333 0.000 0.167200 0.231 0.000 0.150 0.167 0.000 0.333 0.107 0.167201 0.000 0.000 0.000 0.083 0.042 0.000 0.000 0.000202 0.038 0.167 0.100 0.028 0.000 0.167 0.179 0.000203 0.038 0.000 0.000 0.000 0.000 0.000 0.036 0.000204 0.038 0.000 0.050 0.000 0.042 0.167 0.286 0.000205 0.077 0.000 0.000 0.000 0.000 0.000 0.000 0.000207 0.000 0.000 0.000 0.056 0.000 0.000 0.000 0.000209 0.000 0.333 0.000 0.000 0.000 0.000 0.000 0.000210 0.077 0.000 0.000 0.000 0.000 0.000 0.000 0.000212 0.154 0.000 0.050 0.111 0.292 0.000 0.179 0.000213 0.077 0.000 0.000 0.167 0.000 0.000 0.000 0.167214 0.077 0.500 0.550 0.278 0.625 0.000 0.214 0.500227 0.038 0.000 0.000 0.000 0.000 0.000 0.000 0.000236 0.038 0.000 0.000 0.000 0.000 0.000 0.000 0.000243 0.038 0.000 0.000 0.000 0.000 0.000 0.000 0.000255 0.038 0.000 0.000 0.000 0.000 0.000 0.000 0.000

mTcCIR 21 N 14 3 10 17 12 4 14 3142 0.000 0.000 0.100 0.000 0.000 0.000 0.071 0.000143 0.000 0.000 0.000 0.000 0.042 0.000 0.000 0.000144 0.107 0.167 0.000 0.029 0.083 0.125 0.036 0.000145 0.071 0.000 0.000 0.000 0.042 0.000 0.250 0.000147 0.000 0.000 0.000 0.000 0.000 0.250 0.000 0.000150 0.000 0.000 0.000 0.029 0.000 0.000 0.000 0.000

Page 48: Genetic characterization of Theobroma cacao L. in Nicaragua Erwin Manuel Aragon Obando

48

Appendix 1: Continue...

Locus Allele/n RAAS RAAN RSJ P.S Chi I.A B.A MatmTcCIR21

N 14 3 10 17 12 4 14 3

151 0.000 0.000 0.000 0.029 0.083 0.000 0.000 0.000153 0.071 0.000 0.100 0.000 0.000 0.250 0.000 0.000154 0.143 0.000 0.100 0.118 0.000 0.000 0.000 0.000155 0.286 0.500 0.350 0.500 0.542 0.250 0.393 0.667157 0.000 0.000 0.150 0.000 0.000 0.000 0.000 0.000158 0.000 0.000 0.000 0.000 0.000 0.000 0.071 0.000159 0.179 0.167 0.000 0.000 0.000 0.000 0.071 0.000160 0.000 0.000 0.000 0.029 0.000 0.000 0.000 0.000164 0.143 0.167 0.150 0.206 0.125 0.000 0.000 0.333165 0.000 0.000 0.000 0.000 0.083 0.000 0.000 0.000169 0.000 0.000 0.000 0.029 0.000 0.000 0.107 0.000173 0.000 0.000 0.050 0.000 0.000 0.000 0.000 0.000180 0.000 0.000 0.000 0.029 0.000 0.000 0.000 0.000181 0.000 0.000 0.000 0.000 0.000 0.125 0.000 0.000