Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 ·...

44
Accepted Article Advances in genomics for the improvement of quality in Coffee Hue T.M. Tran ab , L. Slade Lee c , Agnelo Furtado a , Heather Smyth a , Robert Henry a* a Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, Australia b Western Highlands Agriculture & Forestry Science Institute (WASI), Vietnam c Southern Cross University, Australia * Correspondence to: Robert Henry, Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, St Lucia QLD 4072, Australia, Tel. +61 7 3346 0551, Email: [email protected] This article has been accepted for publication and undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process, which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1002/ jsfa.7692 This article is protected by copyright. All rights reserved.

Transcript of Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 ·...

Page 1: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eAdvances in genomics for the improvement of quality in Coffee

Hue T.M. Tranab, L. Slade Leec, Agnelo Furtadoa, Heather Smytha, Robert Henrya*

a Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of

Queensland, Australia

b Western Highlands Agriculture & Forestry Science Institute (WASI), Vietnam

c Southern Cross University, Australia

* Correspondence to: Robert Henry, Queensland Alliance for Agriculture and Food

Innovation (QAAFI), The University of Queensland, St Lucia QLD 4072, Australia, Tel.

+61 7 3346 0551, Email: [email protected]

This article has been accepted for publication and undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process, which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1002/ jsfa.7692

This article is protected by copyright. All rights reserved.

Page 2: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eAbstract

Coffee is an important crop that provides a livelihood to millions of people living in

developing countries. Production of genotypes with improved coffee quality attributes

is a primary target of coffee genetic improvement programs. Advances in genomics are

providing new tools for analysis of coffee quality at the molecular level. The recent

report of a genomic sequence for robusta coffee, Coffea canephora, is a major

development. However, a reference genome sequence for the genetically more

complex arabica coffee (C. arabica) will also be required to fully define the molecular

determinants controlling quality in coffee produced from this high quality coffee

species. Genes responsible for control of the levels of the major biochemical

components in the coffee bean that are known to be important in determining coffee

quality can now be identified by association analysis. However, the narrow genetic

base of arabica coffee suggests that genomics analysis of the wild relatives of coffee

(Coffea spp.) may be required to find the phenotypic diversity required for effective

association genetic analysis. The genomic resources available for the study of coffee

quality are described and the potential for the application of next generation

sequencing and association genetic analysis to advance coffee quality research are

explored.

Key words: Coffee quality, genetics, genomics, biochemical compounds, next

generation sequencing, association studies.

This article is protected by copyright. All rights reserved.

Page 3: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eGENERAL INTRODUCTION

Coffee is an important crop and the second most traded commodity in the world (after

petroleum) providing a living to more than 125 million people 1. Coffee belongs to the

Rubiaceae family and the Coffeeae tribe and consists of more than 124 species spread

across two genera Coffea L. and Psilanthus Hook.f, each of which in turn consist of two

sub-genera, Coffea and Baracoffea (J.-F. Leroy) J.-F. Leroy, and Psilanthus and

Afrocoffea (Moens) summarized by Anthony et al. 2 respectively. The grouping

of Coffea and Psilanthus genera has been examined in several studies 3 4 5 6 7.

Commercial coffee production is dominated by only two species belonging to the

Coffea genus: C. arabica and C. canephora (the latter generally referred to as robusta

coffee). All coffee species are diploid (2n=2x=22) and generally self-incompatible,

except for C. arabica which is a self-fertile tetraploid (2n=4x=44) derived from a

spontaneous hybridization between C. canephora (as paternal progenitor) and C.

eugenioides (as maternal progenitor) 8 9.

Although C. arabica is considered to have better cupping quality than C. canephora,

improving the quality of both commercial species remains a target for most coffee

improvement programs. With advances in genomic and sequencing technology, it is

feasible to understand the coffee genome and the molecular inheritance underlying

coffee quality, thereby helping improve the efficiency of breeding programs.

This review will discuss current knowledge regarding the genetics of those biochemical

compounds which are considered quality determinants, for use as a foundation for

improving coffee quality by breeding. Available genomic resources for the study of the

genetics of biochemical compounds that are likely to be playing a role in coffee flavour

will also be discussed. In addition, the potential value of the study of genetics of coffee

This article is protected by copyright. All rights reserved.

Page 4: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

equality using next generation sequencing and association genetic analysis will be

considered.

BIOCHEMICAL CONTROL OF COFFEE QUALITY AND ITS VARIATION

Coffee quality is assessed by evaluating both the physical attributes of the coffee beans

and the organoleptic properties of the coffee. Physical quality reflects moisture content,

defects (e.g. sticks, stones, damaged beans and black beans ), bean size and bean colour;

while organoleptic quality reflects aroma, taste, flavour, body (a feeling of the heaviness or

richness on the tongue), acidity and the preference of tasters 10. A range of biochemical

compounds in coffee beans are important contributors to the quality of the coffee in the

cup.

Biochemical compounds influencing the organoleptic quality of coffee beans include

non-volatile and volatile components. Important non-volatile components of the bean

include carbohydrates and fibre (sucrose, reducing sugars, cell wall polysaccharides,

lignin), nitrogenous compounds (protein, free amino acids, caffeine, trigonelline), lipids

(coffee oil, diterpene esters), minerals (potassium and phosphorus), acids and esters

(total chlorogenic acids, aliphatic acids and quinic acid) 11. All of these biochemical

compounds are considered to play a role in roasting chemistry 12. Proteins and amino

acids, for example, react with reducing sugars to produce aroma precursors.

Chlorogenic acids (CGAs) and caffeine, are responsible for bitterness 13. Among these

components, sucrose, caffeine, trigonelline, lipids and CGAs are major biochemical

compounds that contribute to the flavour of the beverage after the roasting of the

beans 11. While sucrose, trigonelline and lipid have a positive correlation with coffee

cup quality, caffeine and some sub classes of CGAs have a negative correlation with

coffee cup quality (often referred to simply as ‘cup’). There are about three hundred

volatiles detected so far in green coffee bean 14 of which twenty one volatiles,

This article is protected by copyright. All rights reserved.

Page 5: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eincluding 3-isobutyl-2-methoxypyrazine and 2-methoxy-3,5-dimethylpyrazine, have

been identified as key compounds for coffee cup 14. In the roasted coffee bean, there

are more than a thousand volatiles which can be grouped into twenty key aroma

compound-groups 15. The aroma of the brew is different from that of roasted coffee.

Several notes in roasted coffee become more intensive in the brew such as caramel and

phenolic odour. This change in aroma profile of coffee is not caused by the formation of

new odorants, but by a shift in the concentrations of existing ones as summarised by

Grosch 16.

Coffee flavour and its formation is extremely complex; however, numerous studies on

coffee flavour and its constituents have been reported. How the volatile and non-

volatile compounds contribute to coffee quality and relationships between sensory

properties and the composition of coffee has been thoroughly reviewed by Flament 12

and more recently by Sunarharum et al.17. The reactions involved in the formation of

coffee aroma and its mechanisms have been reviewed by Buffo & Cardelli-Freire 18.

However, the change in bean composition, especially in aroma profile, from green to

roasted bean is complicated and not all formation pathways are fully understood

under coffee roasting conditions 19. The use of green bean or roasted bean in the study

of coffee quality genetics therefore should be considered carefully since aroma in

roasted bean and brew is of customer's interest, while the green bean attributes are

more directly affected by genetics. This is a challenge in the study of genetics of bean

composition, that is, ascribing links to coffee quality. However, with the progress in

functional genomic approaches, the identification of molecular determinants of coffee

quality characteristics is feasible and it is possible to select coffee varieties with

superior beverage quality 20.

This article is protected by copyright. All rights reserved.

Page 6: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eThe bean chemical composition and organoleptic characteristics are significantly

different between species and to a lesser extend within species 10. A number of studies

on the variation in the biochemical composition of the bean are summarised in Table 1

21-24.

Fatty acid and CGAs were used as indicators to differentiate arabica varieties 25 26.

Similarly, Tessema et al. 27 also found that the quality traits (organoleptic quality) and

biochemical constituents (sucrose, fat, crude protein and minerals) were diverse in the

arabica germplasm collections from Ethiopia. This is an encouraging result for the

study of variation of quality traits in C. arabica for association mapping. In general,

compounds that have a positive correlation with quality such as sucrose, trigonelline

and lipids are at a higher concentration in C. arabica than in C. canephora and some

other Coffea species.

Significant differences in flavour between different coffee types have also been reported.

Robusta coffee beans have a bitter, full bodied taste, but low acidity while those of arabica

coffee are more aromatic with more perceptible acidity but less body. Within the robusta

coffees, Moschetto et al 28 found important differences in cup quality between the two

main genetic groups studied (Congolese and Guinean), for preference, aroma, acidity,

body and bitterness. Within the arabica coffees, different varieties can be associated with

specific flavour profiles. For example, genotype SL28 produces a milder brew than Kent,

genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while

genotype Liberica yields coffee that is bitter and without finesse 29.

The composition of the beans that determines the properties and quality of coffee in

the cup depends not only on genetic factors (species, varieties) but also on non-genetic

factors such as the cultivation conditions (soil, air temperature, altitude, sun exposure

This article is protected by copyright. All rights reserved.

Page 7: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eand rainfall), plant condition (plant age, bean maturity), cultivation practices (shade

management, fertilisation and irrigation) and harvest and processing methods. In

general, volcanic soils, low temperature, high altitude and growing under shade have

positive correlation to coffee compositions which relates to coffee quality 30-34. Post-

harvest techniques, roasting and the preparation of the beverage also have a strong

influence on coffee quality 30 35. Understanding the influence of each factor and their

interaction would help with defining realistic breeding strategies. Since there have

been many reports presented in detail on these issues 26 33 29 36 37 38 20, especially the

most recent reviews of Joet et al. 39 on the relationships between coffee quality, bean

chemical composition and environmental effects, or Santos et al. 40 on climate change

impacts on bean quality, or Sunarharum et al. 17 on the impacts of processing, roasting

and preparation methods to aroma profile, they are not reviewed in this paper.

Another factor that may influence coffee quality is epigenetic effects. Epigenetics is

modified gene expression not caused by alteration in the gene sequence or DNA code.

This change is due to factors that control how and when each gene is expressed.

Epigenetics is extremely complex and is not yet fully understood. Although there have

been several studies on factors affecting coffee quality, there has not been research on

epigenetics and coffee quality. Only a very few studies have reported on epigenetics in

relation to size and shape of the tree 41, and epigenetic mechanisms (especially in

phenolic compounds) in somatic embryo regulation in arabica coffee species 42 43.

Being an allotetraploid species formed by a recent natural hybridization between two

diploid ancestors, C. canephora and C. eugenioides, and which may have gone through

a single allopolyploidization event 44, arabica may have genetic redundancy allowing

for sequence divergence resulting in development of functional variations between

duplicated genes (homoelogues). Proteins with different properties may be encoded

This article is protected by copyright. All rights reserved.

Page 8: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eby homoelogues leading to new genetic diversity in breeding populations. However,

homoelogues are also subject to extensive epigenetic control and are therefore not

always equally expressed 45. Landey et al. 43confirmed the very limited genetic and

epigenetic alternations in somatic embryogenesis-derived plants during coffee somatic

embryogenesis while embryogenic suspensions have a high risk of genome and

epigenome instability. However, Vidal et al. 46 found homoeologous genes displaying

differential expression for arabica using Single Nucleotide Polymorphisms (SNPs) from

Expressed Sequence Tags (ESTs) and gene expression from transcript redundancy

between C. arabica and C. canephora. The divergence of function in homoeologous

genes may result in phenotypic novelty with useful agronomic traits. Homoeologous

gene expression in C. arabica grown in two contrasted conditions was also used to

compare with those of its progenitors and to examine its ability to adapt to

environmental variations 47. Recently, Lashermes et al. 44 confirmed that the genomic

rearrangements involving homoeologous exchanges as well as gene silencing,

elimination and conversion appear to have occurred in C. arabica and to be a major

source of genetic diversity. This on one hand will make a potent new source of genetic

diversity for different traits, but on the other hand make it difficult to conduct

association genetics in coffee.

GENETICS OF BIOCHEMICAL COMPOUNDS KNOWN TO BE RELATED TO COFFEE

QUALITY

Improving both yield and quality is the target for coffee breeding programs. In a study

by Montagnon et al 48, genetic correlations between coffee yield and quality

determinants and cup tasting components measured in two groups of C. canephora

were not significant, indicating that the coffee quality is not negatively associated

with yield in C. canephora. Understanding the genetic inheritance of quality traits can

This article is protected by copyright. All rights reserved.

Page 9: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

ebenefit breeding programs, but there appears little research that has focussed on

understanding the genetic control of key biochemical compounds that relate to coffee

beverage quality. Known inheritance mode and heritability values of the key

biochemical compounds determining coffee quality in intraspecific and interspecific

hybrids are detailed in Table 2 48-56.

Male/female additive inheritance in sucrose and trigonelline was contrasted in two

studies using intraspecific hybrids 51, 56. The heritability measured in interspecific

hybrids was different from that of intraspecific hybrids due to the differences among

species in relation to their fixation during evolution of specific alleles of some genes

relating to the variability of quality components. As a result, the genetic variation in

quality at the interspecific level is not similar to that in intraspecific hybrids 10. The high

narrow-sense heritability of caffeine and fat content in intraspecific hybrids implies that

these traits can be improved at the intraspecific level. The negative correlation between

fat content and sucrose content indicates that care should be taken in selecting for a

combination of fat and sucrose content 48. Both major and minor genes with additive

gene effects appear to be involved in the variation of caffeine content in the seeds 57

implying the complexity of this trait.

The inheritance of coffee cup quality was also reported by Leroy et al 51 where for

bitterness of robusta coffee, female additive effects were predominant, while for

acidity, only a male additive effect was observed 51. This information will assist the

breeding (e.g. parent selection) for robusta coffee quality improvement. The

inheritance of quality characters in introgressed lines, which resulted during arabica

improvement to introduce the disease resistant genes from Timor hybrid to arabica,

was mentioned by Bertrand et al. 58 31 and also reviewed by Leroy et al. 10.

This article is protected by copyright. All rights reserved.

Page 10: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eIn summary, highly heritable traits like fat or caffeine content could be improved

effectively by crossing between parents of different favourable values. For traits with

lower heritability like trigonelline, CGAs or sucrose, Marker Assisted Selection (MAS)

would be necessary 4. With of the advances in sequencing technology and marker

development coupled with trait phenotyping, it should be possible to easily identify

trait-linked markers for use in MAS.

For coffee genetic improvement, most of the past and recent research has focused on

the improvement of canephora quality due its known lower quality compared to

arabica. The focus of arabica improvement is more on transfer of disease resistance

from Timor hybrid 10 or climate change adaptation 59; quality improvement is the

second priority in arabica as its quality is considered superior. However, this does not

mean that arabica quality improvement is unnecessary. Quality determinants are not

the same between arabica and canephora. For canephora, apart from caffeine

reduction, the bean composition of galactomannans 60or other compounds (other

polysaccharides) which influence the extraction during the industrial process of

lyophilisation (i.e. instant coffee processing) is probably the most important target.

Fresh brewing is popular for arabica, rather than instant coffee processing – although it

may be applied to arabica in the future. For arabica, the increasing consumer demand

for speciality coffee has intensified research aimed at pinpointing small differences in

quality factors as well as to quantify these organoleptic features 61. For more detail on

the trend and strategies of canephora and arabica improvement breeding, there are

excellent reviews by Leroy et al. 10 and Vossen et al. 20.

This article is protected by copyright. All rights reserved.

Page 11: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eCOFFEE GENOMIC RESOURCES

Coffee genetic diversity in C. arabica and C. canephora

A number of works on the assessment of arabica diversity have been done with

differing results. In general, among three main types of material (cultivar/varieties;

accessions/introgression/hybrids; and spontaneous/sub-spontaneous) almost all

studies show a very low genetic variation amongst arabica cultivars using different

marker systems (RAPD, AFLP, SSR, ISSR, SRAP, TRAP) and coffee collected from

different regions (Brazil, Yemen, Indian, Australia and Vietnam, Tanzania, Hawaii) 62-69.

The second group (accessions/introgression/hybrids) shows low to significant

variation, more diverse in the introgressed lines 70 27 71-74. The last group, in which one

may expect to see the most diversity, has diversity ranging from low 75 72 76 to

moderate 77 to high 78. The correlation of genetics with geography/origin clearly shows

in some studies 77-79 but not in the others 75 72 70. In the most recent work presented in

the World Coffee Research annual report 59, genetic diversity assessment of 800

arabica coffee accessions from the arabica collection at CATIE, Costa Rica shows the

least genetic diversity of this species compared to other major crops. This study also

found that cultivated coffee varieties contain approximately 45% of the genetic

diversity found in the 800 aforementioned accessions (while only 2% of tomato

diversity found in cultivated tomato) indicating the limitation of variability for breeding

programs. This also is problematic for association mapping work in arabica for

breeding improvement, and thus the selection of appropriate germplasm for

association mapping is very critical. Therefore it is essential to understand the

population structure and its genetic diversity.

In contrast to C. arabica, C. canephora possess a high genetic diversity due to its origin,

reproduction method and dissemination 80, and it is structured in two main groups

This article is protected by copyright. All rights reserved.

Page 12: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e(Guinean and Congolese) with six subgroups 81-83. The linkage disequilibrium assessed

in C. canephora breeding populations was variable, indicating the possibility of

applying association studies within C. canephora species and using it to improve C.

arabica due to its parental relationship to this species 84. Understanding the genetic

diversity of the two dominant species is critical for any genetic improvement scheme

for any traits in coffee.

Coffee whole genome and synteny

The genome size of coffee has been estimated to be 1.3 Gb for C. arabica 85 and 710Mb

for C. canephora 86. Using flow cytometry, the DNA content of arabica was estimated as

2.47 pg (equivalent to 1.2 Gb) and canephora as 1.43 pg (equivalent to 700 Mb), while

other diploid coffee species varied from 0.96 pg (469 Mb) (C. mauritiana) to 1.84 pg (900

Mb) (C. humilis) 87 88. Despite its economic importance, the first high-quality draft genome

of robusta coffee has just been completed and reported in 2014 86. To generate a whole

genome sequence of C. canephora, Roche 454 single and mate-pair reads and Sanger

BAC-end reads were used, representing 29.5X coverage of the genome size (710 Mb).

In addition, Illumina sequence data, corresponding to 60X of coverage of the Coffee

genome (710 Mb), was also used to correct sequencing errors and fill gaps. The

resulting genome assembly consists of 25,216 contigs and 13,345 scaffolds with a total

length of 568.6 Mb (Table 3). Using a high-density genetic map, a total of 349 scaffolds

covering approximately 364 Mb were anchored to the 11 C. canephora chromosomes,

among which 139 were both anchored and oriented. Coffee displays the most

conservative chromosomal gene order among asterid angiosperms. Transposable

elements (TEs) in the C. canephora genome were identified and classified accounting

for almost half (49.2%) of the reference genome length. Organelle to nuclear genome

transfers was also analysed with a typical amount of chloroplast-derived fragments

This article is protected by copyright. All rights reserved.

Page 13: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e(2,014 insertions, accounting for 0.16% of the draft genome) and an unusually large

750kb mitochondrion-derived fragment more than 100 Kb longer than those known

from other plant genomes. In total, 25,574 protein-coding gene models were obtained

in which several gene functions involved in secondary compound biosynthesis and

coffee flavour and aroma were characterized. Species-specific gene family expansions

were examined such as N-methyltransferases (NMTs), defence-related genes, and

alkaloid and flavonoid enzymes involved in secondary compound synthesis. This is the

most valuable genomics resource for further genomics study in coffee, especially as all

of the genomics data, transciptomic data, SNP polymorphisms and genotyping data,

gene families and metabolic pathways are publicity available via the Coffee Genome

Hub 89.

The whole chloroplast genome sequence of Coffea arabica L. was first reported by Samson

et al. 90 (Table 3) and this also provides an important reference for studying other species

and helps build up the phylogeny among members of the Coffeeae tribe which is still

questionable. Several research groups are working on an C. arabica genome sequence

using different biological materials and sequencing platforms 91 92 93 94. This genomic

resource is extremely important for the study of genetics and genomics of the high quality

coffee species, C. arabica.

Several studies 95-97 have compared genomes between coffee and other crops. The

most recent study reported that coffee chromosomal regions showed unique one-to-

one correspondences with grape chromosomes, proving the lack of any events

changing ploidy intervening in their separate histories. Coffee and grape genomes

show one-to-three correspondence with the tomato genome 86.

Among coffee species, comparison of genomes showed a high gene synteny between

C. arabica (EaEaCaCa) and C. canephora (CC) and C. arabica (EaEaCaCa), C. eugenioides

This article is protected by copyright. All rights reserved.

Page 14: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e(EE) and C. liberica (LL) 98-100, but numerous chromosomal rearrangements were

detected 98. The two sub-genomes of C. arabica (Ca and Ea) indicated sufficient

sequence differences, and thus obtaining a whole-genome sequence would be an

important resource for the allotetraploid genome of C. arabica. Results also revealed

that C. eugenioides was more closely related to C. arabica than to C. liberica, which

was in agreement with the ancestral history of the allotetraploid C. arabica 100.

Other genomic resources

Molecular markers

As for other crops, determination of molecular predictors for coffee quality traits

would help reduce the length of breeding selection cycles and thus phenotypic

evaluation cost. However, the use of DNA technology in coffee quality improvement is

still in its infancy and therefore limited. Pot et al. 101 used polymorphisms generated

from SNPs, INDELs (insertions and deletions) and SSRs (Simple Sequence Repeats) to

identify the nucleotide diversity of four sucrose metabolism enzymes in C. canephora

genotypes using direct sequencing. The variation of these genes was also analysed

between different Coffea species to allow the identification of more polymorphic sites

using parallel in silico analysis of EST resources 101. AFLP (Amplified Fragment Length

Polymorphism) and SSR markers were used to construct a genetic map of an F2

population between C. arabica and C. canephora (artificial tetraploid). The number of

markers associated with quality traits identified was nineteen for sugar content, eight

for caffeine, eight for CGAs and one for caffeine and CGAs 102. These markers need to

be validated in other genotypes for consistency before they can be used in marker

assisted selection in coffee breeding. Recently, a total of 33,239 SNPs specific to C.

arabica and 87,271 SNPs specific to C. canephora were developed using targeted

genome capture strategies and next-generation sequencing, and were evaluated on 72

This article is protected by copyright. All rights reserved.

Page 15: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

esamples from C. canephora and 72 from C. arabica. These genomic resources will

support for genome assemblies, accelerate breeding of interested traits as well as

manage genetic diversity in coffee species 103.

Bacterial artificial chromosome (BAC) libraries.

Three BAC libraries were developed for each species of C. canephora and C. arabica in

relation to sucrose metabolism 104, genome composition and evolution 105, nematode

and leaf rust resistant 106, bean size and cup quality 107 and mannose-6-phosphate

reductase gene (CaM6PR) 108 (Table 4). These available BAC libraries of Coffea are an

important resource to support genomic studies such as comparative genomics and the

identification of genes and regulatory elements controlling important traits in coffee.

Expressed sequence tags (ESTs)

The development of large EST sequencing projects has supported the identification of

potential genes contributing to quality traits and their interplay with other genes

through essential biochemical pathways. ESTs facilitate the development of whole

transcriptome analysis and the identification of the biosynthetic pathways associated

with the expression of quality and the genes within these pathways 10. The EST

resources available in coffee have been reviewed by Kochko et al. 85 and are

summarised in Table 5.

A total of 246,500 ESTs have been reported with 69,801 for C. canephora, 166,133 for C.

arabica and 10,566 for C. racemosa using different organs of the plant (leaves, flower,

fruits and roots) at different stages of development and maturation 109-115. However, not

all these resources are published or available which limits their use. A number of studies

have used the available resource of ESTs for further research such as microsatellite marker

discovery 116, analysis of transcriptome divergence and analysis of genes involved in the

biosynthesis pathways of lipids and main storage proteins 117, gene structure prediction

This article is protected by copyright. All rights reserved.

Page 16: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eand functional annotation with the detection of a total of 345 pathways in C. arabica and

300 pathways in C. canephora 118. This genomic resource will be very important for further

research on coffee quality improvement.

QTL identification and genetic mapping

The construction of genetic maps is often the first step for molecular dissection of

complex traits (yield components, quality and disease). Genetic maps enable the

mapping of quantitative trait loci (QTLs) controlling these traits and thus provide a

framework for isolation of candidate genes followed by manipulation of genes for yield

increase and quality improvement 119. In many cases, markers linked to QTLs are

directly used in MAS without the need of understanding gene functionality underlying

the QTLs.

Linkage mapping in coffee in general requires more effort and cost than in annual crops

due to its long life cycle, low polymorphism (in the case of C. arabica – with the exception

of Ethiopian cultivars and germplasm), and the absence of a large collection of DNA

markers as well as genomic resources 119. The first genetic maps were constructed in the

diploid C. canephora by Paillard et al. 120 followed by several others constructed for

different coffee populations 51, 56, 96, 102, 120-129 130 (Table 6).

To date, a number of QTLs analyses relating to quality compounds and cup have been

performed on C. canephora and other species, but none has been reported for C.

arabica as indicated in Table 7 50, 51, 56, 131, 132

Co-location of some QTLs and genes involved in different metabolic pathways related

to coffee quality were noted 51. Using such candidate genes in association mapping

may determine the allelic variation of coffee quality 133. The co-localisation between

QTLs relating to organoleptic traits and genes associating with caffeine or CGA

This article is protected by copyright. All rights reserved.

Page 17: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

ebiosynthesis may explain the fact that both CGA and caffeine are involved in conferring

bitterness in the coffee cup 51.

Gene identification

The identification of genes relating to quality is one of the main objectives of several

coffee research groups around the world. Thanks to the concerted efforts on coffee

genomics, a number of coffee candidate genes have been identified and some of them

have been cloned and characterised. These results are useful to the coffee genetics

community, especially those on genes encoding the enzymes of key metabolic

processes. These genes are candidate genes which may control the variability of coffee

quality 10. Genes regulating the main chemical components that are thought to be

involved in the flavour and sensory quality of coffee are listed in Table 8 13, 51, 86, 104, 117,

131, 134-141. Recently, 36,935 unigenes from leaves and fruits of C. eugenioides, one of C.

arabica’s ancestors, were identified by Yuyama et al. 142. A sub-set of these genes

related to sugar metabolism and fruit ripening were examined for their expression.

This will be a valuable resource for molecular and genetics studies of commercial

coffee species. Currently there are works on transcriptomics to understand the

transcriptional regulations during seed development 39 13 or more specifically for

caffeine accumulation 143 and chlorogenic acids 144 or galactomannan 60. These could

facilitate the development of robust genomics/metabolomics fingerprints of coffee

bean quality and ultimately be useful in breeding programs for coffee quality.

Although several genes encoding the biosynthesis of biochemical compounds in coffee

have been identified in C. arabica and C. canephora, there are no genes identified for

trigonelline synthesis and no studies on allelic variation (e.g., SNPs) associated with

low and high levels of the key biochemical compounds which can be utilised in MAS.

However, sequences from the genes that are known can serve as useful references in

This article is protected by copyright. All rights reserved.

Page 18: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

ere-sequencing to detect polymorphisms for genetic mapping of the control of

biochemical compounds in C. arabica.

ASSOCIATION GENOMICS APPROACH USING NEXT GENERATION SEQUENCING (NGS)

AND PERSPECTIVES FOR THE STUDY OF GENETICS OF COFFEE QUALITY

Research on genetics and genomics for coffee, especially in relation to quality is

relatively limited compared to other crops and thus belies its potential and economic

contribution. The reasons could possibly be due to the lack of funding – most coffee

growing regions being developing countries – the complexity of the quality traits and

the limitations of the technology used. More studies have been conducted for robusta

coffee than arabica coffee due to its lower ploidy level and greater genetic diversity.

Previous studies showed considerable genetic variation in compounds defining coffee

quality among coffee species, especially in diploids. In addition, a number of genes

relating to the synthesis of sucrose, caffeine, several sub-groups of CGAs and lipids

have also been identified. These studies provided gene sequences via ESTs that will

facilitate further studies on bean composition relating to coffee quality. However,

there has been no research focusing on allelic variation of these genes in natural

populations except bi-parental mapping of QTLs for trigonelline and CGA contents.

A number of previous studies involving genome-wide association studies (GWAS) using

next generation sequencing (NGS) have used these genetic approaches to detect

candidate genes/QTLs of complex traits such as physical wood properties, resistant genes,

drought tolerance, cold hardiness and timing of bud set summarised by Sexton et al. 145

and successfully applied in various crops including annuals (rice, maize, soybean, wheat) or

perennial crops and forestry trees (pine, cottonwood) summarised by Hall et al. 146. Based

on knowledge of biochemical compounds (and their reference gene sequences) that are

thought to be involved in the flavour and sensory quality, it should be possible to apply

This article is protected by copyright. All rights reserved.

Page 19: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eNGS and GWAS combined with bi-parental QTL mapping to detect valuable haplotypes

from natural populations for use in breeding coffee varieties with better quality.

Application of NGS in plant genomics studies has been extensively reviewed by a

number of authors 147-150. NGS lowers the cost of sequencing and generates a large

amount of data in a single sequencing run, making it feasible to study genetics at the

whole genome level. The steps involved in the NGS strategies (shearing of DNA, ligation,

library preparation) as well as different sequencing platforms (Illumina, Life Technologies,

PacBio) have been described and their advantages and disadvantages have been reviewed

by several authors 151, 152. NGS can be used via an approach of whole genome sequencing or

targeted sequencing.

Two methods of whole genome sequencing can be used including (1) de novo

sequencing - applied for species that do not have pre-existing sequence data, followed

by the use of bioinformatics tools to assemble the sequences and obtain the genomic

map for that species; and (2) re-sequencing performed on individuals of a species that

has already known genome sequence152.

Targeted sequencing includes two approaches, amplicon sequencing and target

enrichment. Amplicon sequencing will sequence amplified regions (PCR amplicons) of

small and selected regions of the genome with lengths of hundreds of base pairs and

with a high depth of coverage to identify common and rare sequence variations.

Sequencing PCR amplicons of sets of candidate genes from DNA bulks will help to

identify the available variation in these genes exploited in a population 148. Target

enrichment is quite similar to amplicon sequencing in terms of using only selected

regions or genes enriched in the library from genomic DNA. However, target

enrichment allows for larger DNA insert sizes and enables a greater amount of total

DNA to be sequenced per sample. For this method, regions of interest in the genome

This article is protected by copyright. All rights reserved.

Page 20: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

ecan be targeted, making it an ideal approach for examining specific gene pathways, or

as a follow-up analysis to GWAS 153.

When making decisions on the approaches of candidate gene or of whole-genome,

one of the most critical factors is the extent of linkage disequilibrium (LD) in the

organism of interest. In genome-wide studies, the extent of LD determines the

mapping resolution that is achieved and the numbers of markers covered 154. Genetic,

biochemical, physiological and phenotypic information must be used to support the

selection of candidate genes 155. Where a developmental or biochemical pathway is

well-understood, candidate gene selection is straightforward; but the researcher is

typically limited to the field of known genes, which presents the risk of overlooking

causal gene mutations not previously identified 146

NGS technology has been used to assemble the whole genome sequence of C.

canephora 86 and to examine genetic change in allopolyploidisation of arabica 44. To

obtain the genomic sequence of C. arabica, as only the draft genome of C. canephora

is available, a de novo approach can be applied. The other approach for obtaining the

genomic sequence of C. arabica is whole genome re-sequencing as the available draft

genomic sequence of C. canephora can be used as reference since it is one of

progenitors of C. arabica. The whole genome sequence will be a key resource of data

to identify gene sequences and SNP markers for most genes including novel and

published genes. EST and BAC sequences can be used as reference sequences to

assemble sequence reads from NGS data to identify genes/alleles and their positions

on the chromosome. In addition, whole-genome re-sequencing and SNP genotyping

data can be used to generate a dense SNP genetic map of the population for QTL

mapping.

This article is protected by copyright. All rights reserved.

Page 21: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eAmplicon sequencing in a larger set of samples could be an appropriate approach to genetic

association study of quality traits, since the biosynthesis pathways, QTLs and candidate

genes of sucrose, caffeine and several subgroups of CGAs are known. This will help to

confirm or validate the candidate genes from the literature and identify the alleles that are

useful for breeding. For those compounds for which candidate genes have never been

identified or very few identified, for example trigonelline and some compounds

belonging to the lipid group, the approach could be to use whole genome sequencing

via NGS to detect SNPs, then using an association genomics approach to detect the link

between SNPs and the traits of interest.

Association studies differ from quantitative trait locus mapping by utilising samples

from genetically diverse populations that contain short stretches of linkage

disequilibrium due to generations of recombination 145. Such studies thus enable

relationships between phenotypic traits and underlying DNA sequence variation to be

understood. Association studies may be suitable for perennial species such as coffee

because they can be carried out on pre-existing populations, in collections or in

selection trials. This approach does not require specific populations created by

controlled crossing (like conventional genetic mapping approaches) which is very

difficult for perennial crops 155. Association or linkage disequilibrium mapping has

become a very popular method for dissecting the genetic basis of complex traits in

plants 146. For the study on volatile and non-volatile compounds in coffee and that are

likely to be playing a role in coffee flavour, once the data on biochemical compounds

and SNPs variation are available, association genomics will be applied to identify key

genes associating coffee quality. The integration of NGS technologies and association

study to analyse complex traits will be a powerful strategy complementing the

traditional method of parent-hybrid map construction 152.

This article is protected by copyright. All rights reserved.

Page 22: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eCONCLUSIONS

While whole genome technology will provide a powerful resource, a good start has

been made with AFLP and SSR markers associated with quality traits identified for

sugar content, caffeine and CGAs in C. arabica. These are already helpful in studies

aimed at genetic improvement of coffee quality, but importantly, they will provide

useful reference points as the genomic approach is developed. Existing coffee BAC

libraries are beneficial in comparative genomics, the identification of genes and

regulatory elements controlling important traits in coffee, and will play an important role

in validating new C. arabica genomic data as it emerges. With the current knowledge of

biochemical compounds that govern coffee bean composition that relates to beverage

quality and available genetic resources, it is possible to apply an association genomics

approach using both whole genome sequencing and targeted sequencing to detect the link

between SNPs and the traits of quality determinants from natural populations for use in

quality improvement via breeding programs. However, caution should be taken when

using this approach for C. arabica since this species has a narrow genetic base, lacks

a reference genome, is a polyploid (4n) and lacks populations from controlled crosses

for analysis of specific traits and the validation of markers or genes once detected. The

aforementioned difficulties can be overcome by the use of wild arabica accessions

from Ethiopia, development of a reference genome sequence for C. arabica, and the

complementary use of the two approaches of targeted re-sequencing and whole

genome re-sequencing in variant discovery respectively. There is no doubt that the

availability of whole genome sequences will provide the greatest stimulus yet to the

understanding of the genetic basis of coffee quality traits, and indeed of many other

aspects of the plant’s biology.

This article is protected by copyright. All rights reserved.

Page 23: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

ACKNOWLEDGEMENTS

This research was supported under Australian Research Council's Linkage Projects

(project number LP130100376). We acknowledge Australia Awards Scholarship (AAS)

and QAAFI/UQ for their ongoing support of this research.

REFERENCES

1. ICO, ICO Annual Review 2011/12, Ed (2013).

2. Anthony F, Bertrand B, Etienne H and Lashermes P, Coffea and Psilanthus, in Wild Crop Relatives: Genomic and Breeding Resources. Springer, pp 41-61 (2011).

3. Maurin O, Davis AP, Chester M, Mvungi EF, Jaufeerally-Fakim Y and Fay MF, Towards a phylogeny for coffea (rubiaceae): Identifying well-supported lineages based on nuclear and plastid DNA sequences. Annals of Botany 100:1565–1583 (2007).

4. Davis AP, Chester M, Maurin O and Fay MF, Searching for the relatives of Coffea (Rubiaceae, Ixoroideae): The circumscription and phylogeny of Coffeeae based on plastid sequence data and morphology. American Journal of Botany 94:313–329 (2007).

5. Tosh J, Davis AP, Dessein S, Block PD, Huysmans S, Fay MF, Smets E and Robbrecht E, Phylogeny of Tricalysia (Rubiaceae) and its relationships with allied genera based on plastid DNA data: Resurrection of the Genus Empogona. Annals of the Missouri Botanical Garden 96:194-213 (2009).

6. Anthony F, Diniz LEC, Combes M-C and Lashermes P, Adaptive radiation in Coffea subgenus Coffea L. (Rubiaceae) in Africa and Madagascar. Plant Systematics and Evolution 285:51–64 (2010).

7. Davis AP, Tosh J, Ruch N and Fay MF, Growing coffee: Psilanthus (Rubiaceae) subsumed on the basis of molecular and morphological data; implications for the size, morphology, distribution and evolutionary history of Coffea. Botanical Journal of the Linnean Society 167:357-377 (2011).

8. Charrier A and Eskes AB, Botany and Genetics of Coffee, in Coffee: Growing, Processing, Sustainable Production - A Guidebook for Growers, Processors, Traders, and Researchers, Ed by Wintgens JN. WILEY-VCH Verlag GmbH & Co. KCaA (2004).

9. Lashermes P, Combes M-C, Robert J, Trouslot P, D'Hont A, Anthony F and Charrier A, Molecular characterisation and origin of the Coffea arabica L. genome. Molecular Genetics and Genomics 261:259-266 (1999).

This article is protected by copyright. All rights reserved.

Page 24: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e10. Leroy T, Ribeyre F, Bertrand B, Charmetant P, Dufour M, Montagnon C, Marraccini P and Pot D, Genetics of coffee quality. Brazilian Journal of Plant Physiology 18:229-242 (2006).

11. Farah A, Coffee as a speciality and functional beverage, in Functional and speciality beverage technology, ed. by Paquin P. Woodhead Publishing in Food Science, Technology and Nutrition, pp 370-395 (2009).

12. Flament I, Coffee flavour chemistry. Chichester, UK (2002).

13. Joët T, Laffargue A, Salmona J, Doulbeau S, Descroix F, Bertrand B, Kochko Ad and Dussert S, Metabolic pathways in tropical dicotyledonous albuminous seeds: Coffea arabica as a case study. New Phytologist 182:146–162 (2009).

14. Holscher H and Steinhart H, Aroma compounds in green coffee, in Food flavours – Generation, Analysis and Process Influence, ed. by Charalambous G. Elsevier Science, Amsterdam, pp 785-803 (1995).

15. Oestreich-Janzen S, Chemistry of coffee, in Comprehensive Natural Products II: Chemistry & Biology, Ed by L. M and Liu H-W. CAFEA GmbH, Hamburg, Germany (2010).

16. Grosch W, Chemistry III: Volatile compounds, in Coffee Recent development, ed. by Clarke RJ and Vitzthum OG. Blackwell Science (2001).

17. Sunarharum WB, Williams DJ and Smyth HE, Review: Complexity of coffee flavor: A compositional and sensory perspective. Food Research International 62:315–325 (2014).

18. Buffo RA and Cardelli-Freire C, Coffee flavour: an overview. Flavour And Fragrance Journal 19:99–104 (2004).

19. Mestdagh F, Wocheslander S, Rodriguez A, Egli A, Davidek T and Blank I, Conversion of green coffee precursors into flavour during roasting of arabica and robusta coffees, in ASIC: The 25th International conference on coffee and science, Ed, Colombia (2014).

20. Vossen Hvd, Bertrand B and Charrier A, Next generation variety development for sustainable production of arabica coffee (Coffea arabica L.): a review. Euphytica (2015).

21. Campa C, Ballester JF, Doulbeau S, Dussert S, Hamon S and Noirot M, Trigonelline and sucrose diversity in wild Coffea species. Food Chemistry 88:39-43 (2004).

22. Campa C, Doulbeau S, Dussert S, Hamon S and Noirot M, Diversity in bean caffeine content among wild Coffea species: evidence of a discontinuous distribution. Food Chemistry 91:633–637 (2005a).

23. Ky C-L, Louarn J, Dussert S, Guyot B, Hamon H and Noirot M, Caffeine, trigonelline, chlorogenic acids and sucrose diversity in wild Coffea arabica L. and C. canephora P. accessions. Food Chemistry 75:223–230 (2001b).

24. Mazzafera P, Trigonelline in coffee. Phytochemistry 30:2309 -2310 (1991).

This article is protected by copyright. All rights reserved.

Page 25: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e25. Bertrand B, Villarreal D, Laffargue A, Posada H, Lashermes P and Dussert S, Comparison of the effectiveness of fatty acids, chlorogenic acids, and elements for the chemometric discrimination of coffee (Coffea arabica L.) varieties and growing origins. Agricultural and Food chemistry 56:2273–2280 (2008).

26. Villarreal D, Laffargue A, Posada H, Bertrand B, Lashermes P and Dussert S, Genotypic and environmental effects on coffee (Coffea arabica L.) bean fatty acid profile: impact on variety and origin chemometric determination. Journal of Agricultural and Food Chemistry 57:11321-11327 (2009).

27. Tessema A, Alamerew S, Kufa T and Garedew W, Genetic diversity analysis for quality attributes of some promissing Coffea arabica germplasm collections in Southwestern Ethiopia. Journal of Biological Sciences 11:236-244 (2011).

28. Moschetto D, Montagnon C, Guyot B, Perriot JJ, Leroy T and Eskes A, Studies on the effect of genotype on cup quality of Coffea canephora. Tropical Science 36:18‐31 (1996).

29. Wintgens JN, Factors influencing the Quality of Green Coffee, in Coffee: Growing, Processing, Sustainable Production - A Guidebook for Growers, Processors, Traders, and Researchers, Ed by Wintgens JN. WILEY-VCH Verlag GmbH & Co. KCaA (2012).

30. Bertrand B, Vaast P, Alpizar E, Etienne H, Davrieux F and Charmetant P, Comparison of bean biochemical composition and beverage quality of Arabica hybrids involving Sudanese-Ethiopian origins with traditional varieties at various elevations in Central America. Tree Physiology 26:1239-1248 (2006).

31. Bertrand B, Boulanger R, Dussert S, Ribeyre F, Berthiot L, Descroix F and Joet T, Climatic factors directly impact the volatile organic compound fingerprint in green Arabica coffee bean as well as coffee beverage quality. Food Chemistry 135:2575-2583 (2012).

32. Joët T, Laffargue A, Descroix F, Doulbeau S, Bertrand B, kochko Ad and Dussert S, Influence of environmental factors, wet processing and their interactions on the biochemical composition of green Arabica coffee beans. Food Chemistry 118:693-701 (2010).

33. Barbosa JN, Borém FM, Cirillo MÂ, Malta MR, Alvarenga AA and Alves HMR, Coffee quality and its interactions with environmental factors in Minas Gerais, Brazil. Journal of Agricultural Science 4 (2012).

34. Muschler RG, Shade improves coffee quality in a sub-optimal coffee-zone of Costa Rica. Agroforestry systems 51:131-139 (2001).

35. Duarte GS, Pereira AA and Farah A, Chlorogenic acids and other relevant compounds in Brazilian coffees processed by semi-dry and wet post-harvesting methods. Food Chemistry 118:851-855 (2010).

36. Avelino J, Barboza B, Araya JC, Fonseca C, Davrieux F, Guyot B and Cilas C, Effects of slope exposure, altitude and yield on coffee quality in two altitude terroirs of Costa Rica, Orosi and Santa Mar´ıa de Dota. Journal of the Science of Food and Agriculture 85:1869–1876 (2005).

This article is protected by copyright. All rights reserved.

Page 26: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e37. Decazy F, Avelino J, Guyot B, Perriot JJ, Pineda C and Cilas C, Quality of different Honduran coffees in relation to several environments. Journal of Food Science 68:2356-2361 (2003).

38. Borém FM, Ribeiro DR, Taveira JHS, Prado MVB, Ferraz V, Tosta MF, Luz MPS and Cortez RM, Genotype and environment Interaction in chemical composition and sensory quality of natural coffees, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

39. Joët T, Bertrand B and Dussert S, Environmental effects on coffee seed biochemical composition and quality attributes: a genomic perspective, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014b).

40. Santos CAFd, Leitão AE, Pais IP, Lidon FC and Ramalho JC, Perspectives on the potential impacts of climate changes on coffee plant and bean quality. Emir J Food Agric 27:152-163 (2015).

41. Lecolier A, Verdeil J-L, Escoute J, Chrestin H and Noirot M, Laurina mutation affected Coffea arabica tree size and shape mainly through internode dwarfism. Trees 23:1043–1051 (2009).

42. Nic-Can GI and Pena CDl, Chapter 6: Epigenetic Advances on Somatic Embryogenesis of Agronomical and Important Crops, in Epigenetics in plants of agronomic importance: Fundamentals and applications transcriptional regulation and chromatin remodelling in plants, ed. by Alvarez-Venegas R, Peña CDl, Armando J and Casas-Mollano. Springer (2014).

43. Landey RB, Cenci A, Georget F, Bertrand B, Camayo G, Dechamp E, Herrera JC, Santoni S, Lashermes P, Simpson J and Etienne H, High genetic and epigenetic stability in Coffea arabica plants derived from embryogenic suspensions and secondary embryogenesis as revealed by AFLP, MSAP and the phenotypic variation rate. PLOS ONE 8 (2013).

44. Lashermes P, Combes M-C, Hueber Y, Severac D and Dereeper A, Genome rearrangements derived from homoeologous recombination following allopolyploidy speciation in coffee. The Plant Journal 78:674–685 (2014).

45. Bottley A, Chapter 3: Epigenetic variation amongst polyploidy crop species, in Epigenetics in plants of agronomic importance: Fundamentals and applications transcriptional regulation and chromatin remodelling in plants, ed. by Alvarez-Venegas R, Peña CDl, Armando J and Casas-Mollano. Springer (2014).

46. Vidal RO, Mondego JM, Pot D, Ambrosio AB, Andrade AC, Pereira LF, Colombo CA, Vieira LG, Carazzolle MF and Pereira GA, A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica. Plant Physiol 154:1053-1066 (2010).

47. Combes M-C, Dereeper A, Severac D, Bertrand B and Lashermes P, Contribution of subgenomes to the transcriptome and their intertwined regulation in the allopolyploid Coffea arabica grown at contrasted temperatures. New phytologist 200:251–260 (2013).

This article is protected by copyright. All rights reserved.

Page 27: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e48. Montagnon C, Guyot B, Cilas C and Leroy T, Genetic parameters of several biochemical compounds from green coffee, Coffea canephora. Plant Breeding 117:576-578 (1998).

49. Ky C-L, Louarn J, Guyot B, Charrier A, Hamon H and Noirot M, Relations between and inheritance of chlorogenic acid contents in an interspecific cross between Coffea pseudozanguebariae and Coffea liberica var ‘dewevrei’. Theoretical and Applied Genetics 98:628-637 (1999).

50. Ky C-L, Guyot B, Louarn J, Hamon S and Noirot M, Trigonelline inheritance in the interspecific Coffea pseudozanguebariae × C. liberica var. dewevrei cross. Theoretical and Applied Genetics 102:630–634 (2001).

51. Leroy T, Bellis F, Legnate H and Kananura E, Improving the quality of African robustas: QTLs for yield- and quality-related traits in Coffea canephora. Tree Genetics & Genomes 7:781-798 (2011).

52. Mazzafera P and Carvalho A, Breeding for low seed caffeine content of coffee (Coffea L.) by interspecific hybridization. Euphytica 59:55–60 (1992).

53. Barre P, Akaffou S, Louarn J, Charrier A, Hamon S and Noirot M, Inheritance of caffeine and heteroside contents in an interspecific cross between a cultivated coffee species Coffea liberica var dewevrei and a wild species caffeine-free C. pseudozanguebariae. Theoretical and Applied Genetics 96:306-311 (1998).

54. Akaffou DS, Hamon P, Doulbeau S, Keli J, Legnate H, Campa C, Hamon S, Kochko A and Zoro BIA, Inheritance and relationship between key agronomic and quality traits in an interspecific cross between Coffea pseudozanguebariae Bridson and C. canephora Pierre. Tree Genetics & Genomes 8:1149-1162 (2012).

55. Ky CL, Doulbeau S, Akkaffou S, Charrier A, Hamon S, Louarn J and Noirot M, Inheritance of coffee bean sucrose content in the interspecific cross Coffea pseudozanguebariae × Coffea liberica dewevrei. Plant Breeding 119:165-168 (2000b).

56. Mérot-L’Anthoëne V, Mangin B, Lefebvre-Pautigny F, Jasson S, Rigoreau M, Husson J, Lambot C and Crouzillat D, Comparison of three QTL detection models on biochemical, sensory, and yield characters in Coffea canephora. Tree Genetics & Genomes (2014).

57. Priolli RHM, Paulo, Siqueira WJ, Möller M, Zucchi MI, Ramos LCS, Gallo PB and Colombo CA, Caffeine inheritance in interspecific hybrids of Coffea arabica x Coffea canephora (Gentianales, Rubiaceae). Genetics and Molecular Biology 31:498-504 (2008).

58. Bertrand B, Guyot B, Anthony F and Lashermes P, Impact of the Coffea canephora gene introgression on beverage quality of C. arabica. Theor Appl Genet 107:387–394 (2003).

59. WCR, Assessment of Genetic Diversity in Coffea arabica. World Coffee Research 2014 Annual Report (2014).

This article is protected by copyright. All rights reserved.

Page 28: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e60. Joët T, Laffargue A, Salmona J, Doulbeau S, Descroix F, Bertrand B, Lashermes P and Dussert S, Regulation of galactomannan biosynthesis in coffee seeds. Journal of Experimental Botany 65:323–337 (2014a).

61. Dias EC, Paiva LC, Leal RS, Pimenta C and Kieckbusch TG, Effect of processing and roasting conditions on the quality and chemical composition of coffee produced from different regions in Brazil, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

62. Motta LB, Soares TCB, Ferrão MAG, Caixeta ET, Lorenzoni RM and Neto JDdS, Molecular characterization of arabica and conilon coffee plants genotypes by SSR and ISSR markers. Brazilian archives of biology and technology 57:728-735 (2014).

63. Al-Murish TM, Elshafei AA, Al-Doss AA and Barakat MN, Genetic diversity of coffee (Coffea arabica L.) in Yemen via SRAP, TRAP and SSR markers. Journal of Food, Agriculture & Environment 11:411-416 (2013).

64. Mishra MK, Sandhyarani N, Suresh N, Kumar SS, Soumya PR, Yashodha MH, Bhat A and Jayarama, Genetic diversity among Indian coffee cultivars determined via molecular markers. Journal of Crop Improvement 26:727–750 (2012b).

65. Vieira ESN, Von Pinho EVD, Carvalho MGG, Esselink DG and Vosman B, Development of microsatellite markers for identifying Brazilian Coffea arabica varieties. Genetics and Molecular Biology 33:507-U120 (2010).

66. Tran HT, Genetic variation in cultivated coffee (Coffea arabica L.) accessions in northern New South Wales, Australia, Ed. Southern Cross University, NSW, Australia (2005).

67. Maluf MP, Silvestrini M, Ruggiero LMdC, Filho OG and Colombo CA, Genetic diversity of cultivated coffea arabica inbred lines assessed by RAPD, AFLP and SSR marker systems. Sci Agric (Piracicaba, Braz) 62:366-373 (2005).

68. Masumbuko LI, Bryngelsson T, Mneney EE and Salomon B, Genetic diversity in Tanzanian Arabica coffee using Random Amplified Polymorphic DNA (RAPD) markers. Hereditas 139:56–63 (2003).

69. Steiger DL, Nagai C, Morden PHMCW, Osgood RV and Ming R, AFLP analysis of genetic diversity within and among Coffea arabica cultivars. Theor Appl Genet 105:209–215 (2002).

70. Geleta M, Herrera I, Monzon A and Bryngelsson TV, Article ID 939820, Genetic diversity of arabica coffee (Coffea arabica L.) in Nicaragua as estimated by Simple Sequence Repeat Markers. The Scientific World Journal 2012 (2012).

71. Teressa A, Crouzillat D, Petiard V and Brouhan P, Genetic diversity of Arabica coffee (Coffea arabica L.) collections. EJAST 1:63-79 (2010).

72. Dessalegn Y, Herselman L and Labuschagne MT, AFLP analysis among Ethiopian arabica coffee genotypes. African Journal of Biotechnology 7:3193-3199 (2008b).

This article is protected by copyright. All rights reserved.

Page 29: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e73. Dessalegn Y, Herselman L and Labuschagne M, Comparison of SSR and AFLP analysis for genetic diversity assessment of Ethiopian arabica coffee genotypes. S Afr J Plant Soil 26:119-125 (2009).

74. Lashermes P, Andrzejewski S, Bertrand B, Combes MC, Dussert S, Graziosi G, Trouslot P and Anthony F, Molecular analysis of introgressive breeding in coffee (Coffea arabica L.). Theor Appl Genet 100:139–146 (2000).

75. Anthony F, Bertrand B, Quiros O, Wilches A, Lashermes P, Berthaud J and Charrier A, Genetic diversity of wild coffee (Coffea arabica L.) using molecular markers. Euphytica 118:53–65 (2001).

76. Aerts R, Berecha G, Gijbels P, Hundera K, Glabeke SV, Vandepitte K, Muys B, Roldan-Ruiz I and Honnay O, Genetic variation and risks of introgression in the wild Coffea arabica gene pool in South-Western Ethiopian montane rainforests. Evolutionary Applications:243-252 (2012).

77. Aga E, Bryngelsson T, Bekele E and Salomon B, Genetic diversity of forest arabica coffee (Coffea arabica L.) in Ethiopia as revealed by Random Amplified Polymorphic DNA (RAPD) analysis. Hereditas 138:36–46 (2003).

78. Silvestrini M, Junqueira MG, Favarin AC, Guerreiro-Filho O, Maluf MP, Silvarolla MB and Colombo CA, Genetic diversity and structure of Ethiopian, Yemen and Brazilian Coffea arabica L. accessions using microsatellites markers. Genet Resour Crop Evol 54:1367–1379 (2007).

79. López-Gartner G, Cortina H, McCouch SR and Moncada MDP, Analysis of genetic structure in a sample of coffee (Coffea arabica L.) using fluorescent SSR markers. Tree Genetics & Genomes 5:435-446 (2009).

80. Ferrao LFV, Caixeta ET, Souza FdF, Zambolim EM, Cruz CD, Zambolim L and Sakiyama NS, Comparative study of different molecular markers for classifying and establishing genetic relationships in Coffea canephora. Plant Syst Evol 299:225–238 (2013).

81. Leroy T, Bellis FD, Legnate H, Musoli P, Kalonji A, Solorzano RGL and Cubry P, Developing core collections to optimize the management and the exploitation of diversity of the coffee Coffea canephora. Genetica 142:185–199 (2014).

82. Cubry P, Bellis FD, Pot D, Musoli P and Leroy T, Global analysis of Coffea canephora Pierre ex Froehner (Rubiaceae) from the Guineo-Congolese region reveals impacts from climatic refuges and migration effects. Genet Resour Crop Evol 60:483–501 (2013a).

83. Musoli P, Cubry P, Aluka P, Billot C, Dufour M, Bellis FD, Bleysse P, D. , Charrier A and Leroy T, Genetic differentiation of wild and cultivated populations: diversity of Coffea canephora Pierre in Uganda. Genome 52:634-646 (2009).

84. Cubry P, de Bellis F, Avia K, Bouchet S, Pot D, Dufour M, Legnate H and Leroy T, An initial assessment of linkage disequilibrium (LD) in coffee trees: LD patterns in groups of Coffea canephora Pierre using microsatellite analysis. BMC genomics 14:10 (2013b).

This article is protected by copyright. All rights reserved.

Page 30: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e85. Kochko AD, Akaffou S, Andrade AC, Campa C, Crouzillat D, Guyot R, Hamon P, Ming R, Mueller LA, Poncet V, Tranchant-Dubreuil C and Hamon S, Advances in Coffea Genomics Advances in Botanical Research 53:23-63 (2010).

86. Denoeud F, Carretero-Paulet L, Dereeper A and Droc G, The coffee genome provides insight into the convergent evolution of caffeine biosynthesis. Science 345:1181-1184 (2014).

87. Noirot M, Poncet V, Barre P, Hamon P, Hamon S and Kochko AD, Genome size variations in diploid African Coffea species. Annals of Botany 92:709-714 (2003).

88. Razafinarivo NJ, Rakotomalala J-J, Brown SC, Bourge M, Hamon S, Kochko A, Poncet V, Dubreuil-Tranchant C, Couturon E, Guyot R and Hamon P, Geographical gradients in the genome size variation of wild coffee trees (Coffea) native to Africa and Indian Ocean islands. Tree Genetics & Genomes 8:1345-1358 (2012).

89. Dereeper A, Bocs S, Rouard M, Guignon V, Ravel S, Tranchant-Dubreuil C, Poncet V, Garsmeur O, Lashermes P and Droc G, The coffee genome hub: a resource for coffee genomes. Nucleic Acids Research (2014).

90. Samson N, Bausher MG, Lee S-B, Jansen RK and Daniell H, The complete nucleotide sequence of the coffee (Coffea arabica L.) chloroplast genome: organization and implications for biotechnology and phylogenetic relationships amongst angiosperms. Plant Biotechnology Journal 5:339–353 (2007).

91. Mueller L, Strickler S, Somingues D, Pereira L, Andrade A, Marraccini P, Ming R, Wai J, Albert V, Giuliano G, Descombes P, Moine D, Guyot R, Poncet V, Hamon P, Hamon S, Tranchant C, De kochko A, Lepelley M, Rigoreau M and Crouzillat D, Towards a better understanding of the Coffea arabica genome structure, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

92. Morgante M, Scalabrin S, Scaglione D, Cattonaro F, Magni F, Jurman I, Cerutti M, Liverani FS, Navarini L, Terra LD, Pellegrino G, Graziosi G, Vitulo N and Valle G, Progress report on the sequencing and assembly of the allotetraploid Coffea arabica var. Bourbon genome, in Plant and Animal Genome XXIII, Ed, San Diego, CA (2015).

93. Kochko Ad, Crouzillat D, Rigoreau M, Lepelley M, Bellanger L, l'Anthoene VM, Vandecasteele C, Guyot R, Poncet V, Tranchant-Dubreuil C, Hamon P, Hamon S, Couturon E, Descombes P, Moine D, Mueller L, Strickler SR, Andrade A, Luiz-Filipe, Marraccini P, Giuliano G, Fiore A, Pietrella M, Aprea G, Ming R, Wai J, Domingues DS, Paschoal A, Kuhn G, Korlach J, Chin J, Sankoff D, Zheng C and Albert VA, Dihaploid Coffea arabica genome sequencing and assembly, in Plant and Animal Genomes XXIII, Ed, San Diego, CA (2015).

94. Yepes M, Gaitan A, Cristancho MA, Rivera LF, Correa JC, Maldonado CE, Gongora CE, Villegas AM, Posada H, Zimin A, Yorke JA, Mockaitis K and Aldwinckle H, Building high quality reference genome assemblies using PacBio long reads for the allotetraploid Coffea arabica and its diploid ancestral maternal species Coffea eugenioides, in Plant and Animal Genome XXIV, Ed, San Diego, CA (2016).

95. Mahe L, Combes MC and Lashermes P, Comparison between a coffee single copy chromosomal region and Arabidopsis duplicated counterparts evidenced high level

This article is protected by copyright. All rights reserved.

Page 31: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

esynteny between the coffee genome and the ancestral Arabidopsis genome. Plant Molecular Biology 64:699-711 (2007).

96. Lefebvre-Pautigny F, Wu F, Philippot M, Rigoreau M, Priyono, Zouine M, Frasse P, Bouzayen M, Broun P, Pétiard V, Tanksley SD and Crouzillat D, High resolution synteny maps allowing direct comparisons between the coffee and tomato genomes. Tree Genetics & Genomes 6:565-577 (2010).

97. Guyot R, Lefebvre-Pautigny F, Tranchant-Dubreuil C, Rigoreau M, Hamon P, Leroy T, Hamon S, Poncet V, Crouzillat D and de Kochko A, Ancestral synteny shared between distantly-related plant species from the asterid (Coffea canephora and Solanum Sp.) and rosid (Vitis vinifera) clades. BMC Genomics 13:103 (2012).

98. Yu Q, Guyot R, Kochko Ad, Byers A, rez RN-P, Langston BJ, Dubreuil-Tranchant C, Paterson AH, Poncet Vr, Nagai C and Ming R, Micro-collinearity and genome evolution in the vicinity of an ethylene receptor gene of cultivated diploid and allotetraploid coffee species (Coffea). The Plant Journal 67:305–317 (2011).

99. Cenci A, Combes M-C and Lashermes P, Genome evolution in diploid and tetraploid Coffea species as revealed by comparative analysis of orthologous genome segments. Plant Molecular Biology 78:135–145 (2012).

100. Herrera J-C, Romero J-V, Camayo G-C, Caetano C-M and Cortina H-A, Evidence of intergenomic relationships in triploid hybrids of coffee (Coffea sp.) as revealed by meiotic behavior and genomic in situ hybridization. Tropical Plant Biology 5:207-217 (2012).

101. Pot D, Bouchet S, Marraccini P, De bellis F, Cubry P, Jourdan I, Pereira LFP, Vieira LGE, Ferreira LP, Musoli P, Legnate H and Leroy T, Nucleotide diversity of genes involved in sucrose metabolism. Towards the identification of candidates genes controlling sucrose variability in Coffea sp., in International conference on coffee science - ASIC, Ed, Montpellier, France (2007).

102. Priolli RHG, Ramos LCS, Pot D and Moller M, Construction of a genetic map based on an interspecific F2 population between Coffea arabica and Coffea canephora and its usefulness for quality related traits, in 22nd International Conference on Coffee Science, ASIC 2008, Ed, Campinas, SP, Brazil, pp 882-890 (2009).

103. Resende M, Caixeta E, Alkimin ER, Sousa TV, Resende MDV, Chamala S and Neves LG, High-throughput targeted genotyping of Coffea arabica and Coffea canephora using next generation sequencing, in Plant and Animal Genome XXIV, Ed, San Diego, CA (2016).

104. Leroy T, Marraccini P, Dufour M, Montagnon C, Lashermes P, Sabau X, Ferreira LP, Jourdan I, Pot D, Andrade AC, Glaszmann JC, Vieira LG and Piffanelli P, Construction and characterization of a Coffea canephora BAC library to study the organization of sucrose biosynthesis genes. Theoretical and Applied Genetics 111:1032-1041 (2005).

105. Dereeper A, Guyot R, Tranchant-Dubreuil C and Anthony Fo, BAC-end sequences analysis provides first insights into coffee (Coffea canephora P.) genome composition and evolution. Plant Molecular Biology 83:177-189 (2013).

This article is protected by copyright. All rights reserved.

Page 32: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e106. Noir S, Patheyron S, Combes M-C, Lashermes P and Chalhoub B, Construction and characterisation of a BAC library for genome analysis of the allotetraploid coffee species (Coffea arabica L.). Theoretical and Applied Genetics 109:225-230 (2004).

107. Jones MR, Byers A, Skelton RL, Yu Q, Nagai C and Moore PH, Construction of an arabica coffee BAC library for molecular dissection of an allotetraploid genome., in 21st International Conference on Coffee Science (ASIC), Ed, Montpellier, France, p 49 (2006).

108. Cacao SMB, Silva NV, Domingues DS and Vieira LGE, Construction and characterization of a BAC library from the Coffea arabica genotype Timor Hybrid CIFC 832/2. Genetica 141:217–226 (2013).

109. Fernandez D, Santos P, Agostini C, Bon M and Petito A, Coffee (Coffea arabica L.) genes early expressed during infection by the rust fungus (Hemileia vastatrix). Molecular Genetics and Genomics 5:527–536 (2004).

110. Lin C, Mueller LA, Carthy JM, Crouzillat D, Pétiard V and Tanksley S, Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts. Theoretical and Applied Genetics 112:114-130 (2005).

111. Vieira LGE, Andrade AC and Colombo CA, Brazilian coffee genome project: an EST-based genomic resource. Brazilian Journal of Plant Physiology 18:95-108 (2006).

112. Montoya G, Vuong H, Cristancho M, Moncada P and Yepes M, Sequence analysis from leaves, flowers and fruits of Coffea arabica var. Caturra, in 21st International Conference on Coffee Science (ASIC), Ed, Montpellier, France (2006).

113. De Nardi B, Dreos R, Del Terra L and Martellossi C, Differential responses of Coffea arabica L. leaves and roots to chemically induced systemic acquired resistance. Genome 49:1594–1605 (2006).

114. Poncet V, Rondeau M, Tranchant C, Cayrel A, Hamon S, de Kochko A and Hamon P, SSR mining in coffee tree EST databases: potential use of EST–SSRs as markers for the Coffea genus. Molecular Genetics and Genomics 436–449 (2006).

115. Salmona J, Dussert S, Descroix F, de Kochko A, Bertrand B and Joet T, Deciphering transcriptional networks that govern Coffea arabica seed development using combined cDNA array and real-time RT-PCR approaches. Plant Molecular Biology 66:105-124 (2008).

116. Aggarwal R, Hendre P, Varshney R, Bhat P, Krishnakumar V and Singh L, Identification, characterization and utilization of EST-derived genetic microsatellite markers for genome analyses of coffee and related species. Theoretical and Applied Genetics 114:359–372 (2007).

117. Privat I, Bardil A, Gomez AB and Severac D, The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits. BMC Genomics 12:5 (2011).

118. Mondego JM, Vidal RO, Carazzolle MF and Tokuda EK, An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora. BMC Plant Biology 11:30 (2011).

This article is protected by copyright. All rights reserved.

Page 33: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e119. Vega FE, Ebert AW and Ming R, Coffee germplasm resources, genomics, and breeding, in Plant Breeding Reviews, Ed by Janick J. John Wiley & Sons, Inc. (2008).

120. Paillard M, Lashermes P and Pétiard V, Construction of a molecular linkage map in coffee. Theoretical and Applied Genetics 93:41-47 (1996).

121. Pearl H, Nagai C, Moore P, Steiger D, Osgood R and Ming R, Construction of a genetic map for arabica coffee. Theoretical and Applied Genetics 108:829-835 (2004).

122. Nagai C, Jones MR, Byers AE, Adamski DJ and Ming R, Development and characterization of a true F2 population for genetic and QTL mapping in Arabica, in 21st International Conference on Coffee Science Ed, Montpellier, France, pp 771-777 (2007).

123. Lashermes P, Combes MC, Prakash NS, Trouslot P, Lorieux M and Charrier A, Genetic linkage map of Coffea canephora: effect of segregation distortion and analysis of recombination rate in male and female meioses. Genome 44:589-595 (2001).

124. Coulibaly I, Revol B, Noirot M, Poncet V, Lorieux M, Carasco-Lacombe C, Minier J, Dufour M and Hamon P, AFLP and SSR polymorphism in a Coffea interspecific backcross progeny [(C. heterocalyx × C. canephora) × C. canephora]. Theoretical and Applied Genetics 107:1148-1155 (2003).

125. Crouzillat D, Rigoreau M, Bellanger L, Priyono P, Mawardi S, Syahrudi, McCarthy J, Tanksley S, Zaenudin I and Pétiard V, A Robusta consensus genetic map using RFLP and microsatellite markers for the detection of QTL, in 20th International Conference on Coffee Science (ASIC 2004), Ed, Bangalore, India, pp 546-553 (2005).

126. Amidou ND, Michel N, Serge H and Valérie P, Genetic basis of species differentiation between Coffea liberica Hiern and C. canephora Pierre: Analysis of an interspecific cross. Genetic Resources and Crop Evolution 54:1011-1021 (2007).

127. Ky C-L, Barre P, Lorieux M, Trouslot P, Akaffou S, Louarn J, Charrier A, Hamon S and Noirot M, Interspecific genetic linkage map, segregation distortion and genetic conversion in coffee (Coffea sp.). Theoretical and Applied Genetics 101:669-676 (2000).

128. Gartner GAL, McCouch SR and Moncada MDP, A genetic map of an interspecific diploid pseudo testcross population of coffee. Euphytica 192:305-323 (2013).

129. Teixeira-Cabral TA, Sakiyama NS, Zambolim L, Pereira AA and Schuster I, Single-locus inheritance and partial linkage map of Coffea arabica L. Crop Breeding and Applied Biotechnology 4:416-421 (2004).

130. Moncada P, Tovar E, Montoya JC, Gonzalez A, Spindel J and McCouch S, A genetic of linkage map of coffee (Coffea arabica L) and QTL for yield, plant height and bean size, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

131. Campa C, Noirot M, Bourgeois M, Pervent M, Ky C, Chrestin H, Hamon S and de Kochko A, Genetic mapping of a caffeoyl-coenzyme A 3-0-methyltransferase gene

This article is protected by copyright. All rights reserved.

Page 34: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

ein coffee trees. Impact on chlorogenic acid content. Theoretical and Applied Genetics 107:751–756 (2003).

132. Ky C-L, Barre P and Noirot M, Genetic investigations on the caffeine and chlorogenic acid relationship in an interspecific cross between Coffea liberica dewevrei and C. pseudozanguebariae. Tree Genetics & Genomes 9:1043-1049 (2013).

133. Hendre PS and Aggarwal RK, DNA markers: Development and application for genetic improvement of coffee, in Genomics Assisted Crop Improvement: Vol 2: Genomics Applications in Crops, Ed by Varshney RK and Tuberosa R. Springer, pp 399–434 (2007).

134. Geromel C, Ferreira LP, Guerreiro SMC, Cavalari AA, Pot D, Pereira LFP, Leroy T, Vieira LGE and Marraccini PMaP, Biochemical and genomic analysis of sucrose metabolism during coffee (Coffea arabica) fruit development. Journal of Experimental Botany 57:3243–3258 (2006).

135. Privat I, Foucrier S, Prins A, Epalle T, Eychenne M, Kandalaft L, Caillet V, Lin C, Tanksley S and Foyer C, Differential regulation of grain sucrose accumulation and metabolism in Coffea arabica (Arabica) and Coffea canephora (Robusta) revealed through gene expression and enzyme activity analysis. New Phytologist 178:781-797 (2008).

136. Mizuno K, Okuda A, Kato M, Yoneyama N, Tanaka H, Ashihara H and Fujimura T, Isolation of a new dual-functional caffeine synthase gene encoding an enzyme for the conversion of 7-methylxanthine to caffeine from coffee (Coffea arabica L.). FEBS letters 534:75-81 (2003b).

137. Simkin AJ, Qian T, Caillet V, Michoux F, Ben Amor M, Lin C, Tanksley S and McCarthy J, Oleosin gene family of Coffea canephora: Quantitative expression analysis of five oleosin genes in developing and germinating coffee grain. Journal of plant physiology 163:691-708 (2006).

138. Mahesh V, Rakotomalala JJ, Le Gal L, Vigne H, De Kochko A, Hamon S, Noirot M and Campa C, Isolation and genetic mapping of a Coffea canephora phenylalanine ammonia-lyase gene (CcPAL1) and its involvement in the accumulation of caffeoyl quinic acids. Plant cell reports 25:986-992 (2006).

139. Lepelley M, Mahesh V, McCarthy J, Rigoreau M, Crouzillat D, Chabrillange N, de Kochko A and Campa C, Characterization, high-resolution mapping and differential expression of three homologous PAL genes in Coffea canephora Pierre (Rubiaceae). Planta 236:313-326 (2012).

140. Perrois C, Strickler S, G M, M L, L B, S M, J H, L M and I. P, Differential regulation of caffeine metabolism in Coffea arabica (Arabica) and Coffea canephora (Robusta). Planta (2014).

141. Mizuno K, Matsuzaki M, Kanazawa S, Tokiwano T, Yoshizawa Y and Kato M, Conversion of nicotinic acid to trigonelline is catalyzed by N-methyltransferase belonged to motif B' methyltransferase family in Coffea arabica. Biochemical and Biophysical Research Communications 452:1060–1066 (2014).

This article is protected by copyright. All rights reserved.

Page 35: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e142. Yuyama PM, Ivamoto ST, Carazzolle MF, Reis Júnior O, Pereira GAG, Sakuray LM, Ruiz M, Leroy T, Charmetant P, Domingues DS and Pereira LFP, Transcriptome analysis of leaves and fruits of Coffea eugenioides, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

143. Privat I, Perrois C, Mathieu G, Strickler SR, Lepelley M, Bedon L, Michaux S, Husson J and Mueller L, Differential regulation of caffeine metabolism in Coffea arabica and Coffea canephora, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).

144. Joët T, Salmona J, Laffargue A, Descroix F and Dussert S, Use of the growing environment as a source of variation to identify the quantitative trait transcripts and modules of co‐expressed genes that determine chlorogenic acid accumulation. Plant, cell & environment 33:1220-1233 (2010b).

145. Sexton TR, Henry RJ, Harwood CE, Thomas DS, McManus LJ, Raymond C, Henson M and Shepherd M, Pectin Methylesterase genes influence solid wood properties of Eucalyptus pilularis. Plant Physiology 158:531–541 (2012).

146. Hall D, Tegstrom C and Ingvarsson PK, Using association mapping to dissect the genetic basis of complex traits in plants. Briefings in functional genomics 9:157-165 (2010).

147. Henry RJ, Next-generation sequencing for understanding and accelerating crop domestication. Briefings in functional genomics 11:51-56 (2011).

148. Henry RJ, Edwards M, Waters DL, Bundock P, Sexton TR, Masouleh AK, Nock CJ and Pattemore J, Application of large-scale sequencing to marker discovery in plants. Journal of biosciences 37:829-841 (2012).

149. Varshney RK, Terauchi R and McCouch SR, Harvesting the Promising Fruits of Genomics: Applying Genome Sequencing Technologies to Crop Breeding. PLOS biology 12 (2014).

150. Kumar S, Banks TW and Cloutier S, SNP discovery through next-generation sequencing and its applications. International journal of plant genomics 2012 (2012).

151. Edwards M, Whole-Genome Sequencing for Marker Discovery, in Molecular Markers in Plants, Ed by Henry RJ. Wiley-Blackwell (2013).

152. Gao Q, Yue G, Li W, Wang J, Xu J and Yin Y, Recent Progress Using High‐throughput Sequencing Technologies in Plant Molecular Breeding. Journal of integrative plant biology 54:215-227 (2012).

153. Illumina, An introduction to Next-generation sequencing technology. (2013).

154. Whitt SR and Buckler IV ES, Using natural allelic diversity to evaluate gene function, in Plant Functional Genomics: Methods and Protocols, ed. by Grotewold E. Humana Press, Totowa, NJ, pp 123-139 (2003).

155. Neale DB and Savolainen O, Association genetics of complex traits in conifers. Trends in Plant Science 9:325-330 (2004).

This article is protected by copyright. All rights reserved.

Page 36: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 1. Variation of the main biochemical compounds determining quality in coffee

Compound Level C. arabica

(dmb – g kg-1)

C. canephora

(dmb – g kg-1)

Other species

(dmb – g kg-1)

Sucrose Lowest 7.40 4.05 3.80

Highest 11.10 7.05 10.70

Caffeine Lowest 0.96 1.51 0.00

Highest 1.62 3.30 2.64

Trigonelline Lowest 0.88 0.75 0.39

Highest 2.90 1.24 1.77

Lipids Ave. 15 10 8-30

CGAs Lowest 4.34 7 0.8

Highest 4.80 14.4 11.9

Source: (15-18)

This article is protected by copyright. All rights reserved.

Page 37: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 2: Inheritance mode and heritability of coffee quality determinants

Trait Intraspecific hybrids Interspecific hybrids

Sucrose content Polygenic inheritance

NSH = 0.11

Male & female additive &

dominant effects

Polygenic & additive inheritance

Higher heritability value

Caffeine Additive inheritance

NSH = 0.33; BSH = 0.76

NSH = 0.80

Female additive effects

Polygenic & additive inheritance

One major gene with two alleles

No caffeine = recessive gene

(caf1)

High BSH

Trigonelline Male & female additive and

dominant effects

NSH = 0.38

Not additive, maternal

inheritance

heritability = 0.71

Fat content NSH = 0.74

Male additive

CGAs NSH = 0.36

Female additive effects

Nuclear inheritance

3-FQA isomer controlled by one

major gene

NSH = narrow-sense heritability; BSH = broad-sense heritability

Source: (41-49)

This article is protected by copyright. All rights reserved.

Page 38: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 3. Genome sequence data available for the two main domesticated coffee

species

Whole genome sequence of C. canephora

Genome

size

(Mb)

Coverage

No of

contigs

No of

scaffolds

No of

genes

MicroRNA

precursors

Organellar-to-

nuclear genome

transfers

Est. 710 30 x 25,216 13,345 25,574 92 2,573

Chloroplast genome sequence of C. arabica

Genome

size (bp)

IRs No of

genes

Coding region

protein

genes

tRNA

genes

rRNA

genes

genes containing

introns

155,189 25,943 130 79 29 4 18

This article is protected by copyright. All rights reserved.

Page 39: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 4: Bacterial Artificial Chromosome libraries of two main coffee species

Genotypes No of

clones

Ave. size of

clones (kb)

Coverage Authors

C. canephora

IF 126 55,296 135 10.6 x Leroy et al., 2005

IF 200 DH 36,864 166 8.6 x Dereeper et al., 2013

IF 200 DH 36,864 121 6.3 x Dereeper et al., 2013

C. arabica

IAPAR 59 80,813 130 8 x Noir et al., 2004

Hybrid (Mokka x Catimor) 52,416 94 4 x Jones et al., 2006

Timor Hybrid CIFC 832/2 56,832 118 5-6 x Cacao et al., 2013

This article is protected by copyright. All rights reserved.

Page 40: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 5: Expressed Sequence Tags from the three main coffee species

Project/

organisation

Tissues

ESTs of species Unigenes/

genes C.

arabica

C.

canephora

C.

racemosa

Fernandez et al. (98) leaves 527

Nestle and Cornell

Uni

Lin et al. (99)

seeds

47,000 13,175

Brazilian Genome

Vieira et al. (100)

leaves and

fruits

130,792 12,381 10,566 33,000

CENICAFE

Montoya et al. (101)

leaf, flower

and fruit

32,961 10,799

De Nardi et al. (102) leaves and

roots

1,587

French IRD

Poncet et al. (103)

leaves and

fruits

10,420 5,534

Salmona et al. (104) seeds 266

Total (246,500) 166,133 69,801 10,566 62,508

This article is protected by copyright. All rights reserved.

Page 41: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 6. Genetic linkage maps of coffee

Markers Population used Length

(cM)

No of

linkage

groups

Traits/

purpose

Authors

C. arabica

AFLP pseudo-F2

population

(Mokka hybrid x

Catimor)

1,802.8 31 source-sink traits (110)

AFLP F2 of Tall Mokka and

Catimor

1,042.4 40 cupping quality

and morphology

(111)

AFLP and SSR F2 of C. arabica x C.

canephora

1,011 37 quality and

productivity

(92)

RAPD F2 of (Mundo Nov x

Hybrido de Timor) x

Hybrido de Timor

540.6 8 Partial linkage

map

(118)

SSR F2 and F3 of Caturra

x CCC1046

3800 22 yield, plant height and fruit size

(119)

C. canephora

RFLP and

RAPD

doubled haploid

(IF200)

1,402 15 QTL analysis (109)

RFLP and SSR doubled haploid and 1,041 11 segregation (112)

This article is protected by copyright. All rights reserved.

Page 42: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eTC (IF200 x DH) distortion

RFLP and SSR (C. heterocalyx x C.

canephora) x C.

canephora

1,360 15 QTL analysis (113)

RFLP and SSR BP409 x Q121 1,258 11 yield and vigour (114)

AFLP and

ISSR

(C.canephora x C.

liberica) x C. liberica

1,502 16 morphological

traits

(115)

Conserved

Ortholog Set

(COS)

Intraspecific hybrid

of C. canephora

1,331 11 synteny maps

between coffee

and tomato

(86)

SSR pseudo-backcross of

C. canephora

1,290 11 yield and quality (44)

SSR 2 populations of

intraspecific hybrid

of C. canephora

1,201 11 yield and quality (49)

Other species

AFLP and

RFLP

(C.

pseudozanguebariae

x C. liberica var.

dewevrei (DEW)) x

DEW

1,144 14 biochemical

traits (caffeine,

CGAs, sucrose)

(116)

SSR C. liberica x C.

eugenioides

798.68 11 QTL analysis (117)

This article is protected by copyright. All rights reserved.

Page 43: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

e

Table 7: Quantitative Trait Loci and Linkage Groups in coffee

Traits No of QTLs and LGs

C. canephora Other species

Sucrose 9 QTLs/A, E, I, J

Caffeine 14 QTLs/A, C, E, I, K 2 QTLs/A, G

Trigonelline 5 QTLs/F, G, I, K 1 QTL/G

Lipids 7 QTLs/B, C, E, G, I

CGAs 23 QTLs/A, B, D, E, F, I, J, K 1 QTL/A

Cup quality 14 QTLs/A, B, D, E, G, H, I

Source: (43, 44, 49, 120, 121)

This article is protected by copyright. All rights reserved.

Page 44: Advances in genomics for the improvement of quality in Coffee381677/UQ381677... · 2019-10-09 · genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while genotype

Acc

epte

d A

rticl

eTable 8: Genes controlling biochemical compounds determining quality traits in coffee

Species Sucrose Caffeine Trigonelline Fatty acids CGAs

C. arabica CaSUS1 &

CaSUS2,

CaInv3

CmXRS1, CTS2, CCS1, CaXMT1-2, CaMXMT1-2, and CaDXMT1-2

CTgS1 &

CTgS2

OLE2,

DGAT

4CL8, HQT,

F5H1 & POD

C.

canephora

CcSUS1,

CcSUS2

& 8 new

genes

CmXRS1, CTS2,

CCS1 &

23 genes

relating to

alkaloid

catalytic

N/A CcOLE1-5,

CcSTO1

& 15 other

genes

Many genes

involved in

phenylpropa

-noid

pathway

Source: (7, 44, 79, 93, 106, 120, 123-130)

This article is protected by copyright. All rights reserved.