
An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora

BACKGROUND: Coffee is one of the world's most important crops; it is consumed worldwide and plays a significant role in the economy of producing countries. Coffea arabica and C. canephora are responsible for 70 and 30% of commercial production, respectively. C. arabica is an allotetraploid from...


Bibliographic Details
Main authors: Mondego, Jorge MC, Vidal, Ramon O, Carazzolle, Marcelo F, Tokuda, Eric K, Parizzi, Lucas P, Costa, Gustavo GL, Pereira, Luiz FP, Andrade, Alan C, Colombo, Carlos A, Vieira, Luiz GE, Pereira, Gonçalo AG
Format: Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045888/
https://www.ncbi.nlm.nih.gov/pubmed/21303543
http://dx.doi.org/10.1186/1471-2229-11-30
author Mondego, Jorge MC
Vidal, Ramon O
Carazzolle, Marcelo F
Tokuda, Eric K
Parizzi, Lucas P
Costa, Gustavo GL
Pereira, Luiz FP
Andrade, Alan C
Colombo, Carlos A
Vieira, Luiz GE
Pereira, Gonçalo AG
collection PubMed
description BACKGROUND: Coffee is one of the world's most important crops; it is consumed worldwide and plays a significant role in the economy of producing countries. Coffea arabica and C. canephora are responsible for 70 and 30% of commercial production, respectively. C. arabica is an allotetraploid from a recent hybridization of the diploid species, C. canephora and C. eugenioides. C. arabica has lower genetic diversity and results in a higher quality beverage than C. canephora. Research initiatives have been launched to produce genomic and transcriptomic data about Coffea spp. as a strategy to improve breeding efficiency.
RESULTS: Assembling the expressed sequence tags (ESTs) of C. arabica and C. canephora produced by the Brazilian Coffee Genome Project and the Nestlé-Cornell Consortium revealed 32,007 clusters of C. arabica and 16,665 clusters of C. canephora. We detected different GC3 profiles between these species that are related to their genome structure and mating system. BLAST analysis revealed similarities between coffee and grape (Vitis vinifera) genes. Using KA/KS analysis, we identified coffee genes under purifying and positive selection. Protein domain and gene ontology analyses suggested differences between Coffea spp. data, mainly in relation to complex sugar synthases and nucleotide binding proteins. OrthoMCL was used to identify specific and prevalent coffee protein families when compared to five other plant species. Among the interesting families annotated are new cystatins, glycine-rich proteins and RALF-like peptides. Hierarchical clustering was used to independently group C. arabica and C. canephora expression clusters according to expression data extracted from EST libraries, resulting in the identification of differentially expressed genes. Based on these results, we emphasize gene annotation and discuss plant defenses, abiotic stress and cup quality-related functional categories.
CONCLUSION: We present the first comprehensive genome-wide transcript profile study of C. arabica and C. canephora, which can be freely accessed by the scientific community at http://www.lge.ibi.unicamp.br/coffea. Our data reveal the presence of species-specific/prevalent genes in coffee that may help to explain particular characteristics of these two crops. The identification of differentially expressed transcripts offers a starting point for the correlation between gene expression profiles and Coffea spp. developmental traits, providing valuable insights for coffee breeding and biotechnology, especially concerning sugar metabolism and stress tolerance.
format Text
id pubmed-3045888
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3045888 2011-03-01 BMC Plant Biol Research Article BioMed Central 2011-02-08 Text en Copyright ©2011 Mondego et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora
topic Research Article