Cargando…

Sequencing and analysis of the gene-rich space of cowpea

BACKGROUND: Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Cen...

Descripción completa

Detalles Bibliográficos
Autores principales: Timko, Michael P, Rushton, Paul J, Laudeman, Thomas W, Bokowiec, Marta T, Chipumuro, Edmond, Cheung, Foo, Town, Christopher D, Chen, Xianfeng
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2279124/
https://www.ncbi.nlm.nih.gov/pubmed/18304330
http://dx.doi.org/10.1186/1471-2164-9-103
_version_ 1782152057985171456
author Timko, Michael P
Rushton, Paul J
Laudeman, Thomas W
Bokowiec, Marta T
Chipumuro, Edmond
Cheung, Foo
Town, Christopher D
Chen, Xianfeng
author_facet Timko, Michael P
Rushton, Paul J
Laudeman, Thomas W
Bokowiec, Marta T
Chipumuro, Edmond
Cheung, Foo
Town, Christopher D
Chen, Xianfeng
author_sort Timko, Michael P
collection PubMed
description BACKGROUND: Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Central Africa, mostly by poor subsistence farmers. Despite its economic and social importance in the developing world, cowpea remains to a large extent an underexploited crop. Among the major goals of cowpea breeding and improvement programs is the stacking of desirable agronomic traits, such as disease and pest resistance and response to abiotic stresses. Implementation of marker-assisted selection and breeding programs is severely limited by a paucity of trait-linked markers and a general lack of information on gene structure and organization. With a nuclear genome size estimated at ~620 Mb, the cowpea genome is an ideal target for reduced representation sequencing. RESULTS: We report here the sequencing and analysis of the gene-rich, hypomethylated portion of the cowpea genome selectively cloned by methylation filtration (MF) technology. Over 250,000 gene-space sequence reads (GSRs) with an average length of 610 bp were generated, yielding ~160 Mb of sequence information. The GSRs were assembled, annotated by BLAST homology searches of four public protein annotation databases and four plant proteomes (A. thaliana, M. truncatula, O. sativa, and P. trichocarpa), and analyzed using various domain and gene modeling tools. A total of 41,260 GSR assemblies and singletons were annotated, of which 19,786 have unique GenBank accession numbers. Within the GSR dataset, 29% of the sequences were annotated using the Arabidopsis Gene Ontology (GO) with the largest categories of assigned function being catalytic activity and metabolic processes, groups that include the majority of cellular enzymes and components of amino acid, carbohydrate and lipid metabolism. A total of 5,888 GSRs had homology to genes encoding transcription factors (TFs) and transcription associated factors (TAFs) representing about 5% of the total annotated sequences in the dataset. Sixty-two (62) of the 64 well-characterized plant transcription factor (TF) gene families are represented in the cowpea GSRs, and these families are of similar size and phylogenetic organization to those characterized in other plants. The cowpea GSRs also provides a rich source of genes involved in photoperiodic control, symbiosis, and defense-related responses. Comparisons to available databases revealed that about 74% of cowpea ESTs and 70% of all legume ESTs were represented in the GSR dataset. As approximately 12% of all GSRs contain an identifiable simple-sequence repeat, the dataset is a powerful resource for the design of microsatellite markers. CONCLUSION: The availability of extensive publicly available genomic data for cowpea, a non-model legume with significant importance in the developing world, represents a significant step forward in legume research. Not only does the gene space sequence enable the detailed analysis of gene structure, gene family organization and phylogenetic relationships within cowpea, but it also facilitates the characterization of syntenic relationships with other cultivated and model legumes, and will contribute to determining patterns of chromosomal evolution in the Leguminosae. The micro and macrosyntenic relationships detected between cowpea and other cultivated and model legumes should simplify the identification of informative markers for marker-assisted trait selection and map-based gene isolation necessary for cowpea improvement.
format Text
id pubmed-2279124
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22791242008-04-03 Sequencing and analysis of the gene-rich space of cowpea Timko, Michael P Rushton, Paul J Laudeman, Thomas W Bokowiec, Marta T Chipumuro, Edmond Cheung, Foo Town, Christopher D Chen, Xianfeng BMC Genomics Research Article BACKGROUND: Cowpea, Vigna unguiculata (L.) Walp., is one of the most important food and forage legumes in the semi-arid tropics because of its drought tolerance and ability to grow on poor quality soils. Approximately 80% of cowpea production takes place in the dry savannahs of tropical West and Central Africa, mostly by poor subsistence farmers. Despite its economic and social importance in the developing world, cowpea remains to a large extent an underexploited crop. Among the major goals of cowpea breeding and improvement programs is the stacking of desirable agronomic traits, such as disease and pest resistance and response to abiotic stresses. Implementation of marker-assisted selection and breeding programs is severely limited by a paucity of trait-linked markers and a general lack of information on gene structure and organization. With a nuclear genome size estimated at ~620 Mb, the cowpea genome is an ideal target for reduced representation sequencing. RESULTS: We report here the sequencing and analysis of the gene-rich, hypomethylated portion of the cowpea genome selectively cloned by methylation filtration (MF) technology. Over 250,000 gene-space sequence reads (GSRs) with an average length of 610 bp were generated, yielding ~160 Mb of sequence information. The GSRs were assembled, annotated by BLAST homology searches of four public protein annotation databases and four plant proteomes (A. thaliana, M. truncatula, O. sativa, and P. trichocarpa), and analyzed using various domain and gene modeling tools. A total of 41,260 GSR assemblies and singletons were annotated, of which 19,786 have unique GenBank accession numbers. Within the GSR dataset, 29% of the sequences were annotated using the Arabidopsis Gene Ontology (GO) with the largest categories of assigned function being catalytic activity and metabolic processes, groups that include the majority of cellular enzymes and components of amino acid, carbohydrate and lipid metabolism. A total of 5,888 GSRs had homology to genes encoding transcription factors (TFs) and transcription associated factors (TAFs) representing about 5% of the total annotated sequences in the dataset. Sixty-two (62) of the 64 well-characterized plant transcription factor (TF) gene families are represented in the cowpea GSRs, and these families are of similar size and phylogenetic organization to those characterized in other plants. The cowpea GSRs also provides a rich source of genes involved in photoperiodic control, symbiosis, and defense-related responses. Comparisons to available databases revealed that about 74% of cowpea ESTs and 70% of all legume ESTs were represented in the GSR dataset. As approximately 12% of all GSRs contain an identifiable simple-sequence repeat, the dataset is a powerful resource for the design of microsatellite markers. CONCLUSION: The availability of extensive publicly available genomic data for cowpea, a non-model legume with significant importance in the developing world, represents a significant step forward in legume research. Not only does the gene space sequence enable the detailed analysis of gene structure, gene family organization and phylogenetic relationships within cowpea, but it also facilitates the characterization of syntenic relationships with other cultivated and model legumes, and will contribute to determining patterns of chromosomal evolution in the Leguminosae. The micro and macrosyntenic relationships detected between cowpea and other cultivated and model legumes should simplify the identification of informative markers for marker-assisted trait selection and map-based gene isolation necessary for cowpea improvement. BioMed Central 2008-02-27 /pmc/articles/PMC2279124/ /pubmed/18304330 http://dx.doi.org/10.1186/1471-2164-9-103 Text en Copyright © 2008 Timko et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Timko, Michael P
Rushton, Paul J
Laudeman, Thomas W
Bokowiec, Marta T
Chipumuro, Edmond
Cheung, Foo
Town, Christopher D
Chen, Xianfeng
Sequencing and analysis of the gene-rich space of cowpea
title Sequencing and analysis of the gene-rich space of cowpea
title_full Sequencing and analysis of the gene-rich space of cowpea
title_fullStr Sequencing and analysis of the gene-rich space of cowpea
title_full_unstemmed Sequencing and analysis of the gene-rich space of cowpea
title_short Sequencing and analysis of the gene-rich space of cowpea
title_sort sequencing and analysis of the gene-rich space of cowpea
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2279124/
https://www.ncbi.nlm.nih.gov/pubmed/18304330
http://dx.doi.org/10.1186/1471-2164-9-103
work_keys_str_mv AT timkomichaelp sequencingandanalysisofthegenerichspaceofcowpea
AT rushtonpaulj sequencingandanalysisofthegenerichspaceofcowpea
AT laudemanthomasw sequencingandanalysisofthegenerichspaceofcowpea
AT bokowiecmartat sequencingandanalysisofthegenerichspaceofcowpea
AT chipumuroedmond sequencingandanalysisofthegenerichspaceofcowpea
AT cheungfoo sequencingandanalysisofthegenerichspaceofcowpea
AT townchristopherd sequencingandanalysisofthegenerichspaceofcowpea
AT chenxianfeng sequencingandanalysisofthegenerichspaceofcowpea