Cargando…

A preliminary analysis of genome structure and composition in Gossypium hirsutum

BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC t...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Wangzhen, Cai, Caiping, Wang, Changbiao, Zhao, Liang, Wang, Lei, Zhang, Tianzhen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2481271/
https://www.ncbi.nlm.nih.gov/pubmed/18590573
http://dx.doi.org/10.1186/1471-2164-9-314
_version_ 1782157974663331840
author Guo, Wangzhen
Cai, Caiping
Wang, Changbiao
Zhao, Liang
Wang, Lei
Zhang, Tianzhen
author_facet Guo, Wangzhen
Cai, Caiping
Wang, Changbiao
Zhao, Liang
Wang, Lei
Zhang, Tianzhen
author_sort Guo, Wangzhen
collection PubMed
description BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC tagging information approaches to predict the general composition and structure of the allotetraploid cotton genome. RESULTS: 142 BAC sequences from Gossypium hirsutum cv. Maxxa were downloaded and confirmed. These BAC sequence analysis revealed that the tetraploid cotton genome contains over 70,000 candidate genes with duplicated gene copies in homoeologous A- and D-subgenome regions. Gene distribution is uneven, with gene-rich and gene-free regions of the genome. Twenty-one percent of the 142 BACs lacked genes. BAC gene density ranged from 0 to 33.2 per 100 kb, whereas most gene islands contained only one gene with an average of 1.5 genes per island. Retro-elements were found to be a major component, first an enriched LTR/gypsy and second LTR/copia. Most LTR retrotransposons were truncated and in nested structures. In addition, 166 polymorphic loci amplified with SSRs developed from 70 BAC clones were tagged on our backbone genetic map. Seventy-five percent (125/166) of the polymorphic loci were tagged on the D-subgenome. By comprehensively analyzing the molecular size of amplified products among tetraploid G. hirsutum cv. Maxxa, acc. TM-1, and G. barbadense cv. Hai7124, and diploid G. herbaceum var. africanum and G. raimondii, 37 BACs, 12 from the A- and 25 from the D-subgenome, were further anchored to their corresponding subgenome chromosomes. After a large amount of genes sequence comparison from different subgenome BACs, the result showed that introns might have no contribution to different subgenome size in Gossypium. CONCLUSION: This study provides us with the first glimpse of cotton genome complexity and serves as a foundation for tetraploid cotton whole genomesequencing in the future.
format Text
id pubmed-2481271
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24812712008-07-23 A preliminary analysis of genome structure and composition in Gossypium hirsutum Guo, Wangzhen Cai, Caiping Wang, Changbiao Zhao, Liang Wang, Lei Zhang, Tianzhen BMC Genomics Research Article BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC tagging information approaches to predict the general composition and structure of the allotetraploid cotton genome. RESULTS: 142 BAC sequences from Gossypium hirsutum cv. Maxxa were downloaded and confirmed. These BAC sequence analysis revealed that the tetraploid cotton genome contains over 70,000 candidate genes with duplicated gene copies in homoeologous A- and D-subgenome regions. Gene distribution is uneven, with gene-rich and gene-free regions of the genome. Twenty-one percent of the 142 BACs lacked genes. BAC gene density ranged from 0 to 33.2 per 100 kb, whereas most gene islands contained only one gene with an average of 1.5 genes per island. Retro-elements were found to be a major component, first an enriched LTR/gypsy and second LTR/copia. Most LTR retrotransposons were truncated and in nested structures. In addition, 166 polymorphic loci amplified with SSRs developed from 70 BAC clones were tagged on our backbone genetic map. Seventy-five percent (125/166) of the polymorphic loci were tagged on the D-subgenome. By comprehensively analyzing the molecular size of amplified products among tetraploid G. hirsutum cv. Maxxa, acc. TM-1, and G. barbadense cv. Hai7124, and diploid G. herbaceum var. africanum and G. raimondii, 37 BACs, 12 from the A- and 25 from the D-subgenome, were further anchored to their corresponding subgenome chromosomes. After a large amount of genes sequence comparison from different subgenome BACs, the result showed that introns might have no contribution to different subgenome size in Gossypium. CONCLUSION: This study provides us with the first glimpse of cotton genome complexity and serves as a foundation for tetraploid cotton whole genomesequencing in the future. BioMed Central 2008-07-01 /pmc/articles/PMC2481271/ /pubmed/18590573 http://dx.doi.org/10.1186/1471-2164-9-314 Text en Copyright © 2008 Guo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Guo, Wangzhen
Cai, Caiping
Wang, Changbiao
Zhao, Liang
Wang, Lei
Zhang, Tianzhen
A preliminary analysis of genome structure and composition in Gossypium hirsutum
title A preliminary analysis of genome structure and composition in Gossypium hirsutum
title_full A preliminary analysis of genome structure and composition in Gossypium hirsutum
title_fullStr A preliminary analysis of genome structure and composition in Gossypium hirsutum
title_full_unstemmed A preliminary analysis of genome structure and composition in Gossypium hirsutum
title_short A preliminary analysis of genome structure and composition in Gossypium hirsutum
title_sort preliminary analysis of genome structure and composition in gossypium hirsutum
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2481271/
https://www.ncbi.nlm.nih.gov/pubmed/18590573
http://dx.doi.org/10.1186/1471-2164-9-314
work_keys_str_mv AT guowangzhen apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT caicaiping apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT wangchangbiao apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT zhaoliang apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT wanglei apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT zhangtianzhen apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT guowangzhen preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT caicaiping preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT wangchangbiao preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT zhaoliang preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT wanglei preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum
AT zhangtianzhen preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum