Cargando…
A preliminary analysis of genome structure and composition in Gossypium hirsutum
BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC t...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2481271/ https://www.ncbi.nlm.nih.gov/pubmed/18590573 http://dx.doi.org/10.1186/1471-2164-9-314 |
_version_ | 1782157974663331840 |
---|---|
author | Guo, Wangzhen Cai, Caiping Wang, Changbiao Zhao, Liang Wang, Lei Zhang, Tianzhen |
author_facet | Guo, Wangzhen Cai, Caiping Wang, Changbiao Zhao, Liang Wang, Lei Zhang, Tianzhen |
author_sort | Guo, Wangzhen |
collection | PubMed |
description | BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC tagging information approaches to predict the general composition and structure of the allotetraploid cotton genome. RESULTS: 142 BAC sequences from Gossypium hirsutum cv. Maxxa were downloaded and confirmed. These BAC sequence analysis revealed that the tetraploid cotton genome contains over 70,000 candidate genes with duplicated gene copies in homoeologous A- and D-subgenome regions. Gene distribution is uneven, with gene-rich and gene-free regions of the genome. Twenty-one percent of the 142 BACs lacked genes. BAC gene density ranged from 0 to 33.2 per 100 kb, whereas most gene islands contained only one gene with an average of 1.5 genes per island. Retro-elements were found to be a major component, first an enriched LTR/gypsy and second LTR/copia. Most LTR retrotransposons were truncated and in nested structures. In addition, 166 polymorphic loci amplified with SSRs developed from 70 BAC clones were tagged on our backbone genetic map. Seventy-five percent (125/166) of the polymorphic loci were tagged on the D-subgenome. By comprehensively analyzing the molecular size of amplified products among tetraploid G. hirsutum cv. Maxxa, acc. TM-1, and G. barbadense cv. Hai7124, and diploid G. herbaceum var. africanum and G. raimondii, 37 BACs, 12 from the A- and 25 from the D-subgenome, were further anchored to their corresponding subgenome chromosomes. After a large amount of genes sequence comparison from different subgenome BACs, the result showed that introns might have no contribution to different subgenome size in Gossypium. CONCLUSION: This study provides us with the first glimpse of cotton genome complexity and serves as a foundation for tetraploid cotton whole genomesequencing in the future. |
format | Text |
id | pubmed-2481271 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-24812712008-07-23 A preliminary analysis of genome structure and composition in Gossypium hirsutum Guo, Wangzhen Cai, Caiping Wang, Changbiao Zhao, Liang Wang, Lei Zhang, Tianzhen BMC Genomics Research Article BACKGROUND: Upland cotton has the highest yield, and accounts for > 95% of world cotton production. Decoding upland cotton genomes will undoubtedly provide the ultimate reference and resource for structural, functional, and evolutionary studies of the species. Here, we employed GeneTrek and BAC tagging information approaches to predict the general composition and structure of the allotetraploid cotton genome. RESULTS: 142 BAC sequences from Gossypium hirsutum cv. Maxxa were downloaded and confirmed. These BAC sequence analysis revealed that the tetraploid cotton genome contains over 70,000 candidate genes with duplicated gene copies in homoeologous A- and D-subgenome regions. Gene distribution is uneven, with gene-rich and gene-free regions of the genome. Twenty-one percent of the 142 BACs lacked genes. BAC gene density ranged from 0 to 33.2 per 100 kb, whereas most gene islands contained only one gene with an average of 1.5 genes per island. Retro-elements were found to be a major component, first an enriched LTR/gypsy and second LTR/copia. Most LTR retrotransposons were truncated and in nested structures. In addition, 166 polymorphic loci amplified with SSRs developed from 70 BAC clones were tagged on our backbone genetic map. Seventy-five percent (125/166) of the polymorphic loci were tagged on the D-subgenome. By comprehensively analyzing the molecular size of amplified products among tetraploid G. hirsutum cv. Maxxa, acc. TM-1, and G. barbadense cv. Hai7124, and diploid G. herbaceum var. africanum and G. raimondii, 37 BACs, 12 from the A- and 25 from the D-subgenome, were further anchored to their corresponding subgenome chromosomes. After a large amount of genes sequence comparison from different subgenome BACs, the result showed that introns might have no contribution to different subgenome size in Gossypium. CONCLUSION: This study provides us with the first glimpse of cotton genome complexity and serves as a foundation for tetraploid cotton whole genomesequencing in the future. BioMed Central 2008-07-01 /pmc/articles/PMC2481271/ /pubmed/18590573 http://dx.doi.org/10.1186/1471-2164-9-314 Text en Copyright © 2008 Guo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Guo, Wangzhen Cai, Caiping Wang, Changbiao Zhao, Liang Wang, Lei Zhang, Tianzhen A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title | A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title_full | A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title_fullStr | A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title_full_unstemmed | A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title_short | A preliminary analysis of genome structure and composition in Gossypium hirsutum |
title_sort | preliminary analysis of genome structure and composition in gossypium hirsutum |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2481271/ https://www.ncbi.nlm.nih.gov/pubmed/18590573 http://dx.doi.org/10.1186/1471-2164-9-314 |
work_keys_str_mv | AT guowangzhen apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT caicaiping apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT wangchangbiao apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT zhaoliang apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT wanglei apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT zhangtianzhen apreliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT guowangzhen preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT caicaiping preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT wangchangbiao preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT zhaoliang preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT wanglei preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum AT zhangtianzhen preliminaryanalysisofgenomestructureandcompositioningossypiumhirsutum |