Characterization of the genome of bald cypress
BACKGROUND: Bald cypress (Taxodium distichum var. distichum) is a coniferous tree of tremendous ecological and economic importance. It is a member of the family Cupressaceae which also includes cypresses, redwoods, sequoias, thujas, and junipers. While the bald cypress genome is more than three time...
Main Authors: | Liu, Wenxuan; Thummasuwan, Supaphan; Sehgal, Sunish K; Chouvarine, Philippe; Peterson, Daniel G |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2011 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228858/ https://www.ncbi.nlm.nih.gov/pubmed/22077969 http://dx.doi.org/10.1186/1471-2164-12-553 |
_version_ | 1782217883641708544 |
---|---|
author | Liu, Wenxuan Thummasuwan, Supaphan Sehgal, Sunish K Chouvarine, Philippe Peterson, Daniel G |
author_facet | Liu, Wenxuan Thummasuwan, Supaphan Sehgal, Sunish K Chouvarine, Philippe Peterson, Daniel G |
author_sort | Liu, Wenxuan |
collection | PubMed |
description | BACKGROUND: Bald cypress (Taxodium distichum var. distichum) is a coniferous tree of tremendous ecological and economic importance. It is a member of the family Cupressaceae which also includes cypresses, redwoods, sequoias, thujas, and junipers. While the bald cypress genome is more than three times the size of the human genome, its 1C DNA content is amongst the smallest of any conifer. To learn more about the genome of bald cypress and gain insight into the evolution of Cupressaceae genomes, we performed a Cot analysis and used Cot filtration to study Taxodium DNA. Additionally, we constructed a 6.7 genome-equivalent BAC library that we screened with known Taxodium genes and select repeats. RESULTS: The bald cypress genome is composed of 90% repetitive DNA with most sequences being found in low to mid copy numbers. The most abundant repeats are found in fewer than 25,000 copies per genome. Approximately 7.4% of the genome is single/low-copy DNA (i.e., sequences found in 1 to 5 copies). Sequencing of highly repetitive Cot clones indicates that most Taxodium repeats are highly diverged from previously characterized plant repeat sequences. The bald cypress BAC library consists of 606,336 clones (average insert size of 113 kb) and collectively provides 6.7-fold genome-equivalent coverage of the bald cypress genome. Macroarray screening with known genes produced, on average, about 1.5 positive clones per probe per genome-equivalent. Library screening with Cot-1 DNA revealed that approximately 83% of BAC clones contain repetitive sequences iterated 10^3 to 10^4 times per genome. CONCLUSIONS: The BAC library for bald cypress is the first to be generated for a conifer species outside of the family Pinaceae. The Taxodium BAC library was shown to be useful in gene isolation and genome characterization and should be an important tool in gymnosperm comparative genomics, physical mapping, genome sequencing, and gene/polymorphism discovery.
The single/low-copy (SL) component of bald cypress is 4.6 times the size of the Arabidopsis genome. As suggested for other gymnosperms, the large amount of SL DNA in Taxodium is likely the result of divergence among ancient repeat copies and gene/pseudogene duplication. |
format | Online Article Text |
id | pubmed-3228858 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3228858 2011-12-12 Characterization of the genome of bald cypress Liu, Wenxuan Thummasuwan, Supaphan Sehgal, Sunish K Chouvarine, Philippe Peterson, Daniel G BMC Genomics Research Article BACKGROUND: Bald cypress (Taxodium distichum var. distichum) is a coniferous tree of tremendous ecological and economic importance. It is a member of the family Cupressaceae which also includes cypresses, redwoods, sequoias, thujas, and junipers. While the bald cypress genome is more than three times the size of the human genome, its 1C DNA content is amongst the smallest of any conifer. To learn more about the genome of bald cypress and gain insight into the evolution of Cupressaceae genomes, we performed a Cot analysis and used Cot filtration to study Taxodium DNA. Additionally, we constructed a 6.7 genome-equivalent BAC library that we screened with known Taxodium genes and select repeats. RESULTS: The bald cypress genome is composed of 90% repetitive DNA with most sequences being found in low to mid copy numbers. The most abundant repeats are found in fewer than 25,000 copies per genome. Approximately 7.4% of the genome is single/low-copy DNA (i.e., sequences found in 1 to 5 copies). Sequencing of highly repetitive Cot clones indicates that most Taxodium repeats are highly diverged from previously characterized plant repeat sequences. The bald cypress BAC library consists of 606,336 clones (average insert size of 113 kb) and collectively provides 6.7-fold genome-equivalent coverage of the bald cypress genome. Macroarray screening with known genes produced, on average, about 1.5 positive clones per probe per genome-equivalent. Library screening with Cot-1 DNA revealed that approximately 83% of BAC clones contain repetitive sequences iterated 10^3 to 10^4 times per genome. CONCLUSIONS: The BAC library for bald cypress is the first to be generated for a conifer species outside of the family Pinaceae.
The Taxodium BAC library was shown to be useful in gene isolation and genome characterization and should be an important tool in gymnosperm comparative genomics, physical mapping, genome sequencing, and gene/polymorphism discovery. The single/low-copy (SL) component of bald cypress is 4.6 times the size of the Arabidopsis genome. As suggested for other gymnosperms, the large amount of SL DNA in Taxodium is likely the result of divergence among ancient repeat copies and gene/pseudogene duplication. BioMed Central 2011-11-11 /pmc/articles/PMC3228858/ /pubmed/22077969 http://dx.doi.org/10.1186/1471-2164-12-553 Text en Copyright ©2011 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Liu, Wenxuan Thummasuwan, Supaphan Sehgal, Sunish K Chouvarine, Philippe Peterson, Daniel G Characterization of the genome of bald cypress |
title | Characterization of the genome of bald cypress |
title_full | Characterization of the genome of bald cypress |
title_fullStr | Characterization of the genome of bald cypress |
title_full_unstemmed | Characterization of the genome of bald cypress |
title_short | Characterization of the genome of bald cypress |
title_sort | characterization of the genome of bald cypress |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228858/ https://www.ncbi.nlm.nih.gov/pubmed/22077969 http://dx.doi.org/10.1186/1471-2164-12-553 |
work_keys_str_mv | AT liuwenxuan characterizationofthegenomeofbaldcypress AT thummasuwansupaphan characterizationofthegenomeofbaldcypress AT sehgalsunishk characterizationofthegenomeofbaldcypress AT chouvarinephilippe characterizationofthegenomeofbaldcypress AT petersondanielg characterizationofthegenomeofbaldcypress |
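
The figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is not part of the record or the authors' method; it only back-calculates the genome size implied by the stated clone count, average insert size, and fold coverage, then derives the single/low-copy (SL) amount and the expected number of positive clones per probe. It assumes 1 kb = 1,000 bp, and all names (`implied_genome_size_bp` and the constants) are hypothetical, chosen for illustration.

```python
# Figures as quoted in the abstract (assumption: 1 kb = 1,000 bp).
CLONES = 606_336            # BAC clones in the library
AVG_INSERT_BP = 113_000     # average insert size, 113 kb
COVERAGE = 6.7              # stated genome-equivalents of coverage
SL_FRACTION = 0.074         # single/low-copy share of the genome
SL_VS_ARABIDOPSIS = 4.6     # SL component relative to the Arabidopsis genome
HITS_PER_PROBE_PER_GE = 1.5 # positive clones per probe per genome-equivalent


def implied_genome_size_bp(clones: int, avg_insert_bp: int, coverage: float) -> float:
    """Genome size implied by coverage = clones * insert size / genome size."""
    return clones * avg_insert_bp / coverage


genome_bp = implied_genome_size_bp(CLONES, AVG_INSERT_BP, COVERAGE)

# Amount of single/low-copy DNA implied by the 7.4% figure.
sl_bp = SL_FRACTION * genome_bp

# Arabidopsis genome size implied by the "4.6 times" comparison.
implied_arabidopsis_bp = sl_bp / SL_VS_ARABIDOPSIS

# Positive clones expected per probe across the whole library.
expected_hits = HITS_PER_PROBE_PER_GE * COVERAGE

print(f"Implied genome size: {genome_bp / 1e9:.2f} Gb")
print(f"Implied SL component: {sl_bp / 1e6:.0f} Mb")
print(f"Implied Arabidopsis genome: {implied_arabidopsis_bp / 1e6:.0f} Mb")
print(f"Expected positives per probe: {expected_hits:.2f}")
```

Under these assumptions the implied genome size comes out on the order of 10 Gb, which agrees with the abstract's statement that the bald cypress genome is more than three times the size of the human genome.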