Gene-centric association analysis for the correlation between the guanine-cytosine content levels and temperature range conditions of prokaryotic species
BACKGROUND: The environment has been playing an instrumental role in shaping and maintaining the morphological, physiological and biochemical diversities of prokaryotes. It has been debatable whether the whole-genome Guanine-Cytosine (GC) content levels of prokaryotic organisms are correlated with t...
Main authors: | Zheng, Hao; Wu, Hongwei |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2010 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3024870/ https://www.ncbi.nlm.nih.gov/pubmed/21172057 http://dx.doi.org/10.1186/1471-2105-11-S11-S7 |
_version_ | 1782196824556175360 |
---|---|
author | Zheng, Hao Wu, Hongwei |
author_facet | Zheng, Hao Wu, Hongwei |
author_sort | Zheng, Hao |
collection | PubMed |
description | BACKGROUND: The environment has been playing an instrumental role in shaping and maintaining the morphological, physiological and biochemical diversities of prokaryotes. It has been debatable whether the whole-genome Guanine-Cytosine (GC) content levels of prokaryotic organisms are correlated with their optimal growth temperatures. Since the GC content is variable within a genome, we here focus on the correlation between the genic GC content levels and the temperature range conditions of prokaryotic organisms. RESULTS: The GC content levels in the coding regions of four genes were consistently identified as correlated with the temperature range condition when the association analysis was applied to (i) the 722 mesophilic and 93 thermophilic/hyperthermophilic organisms regardless of their phylogeny, oxygen requirement, salinity, or habitat conditions, and (ii) partial lists of organisms when organisms with certain phylogeny, oxygen requirement, salinity or habitat conditions were excluded. These four genes are K01251 (adenosylhomocysteinase), K03724 (DNA repair and recombination proteins), K07588 (LAO/AO transport system kinase), and K09122 (hypothetical protein). To further validate the identified correlation relationships, we examined to what extent the temperature range condition of an organism can be predicted based on the GC content levels in the coding regions of the selected genes. The 84.52% accuracy for the complete genomes, the 84.09% accuracy for the in-progress genomes, and 82.70% accuracy for the metagenomes, especially when being compared to the 50% accuracy rendered by random guessing, suggested that the temperature range condition of a prokaryotic organism can generally be predicted based on the GC content levels of the selected genomic regions. CONCLUSIONS: The results rendered by various statistical tests and prediction tests indicated that the GC content levels of the coding/non-coding regions of certain genes are highly likely to be correlated with the temperature range conditions of prokaryotic organisms. Therefore, it is promising to carry out “reverse ecology” and to complete the ecological characterizations of prokaryotic organisms, i.e., to infer their temperature range conditions based on the GC content levels of certain genomic regions. |
format | Text |
id | pubmed-3024870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3024870 2011-01-22 Gene-centric association analysis for the correlation between the guanine-cytosine content levels and temperature range conditions of prokaryotic species Zheng, Hao Wu, Hongwei BMC Bioinformatics Research BACKGROUND: The environment has been playing an instrumental role in shaping and maintaining the morphological, physiological and biochemical diversities of prokaryotes. It has been debatable whether the whole-genome Guanine-Cytosine (GC) content levels of prokaryotic organisms are correlated with their optimal growth temperatures. Since the GC content is variable within a genome, we here focus on the correlation between the genic GC content levels and the temperature range conditions of prokaryotic organisms. RESULTS: The GC content levels in the coding regions of four genes were consistently identified as correlated with the temperature range condition when the association analysis was applied to (i) the 722 mesophilic and 93 thermophilic/hyperthermophilic organisms regardless of their phylogeny, oxygen requirement, salinity, or habitat conditions, and (ii) partial lists of organisms when organisms with certain phylogeny, oxygen requirement, salinity or habitat conditions were excluded. These four genes are K01251 (adenosylhomocysteinase), K03724 (DNA repair and recombination proteins), K07588 (LAO/AO transport system kinase), and K09122 (hypothetical protein). To further validate the identified correlation relationships, we examined to what extent the temperature range condition of an organism can be predicted based on the GC content levels in the coding regions of the selected genes. The 84.52% accuracy for the complete genomes, the 84.09% accuracy for the in-progress genomes, and 82.70% accuracy for the metagenomes, especially when being compared to the 50% accuracy rendered by random guessing, suggested that the temperature range condition of a prokaryotic organism can generally be predicted based on the GC content levels of the selected genomic regions. CONCLUSIONS: The results rendered by various statistical tests and prediction tests indicated that the GC content levels of the coding/non-coding regions of certain genes are highly likely to be correlated with the temperature range conditions of prokaryotic organisms. Therefore, it is promising to carry out “reverse ecology” and to complete the ecological characterizations of prokaryotic organisms, i.e., to infer their temperature range conditions based on the GC content levels of certain genomic regions. BioMed Central 2010-12-14 /pmc/articles/PMC3024870/ /pubmed/21172057 http://dx.doi.org/10.1186/1471-2105-11-S11-S7 Text en Copyright ©2010 Wu and Zheng; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Zheng, Hao Wu, Hongwei Gene-centric association analysis for the correlation between the guanine-cytosine content levels and temperature range conditions of prokaryotic species |
title | Gene-centric association analysis for the correlation between the guanine-cytosine content levels and temperature range conditions of prokaryotic species |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3024870/ https://www.ncbi.nlm.nih.gov/pubmed/21172057 http://dx.doi.org/10.1186/1471-2105-11-S11-S7 |