Cargando…
Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes
Genomes provide a platform for storage of chemical information that must be stable under the context in which an organism thrives. The 2‘-deoxyguanosine (G) nucleotide has the potential to provide additional chemical information beyond its Watson-Crick base-pairing capacity. Sequences with four or m...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6200779/ https://www.ncbi.nlm.nih.gov/pubmed/30356061 http://dx.doi.org/10.1038/s41598-018-33944-4 |
_version_ | 1783365391347613696 |
---|---|
author | Ding, Yun Fleming, Aaron M. Burrows, Cynthia J. |
author_facet | Ding, Yun Fleming, Aaron M. Burrows, Cynthia J. |
author_sort | Ding, Yun |
collection | PubMed |
description | Genomes provide a platform for storage of chemical information that must be stable under the context in which an organism thrives. The 2‘-deoxyguanosine (G) nucleotide has the potential to provide additional chemical information beyond its Watson-Crick base-pairing capacity. Sequences with four or more runs of three G nucleotides each are potential G-quadruplex forming sequences (PQSs) that can adopt G-quadruplex folds. Herein, we analyzed sequenced genomes from the NCBI database to determine the PQS densities of the genome sequences. First, we found organisms with large genomes, including humans, alligators, and maize, have similar densities of PQSs (~300 PQSs/Mbp), and the genomes are significantly enriched in PQSs with more than four G tracks. Analysis of microorganism genomes found a greater diversity of PQS densities. In general, PQS densities positively tracked with the GC% of the genome. Exceptions to this observation were the genomes from thermophiles that had many more PQSs than expected by random chance. Analysis of the location of these PQSs in annotated genomes from the order Thermales showed these G-rich sequences to be randomly distributed; in contrast, in the order Deinococcales the PQSs were enriched and biased around transcription start sites of genes. Four representative PQSs, two each from the Thermales and Deinococcales, were studied by biophysical methods to establish the ability of them to fold to G-quadruplexes. The experiments found the two PQSs in the Thermales did not adopt G-quadruplex folds, while the two most common in the Deinococcales adopted stable parallel-stranded G-quadruplexes. The findings lead to a hypothesis that thermophilic organisms are enriched with PQSs as an unavoidable consequence to stabilize thermally their genomes to live at high temperature; in contrast, the genomes from stress-resistant bacteria found in the Deinococcales may utilize PQSs for gene regulatory purposes. |
format | Online Article Text |
id | pubmed-6200779 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-62007792018-10-26 Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes Ding, Yun Fleming, Aaron M. Burrows, Cynthia J. Sci Rep Article Genomes provide a platform for storage of chemical information that must be stable under the context in which an organism thrives. The 2‘-deoxyguanosine (G) nucleotide has the potential to provide additional chemical information beyond its Watson-Crick base-pairing capacity. Sequences with four or more runs of three G nucleotides each are potential G-quadruplex forming sequences (PQSs) that can adopt G-quadruplex folds. Herein, we analyzed sequenced genomes from the NCBI database to determine the PQS densities of the genome sequences. First, we found organisms with large genomes, including humans, alligators, and maize, have similar densities of PQSs (~300 PQSs/Mbp), and the genomes are significantly enriched in PQSs with more than four G tracks. Analysis of microorganism genomes found a greater diversity of PQS densities. In general, PQS densities positively tracked with the GC% of the genome. Exceptions to this observation were the genomes from thermophiles that had many more PQSs than expected by random chance. Analysis of the location of these PQSs in annotated genomes from the order Thermales showed these G-rich sequences to be randomly distributed; in contrast, in the order Deinococcales the PQSs were enriched and biased around transcription start sites of genes. Four representative PQSs, two each from the Thermales and Deinococcales, were studied by biophysical methods to establish the ability of them to fold to G-quadruplexes. The experiments found the two PQSs in the Thermales did not adopt G-quadruplex folds, while the two most common in the Deinococcales adopted stable parallel-stranded G-quadruplexes. The findings lead to a hypothesis that thermophilic organisms are enriched with PQSs as an unavoidable consequence to stabilize thermally their genomes to live at high temperature; in contrast, the genomes from stress-resistant bacteria found in the Deinococcales may utilize PQSs for gene regulatory purposes. Nature Publishing Group UK 2018-10-24 /pmc/articles/PMC6200779/ /pubmed/30356061 http://dx.doi.org/10.1038/s41598-018-33944-4 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Ding, Yun Fleming, Aaron M. Burrows, Cynthia J. Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title | Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title_full | Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title_fullStr | Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title_full_unstemmed | Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title_short | Case studies on potential G-quadruplex-forming sequences from the bacterial orders Deinococcales and Thermales derived from a survey of published genomes |
title_sort | case studies on potential g-quadruplex-forming sequences from the bacterial orders deinococcales and thermales derived from a survey of published genomes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6200779/ https://www.ncbi.nlm.nih.gov/pubmed/30356061 http://dx.doi.org/10.1038/s41598-018-33944-4 |
work_keys_str_mv | AT dingyun casestudiesonpotentialgquadruplexformingsequencesfromthebacterialordersdeinococcalesandthermalesderivedfromasurveyofpublishedgenomes AT flemingaaronm casestudiesonpotentialgquadruplexformingsequencesfromthebacterialordersdeinococcalesandthermalesderivedfromasurveyofpublishedgenomes AT burrowscynthiaj casestudiesonpotentialgquadruplexformingsequencesfromthebacterialordersdeinococcalesandthermalesderivedfromasurveyofpublishedgenomes |