Cargando…

Environment and taxonomy shape the genomic signature of prokaryotic extremophiles

This study provides comprehensive quantitative evidence suggesting that adaptations to extreme temperatures and pH imprint a discernible environmental component in the genomic signature of microbial extremophiles. Both supervised and unsupervised machine learning algorithms were used to analyze geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Arias, Pablo Millán, Butler, Joseph, Randhawa, Gurjit S., Soltysiak, Maximillian P. M., Hill, Kathleen A., Kari, Lila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522608/
https://www.ncbi.nlm.nih.gov/pubmed/37752120
http://dx.doi.org/10.1038/s41598-023-42518-y
_version_ 1785110388622229504
author Arias, Pablo Millán
Butler, Joseph
Randhawa, Gurjit S.
Soltysiak, Maximillian P. M.
Hill, Kathleen A.
Kari, Lila
author_facet Arias, Pablo Millán
Butler, Joseph
Randhawa, Gurjit S.
Soltysiak, Maximillian P. M.
Hill, Kathleen A.
Kari, Lila
author_sort Arias, Pablo Millán
collection PubMed
description This study provides comprehensive quantitative evidence suggesting that adaptations to extreme temperatures and pH imprint a discernible environmental component in the genomic signature of microbial extremophiles. Both supervised and unsupervised machine learning algorithms were used to analyze genomic signatures, each computed as the k-mer frequency vector of a 500 kbp DNA fragment arbitrarily selected to represent a genome. Computational experiments classified/clustered genomic signatures extracted from a curated dataset of [Formula: see text] extremophile (temperature, pH) bacteria and archaea genomes, at multiple scales of analysis, [Formula: see text] . The supervised learning resulted in high accuracies for taxonomic classifications at [Formula: see text] , and medium to medium-high accuracies for environment category classifications of the same datasets at [Formula: see text] . For [Formula: see text] , our findings were largely consistent with amino acid compositional biases and codon usage patterns in coding regions, previously attributed to extreme environment adaptations. The unsupervised learning of unlabelled sequences identified several exemplars of hyperthermophilic organisms with large similarities in their genomic signatures, in spite of belonging to different domains in the Tree of Life.
format Online
Article
Text
id pubmed-10522608
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105226082023-09-28 Environment and taxonomy shape the genomic signature of prokaryotic extremophiles Arias, Pablo Millán Butler, Joseph Randhawa, Gurjit S. Soltysiak, Maximillian P. M. Hill, Kathleen A. Kari, Lila Sci Rep Article This study provides comprehensive quantitative evidence suggesting that adaptations to extreme temperatures and pH imprint a discernible environmental component in the genomic signature of microbial extremophiles. Both supervised and unsupervised machine learning algorithms were used to analyze genomic signatures, each computed as the k-mer frequency vector of a 500 kbp DNA fragment arbitrarily selected to represent a genome. Computational experiments classified/clustered genomic signatures extracted from a curated dataset of [Formula: see text] extremophile (temperature, pH) bacteria and archaea genomes, at multiple scales of analysis, [Formula: see text] . The supervised learning resulted in high accuracies for taxonomic classifications at [Formula: see text] , and medium to medium-high accuracies for environment category classifications of the same datasets at [Formula: see text] . For [Formula: see text] , our findings were largely consistent with amino acid compositional biases and codon usage patterns in coding regions, previously attributed to extreme environment adaptations. The unsupervised learning of unlabelled sequences identified several exemplars of hyperthermophilic organisms with large similarities in their genomic signatures, in spite of belonging to different domains in the Tree of Life. Nature Publishing Group UK 2023-09-26 /pmc/articles/PMC10522608/ /pubmed/37752120 http://dx.doi.org/10.1038/s41598-023-42518-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Arias, Pablo Millán
Butler, Joseph
Randhawa, Gurjit S.
Soltysiak, Maximillian P. M.
Hill, Kathleen A.
Kari, Lila
Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title_full Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title_fullStr Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title_full_unstemmed Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title_short Environment and taxonomy shape the genomic signature of prokaryotic extremophiles
title_sort environment and taxonomy shape the genomic signature of prokaryotic extremophiles
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10522608/
https://www.ncbi.nlm.nih.gov/pubmed/37752120
http://dx.doi.org/10.1038/s41598-023-42518-y
work_keys_str_mv AT ariaspablomillan environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles
AT butlerjoseph environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles
AT randhawagurjits environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles
AT soltysiakmaximillianpm environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles
AT hillkathleena environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles
AT karilila environmentandtaxonomyshapethegenomicsignatureofprokaryoticextremophiles