Cargando…

Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota

Environmental factors affect the growth of microorganisms and therefore alter the composition of microbiota. Correlative analysis of the relationship between metagenomic composition and the environmental gradient can help elucidate key environmental factors and establishment principles for microbial...

Descripción completa

Detalles Bibliográficos
Autores principales: Satoh, Soichirou, Tanaka, Rei, Yokono, Makio, Endoh, Daiji, Yabuki, Tetsuo, Tanaka, Ayumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894459/
https://www.ncbi.nlm.nih.gov/pubmed/36730456
http://dx.doi.org/10.1371/journal.pone.0281288
_version_ 1784881744877453312
author Satoh, Soichirou
Tanaka, Rei
Yokono, Makio
Endoh, Daiji
Yabuki, Tetsuo
Tanaka, Ayumi
author_facet Satoh, Soichirou
Tanaka, Rei
Yokono, Makio
Endoh, Daiji
Yabuki, Tetsuo
Tanaka, Ayumi
author_sort Satoh, Soichirou
collection PubMed
description Environmental factors affect the growth of microorganisms and therefore alter the composition of microbiota. Correlative analysis of the relationship between metagenomic composition and the environmental gradient can help elucidate key environmental factors and establishment principles for microbial communities. However, a reasonable method to quantitatively compare whole metagenomic data and identify the primary environmental factors for the establishment of microbiota has not been reported so far. In this study, we developed a method to compare whole proteomes deduced from metagenomic shotgun sequencing data, and quantitatively display their phylogenetic relationships as metagenomic trees. We called this method Metagenomic Phylogeny by Average Sequence Similarity (MPASS). We also compared one of the metagenomic trees with dendrograms of environmental factors using a comparison tool for phylogenetic trees. The MPASS method correctly constructed metagenomic trees of simulated metagenomes and soil and water samples. The topology of the metagenomic tree of samples from the Kirishima hot springs area in Japan was highly similarity to that of the dendrograms based on previously reported environmental factors for this area. The topology of the metagenomic tree also reflected the dynamics of microbiota at the taxonomic and functional levels. Our results strongly suggest that MPASS can successfully classify metagenomic shotgun sequencing data based on the similarity of whole protein-coding sequences, and will be useful for the identification of principal environmental factors for the establishment of microbial communities. Custom Perl script for the MPASS pipeline is available at https://github.com/s0sat/MPASS.
format Online
Article
Text
id pubmed-9894459
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-98944592023-02-03 Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota Satoh, Soichirou Tanaka, Rei Yokono, Makio Endoh, Daiji Yabuki, Tetsuo Tanaka, Ayumi PLoS One Research Article Environmental factors affect the growth of microorganisms and therefore alter the composition of microbiota. Correlative analysis of the relationship between metagenomic composition and the environmental gradient can help elucidate key environmental factors and establishment principles for microbial communities. However, a reasonable method to quantitatively compare whole metagenomic data and identify the primary environmental factors for the establishment of microbiota has not been reported so far. In this study, we developed a method to compare whole proteomes deduced from metagenomic shotgun sequencing data, and quantitatively display their phylogenetic relationships as metagenomic trees. We called this method Metagenomic Phylogeny by Average Sequence Similarity (MPASS). We also compared one of the metagenomic trees with dendrograms of environmental factors using a comparison tool for phylogenetic trees. The MPASS method correctly constructed metagenomic trees of simulated metagenomes and soil and water samples. The topology of the metagenomic tree of samples from the Kirishima hot springs area in Japan was highly similarity to that of the dendrograms based on previously reported environmental factors for this area. The topology of the metagenomic tree also reflected the dynamics of microbiota at the taxonomic and functional levels. Our results strongly suggest that MPASS can successfully classify metagenomic shotgun sequencing data based on the similarity of whole protein-coding sequences, and will be useful for the identification of principal environmental factors for the establishment of microbial communities. Custom Perl script for the MPASS pipeline is available at https://github.com/s0sat/MPASS. Public Library of Science 2023-02-02 /pmc/articles/PMC9894459/ /pubmed/36730456 http://dx.doi.org/10.1371/journal.pone.0281288 Text en © 2023 Satoh et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Satoh, Soichirou
Tanaka, Rei
Yokono, Makio
Endoh, Daiji
Yabuki, Tetsuo
Tanaka, Ayumi
Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title_full Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title_fullStr Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title_full_unstemmed Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title_short Phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
title_sort phylogeny analysis of whole protein-coding genes in metagenomic data detected an environmental gradient for the microbiota
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9894459/
https://www.ncbi.nlm.nih.gov/pubmed/36730456
http://dx.doi.org/10.1371/journal.pone.0281288
work_keys_str_mv AT satohsoichirou phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota
AT tanakarei phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota
AT yokonomakio phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota
AT endohdaiji phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota
AT yabukitetsuo phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota
AT tanakaayumi phylogenyanalysisofwholeproteincodinggenesinmetagenomicdatadetectedanenvironmentalgradientforthemicrobiota