Cargando…

Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes

BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RES...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Lin, Mei, Ran, Liu, Wen-Tso, Ren, Hongqiang, Zhang, Xu-Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014675/
https://www.ncbi.nlm.nih.gov/pubmed/32046778
http://dx.doi.org/10.1186/s40168-020-0794-3
_version_ 1783496683624071168
author Ye, Lin
Mei, Ran
Liu, Wen-Tso
Ren, Hongqiang
Zhang, Xu-Xiang
author_facet Ye, Lin
Mei, Ran
Liu, Wen-Tso
Ren, Hongqiang
Zhang, Xu-Xiang
author_sort Ye, Lin
collection PubMed
description BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RESULTS: Here, we present 2045 archaeal and bacterial metagenome-assembled genomes (MAGs) recovered from 1.35 Tb of metagenomic data generated from 114 AS samples of 23 full-scale wastewater treatment plants (WWTPs). We found that the AS MAGs have obvious plant-specific features and that few proteins are shared by different WWTPs, especially for WWTPs located in geographically distant areas. Further, we developed a novel machine learning approach that can distinguish between AS MAGs and MAGs from other environments based on the clusters of orthologous groups of proteins with an accuracy of 96%. With the aid of machine learning, we also identified some functional features (e.g., functions related to aerobic metabolism, nutrient sensing/acquisition, and biofilm formation) that are likely vital for AS bacteria to adapt themselves in wastewater treatment bioreactors. CONCLUSIONS: Our work reveals that, although the bacterial species in different municipal WWTPs could be different, they may have similar deterministic functional features that allow them to adapt to the AS systems. Also, we provide valuable genome resources and a novel approach for future investigation and better understanding of the microbiome of AS and other ecosystems.
format Online
Article
Text
id pubmed-7014675
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70146752020-02-18 Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes Ye, Lin Mei, Ran Liu, Wen-Tso Ren, Hongqiang Zhang, Xu-Xiang Microbiome Research BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RESULTS: Here, we present 2045 archaeal and bacterial metagenome-assembled genomes (MAGs) recovered from 1.35 Tb of metagenomic data generated from 114 AS samples of 23 full-scale wastewater treatment plants (WWTPs). We found that the AS MAGs have obvious plant-specific features and that few proteins are shared by different WWTPs, especially for WWTPs located in geographically distant areas. Further, we developed a novel machine learning approach that can distinguish between AS MAGs and MAGs from other environments based on the clusters of orthologous groups of proteins with an accuracy of 96%. With the aid of machine learning, we also identified some functional features (e.g., functions related to aerobic metabolism, nutrient sensing/acquisition, and biofilm formation) that are likely vital for AS bacteria to adapt themselves in wastewater treatment bioreactors. CONCLUSIONS: Our work reveals that, although the bacterial species in different municipal WWTPs could be different, they may have similar deterministic functional features that allow them to adapt to the AS systems. Also, we provide valuable genome resources and a novel approach for future investigation and better understanding of the microbiome of AS and other ecosystems. BioMed Central 2020-02-11 /pmc/articles/PMC7014675/ /pubmed/32046778 http://dx.doi.org/10.1186/s40168-020-0794-3 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ye, Lin
Mei, Ran
Liu, Wen-Tso
Ren, Hongqiang
Zhang, Xu-Xiang
Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title_full Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title_fullStr Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title_full_unstemmed Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title_short Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
title_sort machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014675/
https://www.ncbi.nlm.nih.gov/pubmed/32046778
http://dx.doi.org/10.1186/s40168-020-0794-3
work_keys_str_mv AT yelin machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses
AT meiran machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses
AT liuwentso machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses
AT renhongqiang machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses
AT zhangxuxiang machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses