Cargando…
Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes
BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RES...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014675/ https://www.ncbi.nlm.nih.gov/pubmed/32046778 http://dx.doi.org/10.1186/s40168-020-0794-3 |
_version_ | 1783496683624071168 |
---|---|
author | Ye, Lin Mei, Ran Liu, Wen-Tso Ren, Hongqiang Zhang, Xu-Xiang |
author_facet | Ye, Lin Mei, Ran Liu, Wen-Tso Ren, Hongqiang Zhang, Xu-Xiang |
author_sort | Ye, Lin |
collection | PubMed |
description | BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RESULTS: Here, we present 2045 archaeal and bacterial metagenome-assembled genomes (MAGs) recovered from 1.35 Tb of metagenomic data generated from 114 AS samples of 23 full-scale wastewater treatment plants (WWTPs). We found that the AS MAGs have obvious plant-specific features and that few proteins are shared by different WWTPs, especially for WWTPs located in geographically distant areas. Further, we developed a novel machine learning approach that can distinguish between AS MAGs and MAGs from other environments based on the clusters of orthologous groups of proteins with an accuracy of 96%. With the aid of machine learning, we also identified some functional features (e.g., functions related to aerobic metabolism, nutrient sensing/acquisition, and biofilm formation) that are likely vital for AS bacteria to adapt themselves in wastewater treatment bioreactors. CONCLUSIONS: Our work reveals that, although the bacterial species in different municipal WWTPs could be different, they may have similar deterministic functional features that allow them to adapt to the AS systems. Also, we provide valuable genome resources and a novel approach for future investigation and better understanding of the microbiome of AS and other ecosystems. |
format | Online Article Text |
id | pubmed-7014675 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-70146752020-02-18 Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes Ye, Lin Mei, Ran Liu, Wen-Tso Ren, Hongqiang Zhang, Xu-Xiang Microbiome Research BACKGROUND: Microorganisms in activated sludge (AS) play key roles in the wastewater treatment processes. However, their ecological behaviors and differences from microorganisms in other environments have mainly been studied using the 16S rRNA gene that may not truly represent in situ functions. RESULTS: Here, we present 2045 archaeal and bacterial metagenome-assembled genomes (MAGs) recovered from 1.35 Tb of metagenomic data generated from 114 AS samples of 23 full-scale wastewater treatment plants (WWTPs). We found that the AS MAGs have obvious plant-specific features and that few proteins are shared by different WWTPs, especially for WWTPs located in geographically distant areas. Further, we developed a novel machine learning approach that can distinguish between AS MAGs and MAGs from other environments based on the clusters of orthologous groups of proteins with an accuracy of 96%. With the aid of machine learning, we also identified some functional features (e.g., functions related to aerobic metabolism, nutrient sensing/acquisition, and biofilm formation) that are likely vital for AS bacteria to adapt themselves in wastewater treatment bioreactors. CONCLUSIONS: Our work reveals that, although the bacterial species in different municipal WWTPs could be different, they may have similar deterministic functional features that allow them to adapt to the AS systems. Also, we provide valuable genome resources and a novel approach for future investigation and better understanding of the microbiome of AS and other ecosystems. BioMed Central 2020-02-11 /pmc/articles/PMC7014675/ /pubmed/32046778 http://dx.doi.org/10.1186/s40168-020-0794-3 Text en © The Author(s). 2020 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Ye, Lin Mei, Ran Liu, Wen-Tso Ren, Hongqiang Zhang, Xu-Xiang Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title | Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title_full | Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title_fullStr | Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title_full_unstemmed | Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title_short | Machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
title_sort | machine learning-aided analyses of thousands of draft genomes reveal specific features of activated sludge processes |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7014675/ https://www.ncbi.nlm.nih.gov/pubmed/32046778 http://dx.doi.org/10.1186/s40168-020-0794-3 |
work_keys_str_mv | AT yelin machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses AT meiran machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses AT liuwentso machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses AT renhongqiang machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses AT zhangxuxiang machinelearningaidedanalysesofthousandsofdraftgenomesrevealspecificfeaturesofactivatedsludgeprocesses |