Cargando…
METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints o...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851854/ https://www.ncbi.nlm.nih.gov/pubmed/35172890 http://dx.doi.org/10.1186/s40168-021-01213-8 |
_version_ | 1784652912123707392 |
---|---|
author | Zhou, Zhichao Tran, Patricia Q. Breister, Adam M. Liu, Yang Kieft, Kristopher Cowley, Elise S. Karaoz, Ulas Anantharaman, Karthik |
author_facet | Zhou, Zhichao Tran, Patricia Q. Breister, Adam M. Liu, Yang Kieft, Kristopher Cowley, Elise S. Karaoz, Ulas Anantharaman, Karthik |
author_sort | Zhou, Zhichao |
collection | PubMed |
description | BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling. RESULTS: We present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~ 3 h with 40 CPU threads to process ~ 100 genomes and corresponding metagenomic reads within which the most compute-demanding part of hmmsearch takes ~ 45 min, while it takes ~ 5 h to complete hmmsearch for ~ 3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut. CONCLUSION: METABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 at https://github.com/AnantharamanLab/METABOLIC. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01213-8. |
format | Online Article Text |
id | pubmed-8851854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88518542022-02-22 METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks Zhou, Zhichao Tran, Patricia Q. Breister, Adam M. Liu, Yang Kieft, Kristopher Cowley, Elise S. Karaoz, Ulas Anantharaman, Karthik Microbiome Methodology BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling. RESULTS: We present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~ 3 h with 40 CPU threads to process ~ 100 genomes and corresponding metagenomic reads within which the most compute-demanding part of hmmsearch takes ~ 45 min, while it takes ~ 5 h to complete hmmsearch for ~ 3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut. CONCLUSION: METABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 at https://github.com/AnantharamanLab/METABOLIC. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01213-8. BioMed Central 2022-02-16 /pmc/articles/PMC8851854/ /pubmed/35172890 http://dx.doi.org/10.1186/s40168-021-01213-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Zhou, Zhichao Tran, Patricia Q. Breister, Adam M. Liu, Yang Kieft, Kristopher Cowley, Elise S. Karaoz, Ulas Anantharaman, Karthik METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title | METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title_full | METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title_fullStr | METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title_full_unstemmed | METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title_short | METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
title_sort | metabolic: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851854/ https://www.ncbi.nlm.nih.gov/pubmed/35172890 http://dx.doi.org/10.1186/s40168-021-01213-8 |
work_keys_str_mv | AT zhouzhichao metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT tranpatriciaq metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT breisteradamm metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT liuyang metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT kieftkristopher metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT cowleyelises metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT karaozulas metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks AT anantharamankarthik metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks |