Cargando…

METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks

BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints o...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Zhichao, Tran, Patricia Q., Breister, Adam M., Liu, Yang, Kieft, Kristopher, Cowley, Elise S., Karaoz, Ulas, Anantharaman, Karthik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851854/
https://www.ncbi.nlm.nih.gov/pubmed/35172890
http://dx.doi.org/10.1186/s40168-021-01213-8
_version_ 1784652912123707392
author Zhou, Zhichao
Tran, Patricia Q.
Breister, Adam M.
Liu, Yang
Kieft, Kristopher
Cowley, Elise S.
Karaoz, Ulas
Anantharaman, Karthik
author_facet Zhou, Zhichao
Tran, Patricia Q.
Breister, Adam M.
Liu, Yang
Kieft, Kristopher
Cowley, Elise S.
Karaoz, Ulas
Anantharaman, Karthik
author_sort Zhou, Zhichao
collection PubMed
description BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling. RESULTS: We present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~ 3 h with 40 CPU threads to process ~ 100 genomes and corresponding metagenomic reads within which the most compute-demanding part of hmmsearch takes ~ 45 min, while it takes ~ 5 h to complete hmmsearch for ~ 3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut. CONCLUSION: METABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 at https://github.com/AnantharamanLab/METABOLIC. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01213-8.
format Online
Article
Text
id pubmed-8851854
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88518542022-02-22 METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks Zhou, Zhichao Tran, Patricia Q. Breister, Adam M. Liu, Yang Kieft, Kristopher Cowley, Elise S. Karaoz, Ulas Anantharaman, Karthik Microbiome Methodology BACKGROUND: Advances in microbiome science are being driven in large part due to our ability to study and infer microbial ecology from genomes reconstructed from mixed microbial communities using metagenomics and single-cell genomics. Such omics-based techniques allow us to read genomic blueprints of microorganisms, decipher their functional capacities and activities, and reconstruct their roles in biogeochemical processes. Currently available tools for analyses of genomic data can annotate and depict metabolic functions to some extent; however, no standardized approaches are currently available for the comprehensive characterization of metabolic predictions, metabolite exchanges, microbial interactions, and microbial contributions to biogeochemical cycling. RESULTS: We present METABOLIC (METabolic And BiogeOchemistry anaLyses In miCrobes), a scalable software to advance microbial ecology and biogeochemistry studies using genomes at the resolution of individual organisms and/or microbial communities. The genome-scale workflow includes annotation of microbial genomes, motif validation of biochemically validated conserved protein residues, metabolic pathway analyses, and calculation of contributions to individual biogeochemical transformations and cycles. The community-scale workflow supplements genome-scale analyses with determination of genome abundance in the microbiome, potential microbial metabolic handoffs and metabolite exchange, reconstruction of functional networks, and determination of microbial contributions to biogeochemical cycles. METABOLIC can take input genomes from isolates, metagenome-assembled genomes, or single-cell genomes. Results are presented in the form of tables for metabolism and a variety of visualizations including biogeochemical cycling potential, representation of sequential metabolic transformations, community-scale microbial functional networks using a newly defined metric “MW-score” (metabolic weight score), and metabolic Sankey diagrams. METABOLIC takes ~ 3 h with 40 CPU threads to process ~ 100 genomes and corresponding metagenomic reads within which the most compute-demanding part of hmmsearch takes ~ 45 min, while it takes ~ 5 h to complete hmmsearch for ~ 3600 genomes. Tests of accuracy, robustness, and consistency suggest METABOLIC provides better performance compared to other software and online servers. To highlight the utility and versatility of METABOLIC, we demonstrate its capabilities on diverse metagenomic datasets from the marine subsurface, terrestrial subsurface, meadow soil, deep sea, freshwater lakes, wastewater, and the human gut. CONCLUSION: METABOLIC enables the consistent and reproducible study of microbial community ecology and biogeochemistry using a foundation of genome-informed microbial metabolism, and will advance the integration of uncultivated organisms into metabolic and biogeochemical models. METABOLIC is written in Perl and R and is freely available under GPLv3 at https://github.com/AnantharamanLab/METABOLIC. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01213-8. BioMed Central 2022-02-16 /pmc/articles/PMC8851854/ /pubmed/35172890 http://dx.doi.org/10.1186/s40168-021-01213-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Zhou, Zhichao
Tran, Patricia Q.
Breister, Adam M.
Liu, Yang
Kieft, Kristopher
Cowley, Elise S.
Karaoz, Ulas
Anantharaman, Karthik
METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title_full METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title_fullStr METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title_full_unstemmed METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title_short METABOLIC: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
title_sort metabolic: high-throughput profiling of microbial genomes for functional traits, metabolism, biogeochemistry, and community-scale functional networks
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8851854/
https://www.ncbi.nlm.nih.gov/pubmed/35172890
http://dx.doi.org/10.1186/s40168-021-01213-8
work_keys_str_mv AT zhouzhichao metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT tranpatriciaq metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT breisteradamm metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT liuyang metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT kieftkristopher metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT cowleyelises metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT karaozulas metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks
AT anantharamankarthik metabolichighthroughputprofilingofmicrobialgenomesforfunctionaltraitsmetabolismbiogeochemistryandcommunityscalefunctionalnetworks