MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets
BACKGROUND: Enabled by rapid advances in sequencing technology, metagenomic studies aim to characterize entire communities of microbes bypassing the need for culturing individual bacterial members. One major goal of metagenomic studies is to identify specific functional adaptations of microbial comm...
Main authors: | Liu, Bo, Pop, Mihai |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2011 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3090767/ https://www.ncbi.nlm.nih.gov/pubmed/21554767 http://dx.doi.org/10.1186/1753-6561-5-S2-S9 |
_version_ | 1782203176851603456 |
---|---|
author | Liu, Bo Pop, Mihai |
author_facet | Liu, Bo Pop, Mihai |
author_sort | Liu, Bo |
collection | PubMed |
description | BACKGROUND: Enabled by rapid advances in sequencing technology, metagenomic studies aim to characterize entire communities of microbes bypassing the need for culturing individual bacterial members. One major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile and the abundances for a sample can be estimated by mapping metagenomic sequences to the global metabolic network consisting of thousands of molecular reactions. Here we describe a powerful analytical method (MetaPath) that can identify differentially abundant pathways in metagenomic datasets, relying on a combination of metagenomic sequence data and prior metabolic pathway knowledge. METHODS: First, we introduce a scoring function for an arbitrary subnetwork and find the max-weight subnetwork in the global network by a greedy search algorithm. Then we compute two p values (p(abund) and p(struct)) using nonparametric approaches to answer two different statistical questions: (1) Is this subnetwork differentially abundant? (2) What is the probability of finding such good subnetworks by chance given the data and network structure? Finally, significant metabolic subnetworks are discovered based on these two p values. RESULTS: In order to validate our methods, we have designed a simulated metabolic pathways dataset and show that MetaPath outperforms other commonly used approaches. We also demonstrate the power of our methods in analyzing two publicly available metagenomic datasets, and show that the subnetworks identified by MetaPath provide valuable insights into the biological activities of the microbiome. CONCLUSIONS: We have introduced a statistical method for finding significant metabolic subnetworks from metagenomic datasets.
Compared with previous methods, results from MetaPath are more robust against noise in the data, and have significantly higher sensitivity and specificity (when tested on simulated datasets). When applied to two publicly available metagenomic datasets, the output of MetaPath is consistent with previous observations and also provides several new insights into the metabolic activity of the gut microbiome. The software is freely available at http://metapath.cbcb.umd.edu. |
format | Text |
id | pubmed-3090767 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3090767 2011-05-28 MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets Liu, Bo Pop, Mihai BMC Proc Proceedings BACKGROUND: Enabled by rapid advances in sequencing technology, metagenomic studies aim to characterize entire communities of microbes bypassing the need for culturing individual bacterial members. One major goal of metagenomic studies is to identify specific functional adaptations of microbial communities to their habitats. The functional profile and the abundances for a sample can be estimated by mapping metagenomic sequences to the global metabolic network consisting of thousands of molecular reactions. Here we describe a powerful analytical method (MetaPath) that can identify differentially abundant pathways in metagenomic datasets, relying on a combination of metagenomic sequence data and prior metabolic pathway knowledge. METHODS: First, we introduce a scoring function for an arbitrary subnetwork and find the max-weight subnetwork in the global network by a greedy search algorithm. Then we compute two p values (p(abund) and p(struct)) using nonparametric approaches to answer two different statistical questions: (1) Is this subnetwork differentially abundant? (2) What is the probability of finding such good subnetworks by chance given the data and network structure? Finally, significant metabolic subnetworks are discovered based on these two p values. RESULTS: In order to validate our methods, we have designed a simulated metabolic pathways dataset and show that MetaPath outperforms other commonly used approaches. We also demonstrate the power of our methods in analyzing two publicly available metagenomic datasets, and show that the subnetworks identified by MetaPath provide valuable insights into the biological activities of the microbiome. CONCLUSIONS: We have introduced a statistical method for finding significant metabolic subnetworks from metagenomic datasets.
Compared with previous methods, results from MetaPath are more robust against noise in the data, and have significantly higher sensitivity and specificity (when tested on simulated datasets). When applied to two publicly available metagenomic datasets, the output of MetaPath is consistent with previous observations and also provides several new insights into the metabolic activity of the gut microbiome. The software is freely available at http://metapath.cbcb.umd.edu. BioMed Central 2011-05-28 /pmc/articles/PMC3090767/ /pubmed/21554767 http://dx.doi.org/10.1186/1753-6561-5-S2-S9 Text en Copyright ©2011 Liu and Pop; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Liu, Bo Pop, Mihai MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title | MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title_full | MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title_fullStr | MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title_full_unstemmed | MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title_short | MetaPath: identifying differentially abundant metabolic pathways in metagenomic datasets |
title_sort | metapath: identifying differentially abundant metabolic pathways in metagenomic datasets |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3090767/ https://www.ncbi.nlm.nih.gov/pubmed/21554767 http://dx.doi.org/10.1186/1753-6561-5-S2-S9 |
work_keys_str_mv | AT liubo metapathidentifyingdifferentiallyabundantmetabolicpathwaysinmetagenomicdatasets AT popmihai metapathidentifyingdifferentiallyabundantmetabolicpathwaysinmetagenomicdatasets |
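
Illustrative note on the METHODS summary in the description field: the abstract mentions a greedy search for a max-weight subnetwork and a nonparametric p value for network structure, but this record does not give the scoring function or the permutation scheme. The Python sketch below is therefore only a generic illustration under stated assumptions, not MetaPath's implementation: node weights are assumed to be precomputed per-reaction differential-abundance scores, a subnetwork's score is assumed to be the plain sum of its node weights, and the structure test is assumed to shuffle weights across nodes. All function names and the toy network are hypothetical.

```python
import random
from typing import Dict, Optional, Set, Tuple

Graph = Dict[str, Set[str]]  # undirected adjacency: node -> neighbors


def grow_from_seed(graph: Graph, weights: Dict[str, float],
                   seed: str) -> Tuple[Set[str], float]:
    """Greedily grow a connected subnetwork from one seed node.

    At each step the heaviest neighboring node is added; growth stops
    when no neighbor would increase the total weight.
    """
    selected = {seed}
    total = weights[seed]
    while True:
        frontier = {n for v in selected for n in graph[v]} - selected
        if not frontier:
            break
        # Tie-break on node name so the result is deterministic.
        best = max(frontier, key=lambda n: (weights[n], n))
        if weights[best] <= 0:
            break
        selected.add(best)
        total += weights[best]
    return selected, total


def best_subnetwork(graph: Graph,
                    weights: Dict[str, float]) -> Tuple[Set[str], float]:
    """Try every node as a seed and keep the highest-scoring subnetwork."""
    best_nodes: Set[str] = set()
    best_score = float("-inf")
    for seed in sorted(graph):
        nodes, score = grow_from_seed(graph, weights, seed)
        if score > best_score:
            best_nodes, best_score = nodes, score
    return best_nodes, best_score


def structure_p_value(graph: Graph, weights: Dict[str, float],
                      n_perm: int = 200,
                      rng: Optional[random.Random] = None) -> float:
    """Permutation estimate of how often shuffled weights score as well.

    Assumption: the null model shuffles weights across nodes while
    keeping the network topology fixed.
    """
    rng = rng or random.Random(0)
    _, observed = best_subnetwork(graph, weights)
    nodes = sorted(graph)
    values = [weights[n] for n in nodes]
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(values)
        _, score = best_subnetwork(graph, dict(zip(nodes, values)))
        if score >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical toy network: a five-node chain a-b-c-d-e.
toy_graph: Graph = {
    "a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"},
    "d": {"c", "e"}, "e": {"d"},
}
toy_weights = {"a": 2.0, "b": 1.5, "c": -1.0, "d": 3.0, "e": -2.0}

if __name__ == "__main__":
    print(best_subnetwork(toy_graph, toy_weights))
    print(structure_p_value(toy_graph, toy_weights))
```

On the toy chain, the search only bridges the negative node `c` when `c` itself is the seed, which shows why a greedy search of this kind is a heuristic rather than an exact optimizer.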