Cargando…
Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224437/ https://www.ncbi.nlm.nih.gov/pubmed/21281499 http://dx.doi.org/10.1186/1752-0509-5-21 |
_version_ | 1782217384052916224 |
---|---|
author | Krumsiek, Jan Suhre, Karsten Illig, Thomas Adamski, Jerzy Theis, Fabian J |
author_facet | Krumsiek, Jan Suhre, Karsten Illig, Thomas Adamski, Jerzy Theis, Fabian J |
author_sort | Krumsiek, Jan |
collection | PubMed |
description | BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics data, that is without the requirement for time-resolved measurements or specific system perturbations. Previous studies in this area mainly focused on Pearson correlation coefficients, which however are generally incapable of distinguishing between direct and indirect metabolic interactions. RESULTS: In our new approach we propose the application of a Gaussian graphical model (GGM), an undirected probabilistic graphical model estimating the conditional dependence between variables. GGMs are based on partial correlation coefficients, that is pairwise Pearson correlation coefficients conditioned against the correlation with all other metabolites. We first demonstrate the general validity of the method and its advantages over regular correlation networks with computer-simulated reaction systems. Then we estimate a GGM on data from a large human population cohort, covering 1020 fasting blood serum samples with 151 quantified metabolites. The GGM is much sparser than the correlation network, shows a modular structure with respect to metabolite classes, and is stable to the choice of samples in the data set. On the example of human fatty acid metabolism, we demonstrate for the first time that high partial correlation coefficients generally correspond to known metabolic reactions. This feature is evaluated both manually by investigating specific pairs of high-scoring metabolites, and then systematically on a literature-curated model of fatty acid synthesis and degradation. Our method detects many known reactions along with possibly novel pathway interactions, representing candidates for further experimental examination. CONCLUSIONS: In summary, we demonstrate strong signatures of intracellular pathways in blood serum data, and provide a valuable tool for the unbiased reconstruction of metabolic reactions from large-scale metabolomics data sets. |
format | Online Article Text |
id | pubmed-3224437 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32244372011-11-30 Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data Krumsiek, Jan Suhre, Karsten Illig, Thomas Adamski, Jerzy Theis, Fabian J BMC Syst Biol Research Article BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics data, that is without the requirement for time-resolved measurements or specific system perturbations. Previous studies in this area mainly focused on Pearson correlation coefficients, which however are generally incapable of distinguishing between direct and indirect metabolic interactions. RESULTS: In our new approach we propose the application of a Gaussian graphical model (GGM), an undirected probabilistic graphical model estimating the conditional dependence between variables. GGMs are based on partial correlation coefficients, that is pairwise Pearson correlation coefficients conditioned against the correlation with all other metabolites. We first demonstrate the general validity of the method and its advantages over regular correlation networks with computer-simulated reaction systems. Then we estimate a GGM on data from a large human population cohort, covering 1020 fasting blood serum samples with 151 quantified metabolites. The GGM is much sparser than the correlation network, shows a modular structure with respect to metabolite classes, and is stable to the choice of samples in the data set. On the example of human fatty acid metabolism, we demonstrate for the first time that high partial correlation coefficients generally correspond to known metabolic reactions. This feature is evaluated both manually by investigating specific pairs of high-scoring metabolites, and then systematically on a literature-curated model of fatty acid synthesis and degradation. Our method detects many known reactions along with possibly novel pathway interactions, representing candidates for further experimental examination. CONCLUSIONS: In summary, we demonstrate strong signatures of intracellular pathways in blood serum data, and provide a valuable tool for the unbiased reconstruction of metabolic reactions from large-scale metabolomics data sets. BioMed Central 2011-01-31 /pmc/articles/PMC3224437/ /pubmed/21281499 http://dx.doi.org/10.1186/1752-0509-5-21 Text en Copyright ©2011 Krumsiek et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Krumsiek, Jan Suhre, Karsten Illig, Thomas Adamski, Jerzy Theis, Fabian J Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title | Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title_full | Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title_fullStr | Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title_full_unstemmed | Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title_short | Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
title_sort | gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224437/ https://www.ncbi.nlm.nih.gov/pubmed/21281499 http://dx.doi.org/10.1186/1752-0509-5-21 |
work_keys_str_mv | AT krumsiekjan gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata AT suhrekarsten gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata AT illigthomas gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata AT adamskijerzy gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata AT theisfabianj gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata |