Cargando…

Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data

BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics...

Descripción completa

Detalles Bibliográficos
Autores principales: Krumsiek, Jan, Suhre, Karsten, Illig, Thomas, Adamski, Jerzy, Theis, Fabian J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224437/
https://www.ncbi.nlm.nih.gov/pubmed/21281499
http://dx.doi.org/10.1186/1752-0509-5-21
_version_ 1782217384052916224
author Krumsiek, Jan
Suhre, Karsten
Illig, Thomas
Adamski, Jerzy
Theis, Fabian J
author_facet Krumsiek, Jan
Suhre, Karsten
Illig, Thomas
Adamski, Jerzy
Theis, Fabian J
author_sort Krumsiek, Jan
collection PubMed
description BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics data, that is without the requirement for time-resolved measurements or specific system perturbations. Previous studies in this area mainly focused on Pearson correlation coefficients, which however are generally incapable of distinguishing between direct and indirect metabolic interactions. RESULTS: In our new approach we propose the application of a Gaussian graphical model (GGM), an undirected probabilistic graphical model estimating the conditional dependence between variables. GGMs are based on partial correlation coefficients, that is pairwise Pearson correlation coefficients conditioned against the correlation with all other metabolites. We first demonstrate the general validity of the method and its advantages over regular correlation networks with computer-simulated reaction systems. Then we estimate a GGM on data from a large human population cohort, covering 1020 fasting blood serum samples with 151 quantified metabolites. The GGM is much sparser than the correlation network, shows a modular structure with respect to metabolite classes, and is stable to the choice of samples in the data set. On the example of human fatty acid metabolism, we demonstrate for the first time that high partial correlation coefficients generally correspond to known metabolic reactions. This feature is evaluated both manually by investigating specific pairs of high-scoring metabolites, and then systematically on a literature-curated model of fatty acid synthesis and degradation. Our method detects many known reactions along with possibly novel pathway interactions, representing candidates for further experimental examination. CONCLUSIONS: In summary, we demonstrate strong signatures of intracellular pathways in blood serum data, and provide a valuable tool for the unbiased reconstruction of metabolic reactions from large-scale metabolomics data sets.
format Online
Article
Text
id pubmed-3224437
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32244372011-11-30 Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data Krumsiek, Jan Suhre, Karsten Illig, Thomas Adamski, Jerzy Theis, Fabian J BMC Syst Biol Research Article BACKGROUND: With the advent of high-throughput targeted metabolic profiling techniques, the question of how to interpret and analyze the resulting vast amount of data becomes more and more important. In this work we address the reconstruction of metabolic reactions from cross-sectional metabolomics data, that is without the requirement for time-resolved measurements or specific system perturbations. Previous studies in this area mainly focused on Pearson correlation coefficients, which however are generally incapable of distinguishing between direct and indirect metabolic interactions. RESULTS: In our new approach we propose the application of a Gaussian graphical model (GGM), an undirected probabilistic graphical model estimating the conditional dependence between variables. GGMs are based on partial correlation coefficients, that is pairwise Pearson correlation coefficients conditioned against the correlation with all other metabolites. We first demonstrate the general validity of the method and its advantages over regular correlation networks with computer-simulated reaction systems. Then we estimate a GGM on data from a large human population cohort, covering 1020 fasting blood serum samples with 151 quantified metabolites. The GGM is much sparser than the correlation network, shows a modular structure with respect to metabolite classes, and is stable to the choice of samples in the data set. On the example of human fatty acid metabolism, we demonstrate for the first time that high partial correlation coefficients generally correspond to known metabolic reactions. This feature is evaluated both manually by investigating specific pairs of high-scoring metabolites, and then systematically on a literature-curated model of fatty acid synthesis and degradation. Our method detects many known reactions along with possibly novel pathway interactions, representing candidates for further experimental examination. CONCLUSIONS: In summary, we demonstrate strong signatures of intracellular pathways in blood serum data, and provide a valuable tool for the unbiased reconstruction of metabolic reactions from large-scale metabolomics data sets. BioMed Central 2011-01-31 /pmc/articles/PMC3224437/ /pubmed/21281499 http://dx.doi.org/10.1186/1752-0509-5-21 Text en Copyright ©2011 Krumsiek et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Krumsiek, Jan
Suhre, Karsten
Illig, Thomas
Adamski, Jerzy
Theis, Fabian J
Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title_full Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title_fullStr Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title_full_unstemmed Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title_short Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
title_sort gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224437/
https://www.ncbi.nlm.nih.gov/pubmed/21281499
http://dx.doi.org/10.1186/1752-0509-5-21
work_keys_str_mv AT krumsiekjan gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata
AT suhrekarsten gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata
AT illigthomas gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata
AT adamskijerzy gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata
AT theisfabianj gaussiangraphicalmodelingreconstructspathwayreactionsfromhighthroughputmetabolomicsdata