Cargando…

Identification of metabolic network models from incomplete high-throughput datasets

Motivation: High-throughput measurement techniques for metabolism and gene expression provide a wealth of information for the identification of metabolic network models. Yet, missing observations scattered over the dataset restrict the number of effectively available datapoints and make classical re...

Descripción completa

Detalles Bibliográficos
Autores principales:	Berthoumieux, Sara, Brilli, Matteo, de Jong, Hidde, Kahn, Daniel, Cinquemani, Eugenio
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2011
Materias:	Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117355/ https://www.ncbi.nlm.nih.gov/pubmed/21685069 http://dx.doi.org/10.1093/bioinformatics/btr225

_version_	1782206321436655616
author	Berthoumieux, Sara Brilli, Matteo de Jong, Hidde Kahn, Daniel Cinquemani, Eugenio
author_facet	Berthoumieux, Sara Brilli, Matteo de Jong, Hidde Kahn, Daniel Cinquemani, Eugenio
author_sort	Berthoumieux, Sara
collection	PubMed
description	Motivation: High-throughput measurement techniques for metabolism and gene expression provide a wealth of information for the identification of metabolic network models. Yet, missing observations scattered over the dataset restrict the number of effectively available datapoints and make classical regression techniques inaccurate or inapplicable. Thorough exploitation of the data by identification techniques that explicitly cope with missing observations is therefore of major importance. Results: We develop a maximum-likelihood approach for the estimation of unknown parameters of metabolic network models that relies on the integration of statistical priors to compensate for the missing data. In the context of the linlog metabolic modeling framework, we implement the identification method by an Expectation-Maximization (EM) algorithm and by a simpler direct numerical optimization method. We evaluate performance of our methods by comparison to existing approaches, and show that our EM method provides the best results over a variety of simulated scenarios. We then apply the EM algorithm to a real problem, the identification of a model for the Escherichia coli central carbon metabolism, based on challenging experimental data from the literature. This leads to promising results and allows us to highlight critical identification issues. Contact: sara.berthoumieux@inria.fr; eugenio.cinquemani@inria.fr Supplementary information: Supplementary data are available at Bioinformatics online.
format	Online Article Text
id	pubmed-3117355
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-31173552011-06-17 Identification of metabolic network models from incomplete high-throughput datasets Berthoumieux, Sara Brilli, Matteo de Jong, Hidde Kahn, Daniel Cinquemani, Eugenio Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Motivation: High-throughput measurement techniques for metabolism and gene expression provide a wealth of information for the identification of metabolic network models. Yet, missing observations scattered over the dataset restrict the number of effectively available datapoints and make classical regression techniques inaccurate or inapplicable. Thorough exploitation of the data by identification techniques that explicitly cope with missing observations is therefore of major importance. Results: We develop a maximum-likelihood approach for the estimation of unknown parameters of metabolic network models that relies on the integration of statistical priors to compensate for the missing data. In the context of the linlog metabolic modeling framework, we implement the identification method by an Expectation-Maximization (EM) algorithm and by a simpler direct numerical optimization method. We evaluate performance of our methods by comparison to existing approaches, and show that our EM method provides the best results over a variety of simulated scenarios. We then apply the EM algorithm to a real problem, the identification of a model for the Escherichia coli central carbon metabolism, based on challenging experimental data from the literature. This leads to promising results and allows us to highlight critical identification issues. Contact: sara.berthoumieux@inria.fr; eugenio.cinquemani@inria.fr Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117355/ /pubmed/21685069 http://dx.doi.org/10.1093/bioinformatics/btr225 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Berthoumieux, Sara Brilli, Matteo de Jong, Hidde Kahn, Daniel Cinquemani, Eugenio Identification of metabolic network models from incomplete high-throughput datasets
title	Identification of metabolic network models from incomplete high-throughput datasets
title_full	Identification of metabolic network models from incomplete high-throughput datasets
title_fullStr	Identification of metabolic network models from incomplete high-throughput datasets
title_full_unstemmed	Identification of metabolic network models from incomplete high-throughput datasets
title_short	Identification of metabolic network models from incomplete high-throughput datasets
title_sort	identification of metabolic network models from incomplete high-throughput datasets
topic	Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117355/ https://www.ncbi.nlm.nih.gov/pubmed/21685069 http://dx.doi.org/10.1093/bioinformatics/btr225
work_keys_str_mv	AT berthoumieuxsara identificationofmetabolicnetworkmodelsfromincompletehighthroughputdatasets AT brillimatteo identificationofmetabolicnetworkmodelsfromincompletehighthroughputdatasets AT dejonghidde identificationofmetabolicnetworkmodelsfromincompletehighthroughputdatasets AT kahndaniel identificationofmetabolicnetworkmodelsfromincompletehighthroughputdatasets AT cinquemanieugenio identificationofmetabolicnetworkmodelsfromincompletehighthroughputdatasets

Identification of metabolic network models from incomplete high-throughput datasets

Ejemplares similares