Cargando…

A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale

Constraint-based (CB) metabolic models provide a mathematical framework and scaffold for in silico cell metabolism analysis and manipulation. In the past decade, significant efforts have been done to model human metabolism, enabled by the increased availability of multi-omics datasets and curated ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Vieira, Vítor, Ferreira, Jorge, Rocha, Miguel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9278738/
https://www.ncbi.nlm.nih.gov/pubmed/35749559
http://dx.doi.org/10.1371/journal.pcbi.1009294
_version_ 1784746248845131776
author Vieira, Vítor
Ferreira, Jorge
Rocha, Miguel
author_facet Vieira, Vítor
Ferreira, Jorge
Rocha, Miguel
author_sort Vieira, Vítor
collection PubMed
description Constraint-based (CB) metabolic models provide a mathematical framework and scaffold for in silico cell metabolism analysis and manipulation. In the past decade, significant efforts have been done to model human metabolism, enabled by the increased availability of multi-omics datasets and curated genome-scale reconstructions, as well as the development of several algorithms for context-specific model (CSM) reconstruction. Although CSM reconstruction has revealed insights on the deregulated metabolism of several pathologies, the process of reconstructing representative models of human tissues still lacks benchmarks and appropriate integrated software frameworks, since many tools required for this process are still disperse across various software platforms, some of which are proprietary. In this work, we address this challenge by assembling a scalable CSM reconstruction pipeline capable of integrating transcriptomics data in CB models. We combined omics preprocessing methods inspired by previous efforts with in-house implementations of existing CSM algorithms and new model refinement and validation routines, all implemented in the Troppo Python-based open-source framework. The pipeline was validated with multi-omics datasets from the Cancer Cell Line Encyclopedia (CCLE), also including reference fluxomics measurements for the MCF7 cell line. We reconstructed over 6000 models based on the Human-GEM template model for 733 cell lines featured in the CCLE, using MCF7 models as reference to find the best parameter combinations. These reference models outperform earlier studies using the same template by comparing gene essentiality and fluxomics experiments. We also analysed the heterogeneity of breast cancer cell lines, identifying key changes in metabolism related to cancer aggressiveness. Despite the many challenges in CB modelling, we demonstrate using our pipeline that combining transcriptomics data in metabolic models can be used to investigate key metabolic shifts. Significant limitations were found on these models ability for reliable quantitative flux prediction, thus motivating further work in genome-wide phenotype prediction.
format Online
Article
Text
id pubmed-9278738
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-92787382022-07-14 A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale Vieira, Vítor Ferreira, Jorge Rocha, Miguel PLoS Comput Biol Research Article Constraint-based (CB) metabolic models provide a mathematical framework and scaffold for in silico cell metabolism analysis and manipulation. In the past decade, significant efforts have been done to model human metabolism, enabled by the increased availability of multi-omics datasets and curated genome-scale reconstructions, as well as the development of several algorithms for context-specific model (CSM) reconstruction. Although CSM reconstruction has revealed insights on the deregulated metabolism of several pathologies, the process of reconstructing representative models of human tissues still lacks benchmarks and appropriate integrated software frameworks, since many tools required for this process are still disperse across various software platforms, some of which are proprietary. In this work, we address this challenge by assembling a scalable CSM reconstruction pipeline capable of integrating transcriptomics data in CB models. We combined omics preprocessing methods inspired by previous efforts with in-house implementations of existing CSM algorithms and new model refinement and validation routines, all implemented in the Troppo Python-based open-source framework. The pipeline was validated with multi-omics datasets from the Cancer Cell Line Encyclopedia (CCLE), also including reference fluxomics measurements for the MCF7 cell line. We reconstructed over 6000 models based on the Human-GEM template model for 733 cell lines featured in the CCLE, using MCF7 models as reference to find the best parameter combinations. These reference models outperform earlier studies using the same template by comparing gene essentiality and fluxomics experiments. We also analysed the heterogeneity of breast cancer cell lines, identifying key changes in metabolism related to cancer aggressiveness. Despite the many challenges in CB modelling, we demonstrate using our pipeline that combining transcriptomics data in metabolic models can be used to investigate key metabolic shifts. Significant limitations were found on these models ability for reliable quantitative flux prediction, thus motivating further work in genome-wide phenotype prediction. Public Library of Science 2022-06-24 /pmc/articles/PMC9278738/ /pubmed/35749559 http://dx.doi.org/10.1371/journal.pcbi.1009294 Text en © 2022 Vieira et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Vieira, Vítor
Ferreira, Jorge
Rocha, Miguel
A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title_full A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title_fullStr A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title_full_unstemmed A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title_short A pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
title_sort pipeline for the reconstruction and evaluation of context-specific human metabolic models at a large-scale
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9278738/
https://www.ncbi.nlm.nih.gov/pubmed/35749559
http://dx.doi.org/10.1371/journal.pcbi.1009294
work_keys_str_mv AT vieiravitor apipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale
AT ferreirajorge apipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale
AT rochamiguel apipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale
AT vieiravitor pipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale
AT ferreirajorge pipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale
AT rochamiguel pipelineforthereconstructionandevaluationofcontextspecifichumanmetabolicmodelsatalargescale