Cargando…

Evaluation of reaction gap-filling accuracy by randomization

BACKGROUND: Completion of genome-scale flux-balance models using computational reaction gap-filling is a widely used approach, but its accuracy is not well known. RESULTS: We report on computational experiments of reaction gap filling in which we generated degraded versions of the EcoCyc-20.0-GEM mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Latendresse, Mario, Karp, Peter D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813426/
https://www.ncbi.nlm.nih.gov/pubmed/29444634
http://dx.doi.org/10.1186/s12859-018-2050-4
_version_ 1783300194095333376
author Latendresse, Mario
Karp, Peter D.
author_facet Latendresse, Mario
Karp, Peter D.
author_sort Latendresse, Mario
collection PubMed
description BACKGROUND: Completion of genome-scale flux-balance models using computational reaction gap-filling is a widely used approach, but its accuracy is not well known. RESULTS: We report on computational experiments of reaction gap filling in which we generated degraded versions of the EcoCyc-20.0-GEM model by randomly removing flux-carrying reactions from a growing model. We gap-filled the degraded models and compared the resulting gap-filled models with the original model. Gap-filling was performed by the Pathway Tools MetaFlux software using its General Development Mode (GenDev) and its Fast Development Mode (FastDev). We explored 12 GenDev variants including two linear solvers (SCIP and CPLEX) for solving the Mixed Integer Linear Programming (MILP) problems for gap filling; three different sets of linear constraints were applied; and two MILP methods were implemented. We compared these 13 variants according to accuracy, speed, and amount of information returned to the user. CONCLUSIONS: We observed large variation among the performance of the 13 gap-filling variants. Although no variant was best in all dimensions, we found one variant that was fast, accurate, and returned more information to the user. Some gap-filling variants were inaccurate, producing solutions that were non-minimum or invalid (did not enable model growth). The best GenDev variant showed a best average precision of 87% and a best average recall of 61%. FastDev showed an average precision of 71% and an average recall of 59%. Thus, using the most accurate variant, approximately 13% of the gap-filled reactions were incorrect (were not the reactions removed from the model), and 39% of gap-filled reactions were not found, suggesting that curation is still an important aspect of metabolic-model development.
format Online
Article
Text
id pubmed-5813426
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58134262018-02-16 Evaluation of reaction gap-filling accuracy by randomization Latendresse, Mario Karp, Peter D. BMC Bioinformatics Research Article BACKGROUND: Completion of genome-scale flux-balance models using computational reaction gap-filling is a widely used approach, but its accuracy is not well known. RESULTS: We report on computational experiments of reaction gap filling in which we generated degraded versions of the EcoCyc-20.0-GEM model by randomly removing flux-carrying reactions from a growing model. We gap-filled the degraded models and compared the resulting gap-filled models with the original model. Gap-filling was performed by the Pathway Tools MetaFlux software using its General Development Mode (GenDev) and its Fast Development Mode (FastDev). We explored 12 GenDev variants including two linear solvers (SCIP and CPLEX) for solving the Mixed Integer Linear Programming (MILP) problems for gap filling; three different sets of linear constraints were applied; and two MILP methods were implemented. We compared these 13 variants according to accuracy, speed, and amount of information returned to the user. CONCLUSIONS: We observed large variation among the performance of the 13 gap-filling variants. Although no variant was best in all dimensions, we found one variant that was fast, accurate, and returned more information to the user. Some gap-filling variants were inaccurate, producing solutions that were non-minimum or invalid (did not enable model growth). The best GenDev variant showed a best average precision of 87% and a best average recall of 61%. FastDev showed an average precision of 71% and an average recall of 59%. Thus, using the most accurate variant, approximately 13% of the gap-filled reactions were incorrect (were not the reactions removed from the model), and 39% of gap-filled reactions were not found, suggesting that curation is still an important aspect of metabolic-model development. BioMed Central 2018-02-14 /pmc/articles/PMC5813426/ /pubmed/29444634 http://dx.doi.org/10.1186/s12859-018-2050-4 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Latendresse, Mario
Karp, Peter D.
Evaluation of reaction gap-filling accuracy by randomization
title Evaluation of reaction gap-filling accuracy by randomization
title_full Evaluation of reaction gap-filling accuracy by randomization
title_fullStr Evaluation of reaction gap-filling accuracy by randomization
title_full_unstemmed Evaluation of reaction gap-filling accuracy by randomization
title_short Evaluation of reaction gap-filling accuracy by randomization
title_sort evaluation of reaction gap-filling accuracy by randomization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813426/
https://www.ncbi.nlm.nih.gov/pubmed/29444634
http://dx.doi.org/10.1186/s12859-018-2050-4
work_keys_str_mv AT latendressemario evaluationofreactiongapfillingaccuracybyrandomization
AT karppeterd evaluationofreactiongapfillingaccuracybyrandomization