Cargando…

Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning

Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Medlock, Gregory L., Papin, Jason A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cell Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6975163/
https://www.ncbi.nlm.nih.gov/pubmed/31926940
http://dx.doi.org/10.1016/j.cels.2019.11.006
_version_ 1783490244735139840
author Medlock, Gregory L.
Papin, Jason A.
author_facet Medlock, Gregory L.
Papin, Jason A.
author_sort Medlock, Gregory L.
collection PubMed
description Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for genome-scale metabolic models. We generate an ensemble of models, each equally consistent with experimental data, then perform simulations with them. We apply machine learning to the simulation output to identify model structure variation that maximally influences simulations. These variants are high-priority candidates for curation through removal, addition, or reannotation in the model. We apply this approach, automated metabolic model ensemble-driven elimination of uncertainty with statistical learning (AMMEDEUS), to 29 bacterial species to improve gene essentiality predictions. We explore targets for individual species and compile pan-species targets to improve the database used during model construction. AMMEDEUS is an automated and performance-driven recommendation system that complements intuition during curation of biochemical knowledgebases.
format Online
Article
Text
id pubmed-6975163
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-69751632020-01-28 Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning Medlock, Gregory L. Papin, Jason A. Cell Syst Article Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for genome-scale metabolic models. We generate an ensemble of models, each equally consistent with experimental data, then perform simulations with them. We apply machine learning to the simulation output to identify model structure variation that maximally influences simulations. These variants are high-priority candidates for curation through removal, addition, or reannotation in the model. We apply this approach, automated metabolic model ensemble-driven elimination of uncertainty with statistical learning (AMMEDEUS), to 29 bacterial species to improve gene essentiality predictions. We explore targets for individual species and compile pan-species targets to improve the database used during model construction. AMMEDEUS is an automated and performance-driven recommendation system that complements intuition during curation of biochemical knowledgebases. Cell Press 2020-01-22 /pmc/articles/PMC6975163/ /pubmed/31926940 http://dx.doi.org/10.1016/j.cels.2019.11.006 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Medlock, Gregory L.
Papin, Jason A.
Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title_full Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title_fullStr Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title_full_unstemmed Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title_short Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning
title_sort guiding the refinement of biochemical knowledgebases with ensembles of metabolic networks and machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6975163/
https://www.ncbi.nlm.nih.gov/pubmed/31926940
http://dx.doi.org/10.1016/j.cels.2019.11.006
work_keys_str_mv AT medlockgregoryl guidingtherefinementofbiochemicalknowledgebaseswithensemblesofmetabolicnetworksandmachinelearning
AT papinjasona guidingtherefinementofbiochemicalknowledgebaseswithensemblesofmetabolicnetworksandmachinelearning