Cargando…

Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming

(13)C metabolic flux analysis ((13)C-MFA) has been widely used to measure in vivo enzyme reaction rates (i.e., metabolic flux) in microorganisms. Mining the relationship between environmental and genetic factors and metabolic fluxes hidden in existing fluxomic data will lead to predictive models tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Stephen Gang, Wang, Yuxuan, Jiang, Wu, Oyetunde, Tolutola, Yao, Ruilian, Zhang, Xuehong, Shimizu, Kazuyuki, Tang, Yinjie J., Bao, Forrest Sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4836714/
https://www.ncbi.nlm.nih.gov/pubmed/27092947
http://dx.doi.org/10.1371/journal.pcbi.1004838
_version_ 1782427770768326656
author Wu, Stephen Gang
Wang, Yuxuan
Jiang, Wu
Oyetunde, Tolutola
Yao, Ruilian
Zhang, Xuehong
Shimizu, Kazuyuki
Tang, Yinjie J.
Bao, Forrest Sheng
author_facet Wu, Stephen Gang
Wang, Yuxuan
Jiang, Wu
Oyetunde, Tolutola
Yao, Ruilian
Zhang, Xuehong
Shimizu, Kazuyuki
Tang, Yinjie J.
Bao, Forrest Sheng
author_sort Wu, Stephen Gang
collection PubMed
description (13)C metabolic flux analysis ((13)C-MFA) has been widely used to measure in vivo enzyme reaction rates (i.e., metabolic flux) in microorganisms. Mining the relationship between environmental and genetic factors and metabolic fluxes hidden in existing fluxomic data will lead to predictive models that can significantly accelerate flux quantification. In this paper, we present a web-based platform MFlux (http://mflux.org) that predicts the bacterial central metabolism via machine learning, leveraging data from approximately 100 (13)C-MFA papers on heterotrophic bacterial metabolisms. Three machine learning methods, namely Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), and Decision Tree, were employed to study the sophisticated relationship between influential factors and metabolic fluxes. We performed a grid search of the best parameter set for each algorithm and verified their performance through 10-fold cross validations. SVM yields the highest accuracy among all three algorithms. Further, we employed quadratic programming to adjust flux profiles to satisfy stoichiometric constraints. Multiple case studies have shown that MFlux can reasonably predict fluxomes as a function of bacterial species, substrate types, growth rate, oxygen conditions, and cultivation methods. Due to the interest of studying model organism under particular carbon sources, bias of fluxome in the dataset may limit the applicability of machine learning models. This problem can be resolved after more papers on (13)C-MFA are published for non-model species.
format Online
Article
Text
id pubmed-4836714
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48367142016-04-29 Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming Wu, Stephen Gang Wang, Yuxuan Jiang, Wu Oyetunde, Tolutola Yao, Ruilian Zhang, Xuehong Shimizu, Kazuyuki Tang, Yinjie J. Bao, Forrest Sheng PLoS Comput Biol Research Article (13)C metabolic flux analysis ((13)C-MFA) has been widely used to measure in vivo enzyme reaction rates (i.e., metabolic flux) in microorganisms. Mining the relationship between environmental and genetic factors and metabolic fluxes hidden in existing fluxomic data will lead to predictive models that can significantly accelerate flux quantification. In this paper, we present a web-based platform MFlux (http://mflux.org) that predicts the bacterial central metabolism via machine learning, leveraging data from approximately 100 (13)C-MFA papers on heterotrophic bacterial metabolisms. Three machine learning methods, namely Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), and Decision Tree, were employed to study the sophisticated relationship between influential factors and metabolic fluxes. We performed a grid search of the best parameter set for each algorithm and verified their performance through 10-fold cross validations. SVM yields the highest accuracy among all three algorithms. Further, we employed quadratic programming to adjust flux profiles to satisfy stoichiometric constraints. Multiple case studies have shown that MFlux can reasonably predict fluxomes as a function of bacterial species, substrate types, growth rate, oxygen conditions, and cultivation methods. Due to the interest of studying model organism under particular carbon sources, bias of fluxome in the dataset may limit the applicability of machine learning models. This problem can be resolved after more papers on (13)C-MFA are published for non-model species. Public Library of Science 2016-04-19 /pmc/articles/PMC4836714/ /pubmed/27092947 http://dx.doi.org/10.1371/journal.pcbi.1004838 Text en © 2016 Wu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wu, Stephen Gang
Wang, Yuxuan
Jiang, Wu
Oyetunde, Tolutola
Yao, Ruilian
Zhang, Xuehong
Shimizu, Kazuyuki
Tang, Yinjie J.
Bao, Forrest Sheng
Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title_full Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title_fullStr Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title_full_unstemmed Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title_short Rapid Prediction of Bacterial Heterotrophic Fluxomics Using Machine Learning and Constraint Programming
title_sort rapid prediction of bacterial heterotrophic fluxomics using machine learning and constraint programming
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4836714/
https://www.ncbi.nlm.nih.gov/pubmed/27092947
http://dx.doi.org/10.1371/journal.pcbi.1004838
work_keys_str_mv AT wustephengang rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT wangyuxuan rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT jiangwu rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT oyetundetolutola rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT yaoruilian rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT zhangxuehong rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT shimizukazuyuki rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT tangyinjiej rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming
AT baoforrestsheng rapidpredictionofbacterialheterotrophicfluxomicsusingmachinelearningandconstraintprogramming