Cargando…

SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions

BACKGROUND: The topology of metabolic networks is both well-studied and remarkably well-conserved across many species. The regulation of these networks, however, is much more poorly characterized, though it is known to be divergent across organisms—two characteristics that make it difficult to model...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Justin Y., Nguyen, Britney, Orosco, Carlos, Styczynski, Mark P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8268592/
https://www.ncbi.nlm.nih.gov/pubmed/34238207
http://dx.doi.org/10.1186/s12859-021-04281-7
_version_ 1783720391009632256
author Lee, Justin Y.
Nguyen, Britney
Orosco, Carlos
Styczynski, Mark P.
author_facet Lee, Justin Y.
Nguyen, Britney
Orosco, Carlos
Styczynski, Mark P.
author_sort Lee, Justin Y.
collection PubMed
description BACKGROUND: The topology of metabolic networks is both well-studied and remarkably well-conserved across many species. The regulation of these networks, however, is much more poorly characterized, though it is known to be divergent across organisms—two characteristics that make it difficult to model metabolic networks accurately. While many computational methods have been built to unravel transcriptional regulation, there have been few approaches developed for systems-scale analysis and study of metabolic regulation. Here, we present a stepwise machine learning framework that applies established algorithms to identify regulatory interactions in metabolic systems based on metabolic data: stepwise classification of unknown regulation, or SCOUR. RESULTS: We evaluated our framework on both noiseless and noisy data, using several models of varying sizes and topologies to show that our approach is generalizable. We found that, when testing on data under the most realistic conditions (low sampling frequency and high noise), SCOUR could identify reaction fluxes controlled only by the concentration of a single metabolite (its primary substrate) with high accuracy. The positive predictive value (PPV) for identifying reactions controlled by the concentration of two metabolites ranged from 32 to 88% for noiseless data, 9.2 to 49% for either low sampling frequency/low noise or high sampling frequency/high noise data, and 6.6–27% for low sampling frequency/high noise data, with results typically sufficiently high for lab validation to be a practical endeavor. While the PPVs for reactions controlled by three metabolites were lower, they were still in most cases significantly better than random classification. CONCLUSIONS: SCOUR uses a novel approach to synthetically generate the training data needed to identify regulators of reaction fluxes in a given metabolic system, enabling metabolomics and fluxomics data to be leveraged for regulatory structure inference. By identifying and triaging the most likely candidate regulatory interactions, SCOUR can drastically reduce the amount of time needed to identify and experimentally validate metabolic regulatory interactions. As high-throughput experimental methods for testing these interactions are further developed, SCOUR will provide critical impact in the development of predictive metabolic models in new organisms and pathways. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04281-7.
format Online
Article
Text
id pubmed-8268592
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82685922021-07-12 SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions Lee, Justin Y. Nguyen, Britney Orosco, Carlos Styczynski, Mark P. BMC Bioinformatics Methodology Article BACKGROUND: The topology of metabolic networks is both well-studied and remarkably well-conserved across many species. The regulation of these networks, however, is much more poorly characterized, though it is known to be divergent across organisms—two characteristics that make it difficult to model metabolic networks accurately. While many computational methods have been built to unravel transcriptional regulation, there have been few approaches developed for systems-scale analysis and study of metabolic regulation. Here, we present a stepwise machine learning framework that applies established algorithms to identify regulatory interactions in metabolic systems based on metabolic data: stepwise classification of unknown regulation, or SCOUR. RESULTS: We evaluated our framework on both noiseless and noisy data, using several models of varying sizes and topologies to show that our approach is generalizable. We found that, when testing on data under the most realistic conditions (low sampling frequency and high noise), SCOUR could identify reaction fluxes controlled only by the concentration of a single metabolite (its primary substrate) with high accuracy. The positive predictive value (PPV) for identifying reactions controlled by the concentration of two metabolites ranged from 32 to 88% for noiseless data, 9.2 to 49% for either low sampling frequency/low noise or high sampling frequency/high noise data, and 6.6–27% for low sampling frequency/high noise data, with results typically sufficiently high for lab validation to be a practical endeavor. While the PPVs for reactions controlled by three metabolites were lower, they were still in most cases significantly better than random classification. CONCLUSIONS: SCOUR uses a novel approach to synthetically generate the training data needed to identify regulators of reaction fluxes in a given metabolic system, enabling metabolomics and fluxomics data to be leveraged for regulatory structure inference. By identifying and triaging the most likely candidate regulatory interactions, SCOUR can drastically reduce the amount of time needed to identify and experimentally validate metabolic regulatory interactions. As high-throughput experimental methods for testing these interactions are further developed, SCOUR will provide critical impact in the development of predictive metabolic models in new organisms and pathways. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04281-7. BioMed Central 2021-07-08 /pmc/articles/PMC8268592/ /pubmed/34238207 http://dx.doi.org/10.1186/s12859-021-04281-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Lee, Justin Y.
Nguyen, Britney
Orosco, Carlos
Styczynski, Mark P.
SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title_full SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title_fullStr SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title_full_unstemmed SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title_short SCOUR: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
title_sort scour: a stepwise machine learning framework for predicting metabolite-dependent regulatory interactions
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8268592/
https://www.ncbi.nlm.nih.gov/pubmed/34238207
http://dx.doi.org/10.1186/s12859-021-04281-7
work_keys_str_mv AT leejustiny scourastepwisemachinelearningframeworkforpredictingmetabolitedependentregulatoryinteractions
AT nguyenbritney scourastepwisemachinelearningframeworkforpredictingmetabolitedependentregulatoryinteractions
AT oroscocarlos scourastepwisemachinelearningframeworkforpredictingmetabolitedependentregulatoryinteractions
AT styczynskimarkp scourastepwisemachinelearningframeworkforpredictingmetabolitedependentregulatoryinteractions