Cargando…

Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance

Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computa...

Descripción completa

Detalles Bibliográficos
Autores principales: Jadebeck, Johann F., Wiechert, Wolfgang, Nöh, Katharina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446239/
https://www.ncbi.nlm.nih.gov/pubmed/37566638
http://dx.doi.org/10.1371/journal.pcbi.1011378
_version_ 1785094362470809600
author Jadebeck, Johann F.
Wiechert, Wolfgang
Nöh, Katharina
author_facet Jadebeck, Johann F.
Wiechert, Wolfgang
Nöh, Katharina
author_sort Jadebeck, Johann F.
collection PubMed
description Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computational and, thereby, sampling efficiencies of the widely used Coordinate Hit-and-Run with Rounding (CHRR) algorithm. By benchmarking CHRR with thinning with simplices and genome-scale metabolic networks of up to thousands of dimensions, we find a substantial increase in computational efficiency compared to unthinned CHRR, in our examples by orders of magnitude, as measured by the effective sample size per time (ESS/t), with performance gains growing with polytope (effective network) dimension. Using a set of benchmark models we derive a ready-to-apply guideline for tuning thinning to efficient and effective use of compute resources without requiring additional coding effort. Our guideline is validated using three (out-of-sample) large-scale networks and we show that it allows sampling convex polytopes uniformly to convergence in a fraction of time, thereby unlocking the rigorous investigation of hitherto intractable models. The derivation of our guideline is explained in detail, allowing future researchers to update it as needed as new model classes and more training data becomes available. CHRR with deliberate utilization of thinning thereby paves the way to keep pace with progressing model sizes derived with the constraint-based reconstruction and analysis (COBRA) tool set. Sampling and evaluation pipelines are available at https://jugit.fz-juelich.de/IBG-1/ModSim/fluxomics/chrrt.
format Online
Article
Text
id pubmed-10446239
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104462392023-08-24 Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance Jadebeck, Johann F. Wiechert, Wolfgang Nöh, Katharina PLoS Comput Biol Research Article Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computational and, thereby, sampling efficiencies of the widely used Coordinate Hit-and-Run with Rounding (CHRR) algorithm. By benchmarking CHRR with thinning with simplices and genome-scale metabolic networks of up to thousands of dimensions, we find a substantial increase in computational efficiency compared to unthinned CHRR, in our examples by orders of magnitude, as measured by the effective sample size per time (ESS/t), with performance gains growing with polytope (effective network) dimension. Using a set of benchmark models we derive a ready-to-apply guideline for tuning thinning to efficient and effective use of compute resources without requiring additional coding effort. Our guideline is validated using three (out-of-sample) large-scale networks and we show that it allows sampling convex polytopes uniformly to convergence in a fraction of time, thereby unlocking the rigorous investigation of hitherto intractable models. The derivation of our guideline is explained in detail, allowing future researchers to update it as needed as new model classes and more training data becomes available. CHRR with deliberate utilization of thinning thereby paves the way to keep pace with progressing model sizes derived with the constraint-based reconstruction and analysis (COBRA) tool set. Sampling and evaluation pipelines are available at https://jugit.fz-juelich.de/IBG-1/ModSim/fluxomics/chrrt. Public Library of Science 2023-08-11 /pmc/articles/PMC10446239/ /pubmed/37566638 http://dx.doi.org/10.1371/journal.pcbi.1011378 Text en © 2023 Jadebeck et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jadebeck, Johann F.
Wiechert, Wolfgang
Nöh, Katharina
Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title_full Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title_fullStr Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title_full_unstemmed Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title_short Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
title_sort practical sampling of constraint-based models: optimized thinning boosts chrr performance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446239/
https://www.ncbi.nlm.nih.gov/pubmed/37566638
http://dx.doi.org/10.1371/journal.pcbi.1011378
work_keys_str_mv AT jadebeckjohannf practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance
AT wiechertwolfgang practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance
AT nohkatharina practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance