Cargando…
Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance
Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446239/ https://www.ncbi.nlm.nih.gov/pubmed/37566638 http://dx.doi.org/10.1371/journal.pcbi.1011378 |
_version_ | 1785094362470809600 |
---|---|
author | Jadebeck, Johann F. Wiechert, Wolfgang Nöh, Katharina |
author_facet | Jadebeck, Johann F. Wiechert, Wolfgang Nöh, Katharina |
author_sort | Jadebeck, Johann F. |
collection | PubMed |
description | Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computational and, thereby, sampling efficiencies of the widely used Coordinate Hit-and-Run with Rounding (CHRR) algorithm. By benchmarking CHRR with thinning with simplices and genome-scale metabolic networks of up to thousands of dimensions, we find a substantial increase in computational efficiency compared to unthinned CHRR, in our examples by orders of magnitude, as measured by the effective sample size per time (ESS/t), with performance gains growing with polytope (effective network) dimension. Using a set of benchmark models we derive a ready-to-apply guideline for tuning thinning to efficient and effective use of compute resources without requiring additional coding effort. Our guideline is validated using three (out-of-sample) large-scale networks and we show that it allows sampling convex polytopes uniformly to convergence in a fraction of time, thereby unlocking the rigorous investigation of hitherto intractable models. The derivation of our guideline is explained in detail, allowing future researchers to update it as needed as new model classes and more training data becomes available. CHRR with deliberate utilization of thinning thereby paves the way to keep pace with progressing model sizes derived with the constraint-based reconstruction and analysis (COBRA) tool set. Sampling and evaluation pipelines are available at https://jugit.fz-juelich.de/IBG-1/ModSim/fluxomics/chrrt. |
format | Online Article Text |
id | pubmed-10446239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104462392023-08-24 Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance Jadebeck, Johann F. Wiechert, Wolfgang Nöh, Katharina PLoS Comput Biol Research Article Thinning is a sub-sampling technique to reduce the memory footprint of Markov chain Monte Carlo. Despite being commonly used, thinning is rarely considered efficient. For sampling constraint-based models, a highly relevant use-case in systems biology, we here demonstrate that thinning boosts computational and, thereby, sampling efficiencies of the widely used Coordinate Hit-and-Run with Rounding (CHRR) algorithm. By benchmarking CHRR with thinning with simplices and genome-scale metabolic networks of up to thousands of dimensions, we find a substantial increase in computational efficiency compared to unthinned CHRR, in our examples by orders of magnitude, as measured by the effective sample size per time (ESS/t), with performance gains growing with polytope (effective network) dimension. Using a set of benchmark models we derive a ready-to-apply guideline for tuning thinning to efficient and effective use of compute resources without requiring additional coding effort. Our guideline is validated using three (out-of-sample) large-scale networks and we show that it allows sampling convex polytopes uniformly to convergence in a fraction of time, thereby unlocking the rigorous investigation of hitherto intractable models. The derivation of our guideline is explained in detail, allowing future researchers to update it as needed as new model classes and more training data becomes available. CHRR with deliberate utilization of thinning thereby paves the way to keep pace with progressing model sizes derived with the constraint-based reconstruction and analysis (COBRA) tool set. Sampling and evaluation pipelines are available at https://jugit.fz-juelich.de/IBG-1/ModSim/fluxomics/chrrt. Public Library of Science 2023-08-11 /pmc/articles/PMC10446239/ /pubmed/37566638 http://dx.doi.org/10.1371/journal.pcbi.1011378 Text en © 2023 Jadebeck et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Jadebeck, Johann F. Wiechert, Wolfgang Nöh, Katharina Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title | Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title_full | Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title_fullStr | Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title_full_unstemmed | Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title_short | Practical sampling of constraint-based models: Optimized thinning boosts CHRR performance |
title_sort | practical sampling of constraint-based models: optimized thinning boosts chrr performance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446239/ https://www.ncbi.nlm.nih.gov/pubmed/37566638 http://dx.doi.org/10.1371/journal.pcbi.1011378 |
work_keys_str_mv | AT jadebeckjohannf practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance AT wiechertwolfgang practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance AT nohkatharina practicalsamplingofconstraintbasedmodelsoptimizedthinningboostschrrperformance |