Cargando…
Nunchaku: optimally partitioning data into piece-wise contiguous segments
MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697733/ https://www.ncbi.nlm.nih.gov/pubmed/37966918 http://dx.doi.org/10.1093/bioinformatics/btad688 |
_version_ | 1785154802371526656 |
---|---|
author | Huo, Yu Li, Hongpei Wang, Xiao Du, Xiaochen Swain, Peter S |
author_facet | Huo, Yu Li, Hongpei Wang, Xiao Du, Xiaochen Swain, Peter S |
author_sort | Huo, Yu |
collection | PubMed |
description | MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to infer the optimal partitioning of a dataset not only into contiguous piece-wise linear segments, but also into contiguous segments described by linear combinations of arbitrary basis functions. We therefore present a general solution to the problem of identifying discontinuous change points. Focusing on microbial growth, we use the algorithm to find the range of optical density where this density is linearly proportional to the number of cells and to automatically find the regions of exponential growth for both Escherichia coli and Saccharomyces cerevisiae. For budding yeast, we consequently are able to infer the Monod constant for growth on fructose. Our algorithm lends itself to automation and high throughput studies, increases reproducibility, and should facilitate data analyses for a broad range of scientists. AVAILABILITY AND IMPLEMENTATION: The corresponding Python package, entitled Nunchaku, is available at PyPI: https://pypi.org/project/nunchaku. |
format | Online Article Text |
id | pubmed-10697733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106977332023-12-06 Nunchaku: optimally partitioning data into piece-wise contiguous segments Huo, Yu Li, Hongpei Wang, Xiao Du, Xiaochen Swain, Peter S Bioinformatics Original Paper MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to infer the optimal partitioning of a dataset not only into contiguous piece-wise linear segments, but also into contiguous segments described by linear combinations of arbitrary basis functions. We therefore present a general solution to the problem of identifying discontinuous change points. Focusing on microbial growth, we use the algorithm to find the range of optical density where this density is linearly proportional to the number of cells and to automatically find the regions of exponential growth for both Escherichia coli and Saccharomyces cerevisiae. For budding yeast, we consequently are able to infer the Monod constant for growth on fructose. Our algorithm lends itself to automation and high throughput studies, increases reproducibility, and should facilitate data analyses for a broad range of scientists. AVAILABILITY AND IMPLEMENTATION: The corresponding Python package, entitled Nunchaku, is available at PyPI: https://pypi.org/project/nunchaku. Oxford University Press 2023-11-15 /pmc/articles/PMC10697733/ /pubmed/37966918 http://dx.doi.org/10.1093/bioinformatics/btad688 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Huo, Yu Li, Hongpei Wang, Xiao Du, Xiaochen Swain, Peter S Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title | Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title_full | Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title_fullStr | Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title_full_unstemmed | Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title_short | Nunchaku: optimally partitioning data into piece-wise contiguous segments |
title_sort | nunchaku: optimally partitioning data into piece-wise contiguous segments |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697733/ https://www.ncbi.nlm.nih.gov/pubmed/37966918 http://dx.doi.org/10.1093/bioinformatics/btad688 |
work_keys_str_mv | AT huoyu nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments AT lihongpei nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments AT wangxiao nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments AT duxiaochen nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments AT swainpeters nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments |