Cargando…

Nunchaku: optimally partitioning data into piece-wise contiguous segments

MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to i...

Descripción completa

Detalles Bibliográficos
Autores principales: Huo, Yu, Li, Hongpei, Wang, Xiao, Du, Xiaochen, Swain, Peter S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697733/
https://www.ncbi.nlm.nih.gov/pubmed/37966918
http://dx.doi.org/10.1093/bioinformatics/btad688
_version_ 1785154802371526656
author Huo, Yu
Li, Hongpei
Wang, Xiao
Du, Xiaochen
Swain, Peter S
author_facet Huo, Yu
Li, Hongpei
Wang, Xiao
Du, Xiaochen
Swain, Peter S
author_sort Huo, Yu
collection PubMed
description MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to infer the optimal partitioning of a dataset not only into contiguous piece-wise linear segments, but also into contiguous segments described by linear combinations of arbitrary basis functions. We therefore present a general solution to the problem of identifying discontinuous change points. Focusing on microbial growth, we use the algorithm to find the range of optical density where this density is linearly proportional to the number of cells and to automatically find the regions of exponential growth for both Escherichia coli and Saccharomyces cerevisiae. For budding yeast, we consequently are able to infer the Monod constant for growth on fructose. Our algorithm lends itself to automation and high throughput studies, increases reproducibility, and should facilitate data analyses for a broad range of scientists. AVAILABILITY AND IMPLEMENTATION: The corresponding Python package, entitled Nunchaku, is available at PyPI: https://pypi.org/project/nunchaku.
format Online
Article
Text
id pubmed-10697733
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106977332023-12-06 Nunchaku: optimally partitioning data into piece-wise contiguous segments Huo, Yu Li, Hongpei Wang, Xiao Du, Xiaochen Swain, Peter S Bioinformatics Original Paper MOTIVATION: When analyzing 1D time series, scientists are often interested in identifying regions where one variable depends linearly on the other. Typically, they use an ad hoc and therefore often subjective method to do so. RESULTS: Here, we develop a statistically rigorous, Bayesian approach to infer the optimal partitioning of a dataset not only into contiguous piece-wise linear segments, but also into contiguous segments described by linear combinations of arbitrary basis functions. We therefore present a general solution to the problem of identifying discontinuous change points. Focusing on microbial growth, we use the algorithm to find the range of optical density where this density is linearly proportional to the number of cells and to automatically find the regions of exponential growth for both Escherichia coli and Saccharomyces cerevisiae. For budding yeast, we consequently are able to infer the Monod constant for growth on fructose. Our algorithm lends itself to automation and high throughput studies, increases reproducibility, and should facilitate data analyses for a broad range of scientists. AVAILABILITY AND IMPLEMENTATION: The corresponding Python package, entitled Nunchaku, is available at PyPI: https://pypi.org/project/nunchaku. Oxford University Press 2023-11-15 /pmc/articles/PMC10697733/ /pubmed/37966918 http://dx.doi.org/10.1093/bioinformatics/btad688 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Huo, Yu
Li, Hongpei
Wang, Xiao
Du, Xiaochen
Swain, Peter S
Nunchaku: optimally partitioning data into piece-wise contiguous segments
title Nunchaku: optimally partitioning data into piece-wise contiguous segments
title_full Nunchaku: optimally partitioning data into piece-wise contiguous segments
title_fullStr Nunchaku: optimally partitioning data into piece-wise contiguous segments
title_full_unstemmed Nunchaku: optimally partitioning data into piece-wise contiguous segments
title_short Nunchaku: optimally partitioning data into piece-wise contiguous segments
title_sort nunchaku: optimally partitioning data into piece-wise contiguous segments
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10697733/
https://www.ncbi.nlm.nih.gov/pubmed/37966918
http://dx.doi.org/10.1093/bioinformatics/btad688
work_keys_str_mv AT huoyu nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments
AT lihongpei nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments
AT wangxiao nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments
AT duxiaochen nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments
AT swainpeters nunchakuoptimallypartitioningdataintopiecewisecontiguoussegments