Cargando…

Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform

This study develops a new clustering method for high-dimensional zero-inflated time series data. The proposed method is based on thick-pen transform (TPT), in which the basic idea is to draw along the data with a pen of a given thickness. Since TPT is a multi-scale visualization technique, it provid...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Minji, Oh, Hee-Seok, Lim, Yaeji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10258486/
https://www.ncbi.nlm.nih.gov/pubmed/37359508
http://dx.doi.org/10.1007/s00357-023-09437-z
_version_ 1785057476256727040
author Kim, Minji
Oh, Hee-Seok
Lim, Yaeji
author_facet Kim, Minji
Oh, Hee-Seok
Lim, Yaeji
author_sort Kim, Minji
collection PubMed
description This study develops a new clustering method for high-dimensional zero-inflated time series data. The proposed method is based on thick-pen transform (TPT), in which the basic idea is to draw along the data with a pen of a given thickness. Since TPT is a multi-scale visualization technique, it provides some information on the temporal tendency of neighborhood values. We introduce a modified TPT, termed ‘ensemble TPT (e-TPT)’, to enhance the temporal resolution of zero-inflated time series data that is crucial for clustering them efficiently. Furthermore, this study defines a modified similarity measure for zero-inflated time series data considering e-TPT and proposes an efficient iterative clustering algorithm suitable for the proposed measure. Finally, the effectiveness of the proposed method is demonstrated by simulation experiments and two real datasets: step count data and newly confirmed COVID-19 case data.
format Online
Article
Text
id pubmed-10258486
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-102584862023-06-14 Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform Kim, Minji Oh, Hee-Seok Lim, Yaeji J Classif Article This study develops a new clustering method for high-dimensional zero-inflated time series data. The proposed method is based on thick-pen transform (TPT), in which the basic idea is to draw along the data with a pen of a given thickness. Since TPT is a multi-scale visualization technique, it provides some information on the temporal tendency of neighborhood values. We introduce a modified TPT, termed ‘ensemble TPT (e-TPT)’, to enhance the temporal resolution of zero-inflated time series data that is crucial for clustering them efficiently. Furthermore, this study defines a modified similarity measure for zero-inflated time series data considering e-TPT and proposes an efficient iterative clustering algorithm suitable for the proposed measure. Finally, the effectiveness of the proposed method is demonstrated by simulation experiments and two real datasets: step count data and newly confirmed COVID-19 case data. Springer US 2023-06-12 /pmc/articles/PMC10258486/ /pubmed/37359508 http://dx.doi.org/10.1007/s00357-023-09437-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Kim, Minji
Oh, Hee-Seok
Lim, Yaeji
Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title_full Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title_fullStr Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title_full_unstemmed Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title_short Zero-Inflated Time Series Clustering Via Ensemble Thick-Pen Transform
title_sort zero-inflated time series clustering via ensemble thick-pen transform
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10258486/
https://www.ncbi.nlm.nih.gov/pubmed/37359508
http://dx.doi.org/10.1007/s00357-023-09437-z
work_keys_str_mv AT kimminji zeroinflatedtimeseriesclusteringviaensemblethickpentransform
AT ohheeseok zeroinflatedtimeseriesclusteringviaensemblethickpentransform
AT limyaeji zeroinflatedtimeseriesclusteringviaensemblethickpentransform