Cargando…
Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality
As the Internet-of-Things is deployed widely, many time-series data are generated everyday. Thus, classifying time-series automatically has become important. Compression-based pattern recognition has attracted attention, because it can analyze various data universally with few model parameters. RPCD...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10297683/ https://www.ncbi.nlm.nih.gov/pubmed/37372297 http://dx.doi.org/10.3390/e25060953 |
_version_ | 1785063940260102144 |
---|---|
author | Murai, Tatsumasa Koga, Hisashi |
author_facet | Murai, Tatsumasa Koga, Hisashi |
author_sort | Murai, Tatsumasa |
collection | PubMed |
description | As the Internet-of-Things is deployed widely, many time-series data are generated everyday. Thus, classifying time-series automatically has become important. Compression-based pattern recognition has attracted attention, because it can analyze various data universally with few model parameters. RPCD (Recurrent Plots Compression Distance) is known as a compression-based time-series classification method. First, RPCD transforms time-series data into an image called “Recurrent Plots (RP)”. Then, the distance between two time-series data is determined as the dissimilarity between their RPs. Here, the dissimilarity between two images is computed from the file size, when an MPEG-1 encoder compresses the video, which serializes the two images in order. In this paper, by analyzing the RPCD, we give an important insight that the quality parameter for the MPEG-1 encoding that controls the resolution of compressed videos influences the classification performance very much. We also show that the optimal parameter value depends extremely on the dataset to be classified: Interestingly, the optimal value for one dataset can make the RPCD fall behind a naive random classifier for another dataset. Supported by these insights, we propose an improved version of RPCD named qRPCD, which searches the optimal parameter value by means of cross-validation. Experimentally, qRPCD works superiorly to the original RPCD by about 4% in terms of classification accuracy. |
format | Online Article Text |
id | pubmed-10297683 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102976832023-06-28 Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality Murai, Tatsumasa Koga, Hisashi Entropy (Basel) Article As the Internet-of-Things is deployed widely, many time-series data are generated everyday. Thus, classifying time-series automatically has become important. Compression-based pattern recognition has attracted attention, because it can analyze various data universally with few model parameters. RPCD (Recurrent Plots Compression Distance) is known as a compression-based time-series classification method. First, RPCD transforms time-series data into an image called “Recurrent Plots (RP)”. Then, the distance between two time-series data is determined as the dissimilarity between their RPs. Here, the dissimilarity between two images is computed from the file size, when an MPEG-1 encoder compresses the video, which serializes the two images in order. In this paper, by analyzing the RPCD, we give an important insight that the quality parameter for the MPEG-1 encoding that controls the resolution of compressed videos influences the classification performance very much. We also show that the optimal parameter value depends extremely on the dataset to be classified: Interestingly, the optimal value for one dataset can make the RPCD fall behind a naive random classifier for another dataset. Supported by these insights, we propose an improved version of RPCD named qRPCD, which searches the optimal parameter value by means of cross-validation. Experimentally, qRPCD works superiorly to the original RPCD by about 4% in terms of classification accuracy. MDPI 2023-06-19 /pmc/articles/PMC10297683/ /pubmed/37372297 http://dx.doi.org/10.3390/e25060953 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Murai, Tatsumasa Koga, Hisashi Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title | Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title_full | Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title_fullStr | Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title_full_unstemmed | Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title_short | Improved Recurrence Plots Compression Distance by Learning Parameter for Video Compression Quality |
title_sort | improved recurrence plots compression distance by learning parameter for video compression quality |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10297683/ https://www.ncbi.nlm.nih.gov/pubmed/37372297 http://dx.doi.org/10.3390/e25060953 |
work_keys_str_mv | AT muraitatsumasa improvedrecurrenceplotscompressiondistancebylearningparameterforvideocompressionquality AT kogahisashi improvedrecurrenceplotscompressiondistancebylearningparameterforvideocompressionquality |