Cargando…

Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction

A wide range of applications based on sequential data, named time series, have become increasingly popular in recent years, mainly those based on the Internet of Things (IoT). Several different machine learning algorithms exploit the patterns extracted from sequential data to support multiple tasks....

Descripción completa

Detalles Bibliográficos
Autores principales:	Silva, Ricardo Petri, Zarpelão, Bruno Bogaz, Cano, Alberto, Junior, Sylvio Barbon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587387/ https://www.ncbi.nlm.nih.gov/pubmed/34770639 http://dx.doi.org/10.3390/s21217333

_version_	1784598131106643968
author	Silva, Ricardo Petri Zarpelão, Bruno Bogaz Cano, Alberto Junior, Sylvio Barbon
author_facet	Silva, Ricardo Petri Zarpelão, Bruno Bogaz Cano, Alberto Junior, Sylvio Barbon
author_sort	Silva, Ricardo Petri
collection	PubMed
description	A wide range of applications based on sequential data, named time series, have become increasingly popular in recent years, mainly those based on the Internet of Things (IoT). Several different machine learning algorithms exploit the patterns extracted from sequential data to support multiple tasks. However, this data can suffer from unreliable readings that can lead to low accuracy models due to the low-quality training sets available. Detecting the change point between high representative segments is an important ally to find and thread biased subsequences. By constructing a framework based on the Augmented Dickey-Fuller (ADF) test for data stationarity, two proposals to automatically segment subsequences in a time series were developed. The former proposal, called Change Detector segmentation, relies on change detection methods of data stream mining. The latter, called ADF-based segmentation, is constructed on a new change detector derived from the ADF test only. Experiments over real-file IoT databases and benchmarks showed the improvement provided by our proposals for prediction tasks with traditional Autoregressive integrated moving average (ARIMA) and Deep Learning (Long short-term memory and Temporal Convolutional Networks) methods. Results obtained by the Long short-term memory predictive model reduced the relative prediction error from 1 to 0.67, compared to time series without segmentation.
format	Online Article Text
id	pubmed-8587387
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-85873872021-11-13 Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction Silva, Ricardo Petri Zarpelão, Bruno Bogaz Cano, Alberto Junior, Sylvio Barbon Sensors (Basel) Article A wide range of applications based on sequential data, named time series, have become increasingly popular in recent years, mainly those based on the Internet of Things (IoT). Several different machine learning algorithms exploit the patterns extracted from sequential data to support multiple tasks. However, this data can suffer from unreliable readings that can lead to low accuracy models due to the low-quality training sets available. Detecting the change point between high representative segments is an important ally to find and thread biased subsequences. By constructing a framework based on the Augmented Dickey-Fuller (ADF) test for data stationarity, two proposals to automatically segment subsequences in a time series were developed. The former proposal, called Change Detector segmentation, relies on change detection methods of data stream mining. The latter, called ADF-based segmentation, is constructed on a new change detector derived from the ADF test only. Experiments over real-file IoT databases and benchmarks showed the improvement provided by our proposals for prediction tasks with traditional Autoregressive integrated moving average (ARIMA) and Deep Learning (Long short-term memory and Temporal Convolutional Networks) methods. Results obtained by the Long short-term memory predictive model reduced the relative prediction error from 1 to 0.67, compared to time series without segmentation. MDPI 2021-11-04 /pmc/articles/PMC8587387/ /pubmed/34770639 http://dx.doi.org/10.3390/s21217333 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Silva, Ricardo Petri Zarpelão, Bruno Bogaz Cano, Alberto Junior, Sylvio Barbon Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title	Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title_full	Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title_fullStr	Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title_full_unstemmed	Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title_short	Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction
title_sort	time series segmentation based on stationarity analysis to improve new samples prediction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8587387/ https://www.ncbi.nlm.nih.gov/pubmed/34770639 http://dx.doi.org/10.3390/s21217333
work_keys_str_mv	AT silvaricardopetri timeseriessegmentationbasedonstationarityanalysistoimprovenewsamplesprediction AT zarpelaobrunobogaz timeseriessegmentationbasedonstationarityanalysistoimprovenewsamplesprediction AT canoalberto timeseriessegmentationbasedonstationarityanalysistoimprovenewsamplesprediction AT juniorsylviobarbon timeseriessegmentationbasedonstationarityanalysistoimprovenewsamplesprediction

Time Series Segmentation Based on Stationarity Analysis to Improve New Samples Prediction

Ejemplares similares