Cargando…

The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances

Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real valued, attributes. Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art. The main focus has...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruiz, Alejandro Pasos, Flynn, Michael, Large, James, Middlehurst, Matthew, Bagnall, Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7897627/
https://www.ncbi.nlm.nih.gov/pubmed/33679210
http://dx.doi.org/10.1007/s10618-020-00727-3
_version_ 1783653707757387776
author Ruiz, Alejandro Pasos
Flynn, Michael
Large, James
Middlehurst, Matthew
Bagnall, Anthony
author_facet Ruiz, Alejandro Pasos
Flynn, Michael
Large, James
Middlehurst, Matthew
Bagnall, Anthony
author_sort Ruiz, Alejandro Pasos
collection PubMed
description Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real valued, attributes. Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. In reality, it is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. Despite this, much less consideration has been given to MTSC than the univariate case. The UCR archive has provided a valuable resource for univariate TSC, and the lack of a standard set of test problems may explain why there has been less focus on MTSC. The UEA archive of 30 MTSC problems released in 2018 has made comparison of algorithms easier. We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches. If an algorithm cannot naturally handle multivariate data, the simplest approach to adapt a univariate classifier to MTSC is to ensemble it over the multivariate dimensions. We compare the bespoke algorithms to these dimension independent approaches on the 26 of the 30 MTSC archive problems where the data are all of equal length. We demonstrate that four classifiers are significantly more accurate than the benchmark dynamic time warping algorithm and that one of these recently proposed classifiers, ROCKET, achieves significant improvement on the archive datasets in at least an order of magnitude less time than the other three.
format Online
Article
Text
id pubmed-7897627
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-78976272021-03-05 The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances Ruiz, Alejandro Pasos Flynn, Michael Large, James Middlehurst, Matthew Bagnall, Anthony Data Min Knowl Discov Article Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real valued, attributes. Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. In reality, it is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. Despite this, much less consideration has been given to MTSC than the univariate case. The UCR archive has provided a valuable resource for univariate TSC, and the lack of a standard set of test problems may explain why there has been less focus on MTSC. The UEA archive of 30 MTSC problems released in 2018 has made comparison of algorithms easier. We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches. If an algorithm cannot naturally handle multivariate data, the simplest approach to adapt a univariate classifier to MTSC is to ensemble it over the multivariate dimensions. We compare the bespoke algorithms to these dimension independent approaches on the 26 of the 30 MTSC archive problems where the data are all of equal length. We demonstrate that four classifiers are significantly more accurate than the benchmark dynamic time warping algorithm and that one of these recently proposed classifiers, ROCKET, achieves significant improvement on the archive datasets in at least an order of magnitude less time than the other three. Springer US 2020-12-18 2021 /pmc/articles/PMC7897627/ /pubmed/33679210 http://dx.doi.org/10.1007/s10618-020-00727-3 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Ruiz, Alejandro Pasos
Flynn, Michael
Large, James
Middlehurst, Matthew
Bagnall, Anthony
The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_full The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_fullStr The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_full_unstemmed The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_short The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_sort great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7897627/
https://www.ncbi.nlm.nih.gov/pubmed/33679210
http://dx.doi.org/10.1007/s10618-020-00727-3
work_keys_str_mv AT ruizalejandropasos thegreatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT flynnmichael thegreatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT largejames thegreatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT middlehurstmatthew thegreatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT bagnallanthony thegreatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT ruizalejandropasos greatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT flynnmichael greatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT largejames greatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT middlehurstmatthew greatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT bagnallanthony greatmultivariatetimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances