Cargando…

The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances

In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been...

Descripción completa

Detalles Bibliográficos
Autores principales: Bagnall, Anthony, Lines, Jason, Bostrom, Aaron, Large, James, Keogh, Eamonn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6404674/
https://www.ncbi.nlm.nih.gov/pubmed/30930678
http://dx.doi.org/10.1007/s10618-016-0483-9
_version_ 1783400938955866112
author Bagnall, Anthony
Lines, Jason
Bostrom, Aaron
Large, James
Keogh, Eamonn
author_facet Bagnall, Anthony
Lines, Jason
Bostrom, Aaron
Large, James
Keogh, Eamonn
author_sort Bagnall, Anthony
collection PubMed
description In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been expanded to 85 data sets, over half of which have been donated by researchers at the University of East Anglia. Aspects of previous evaluations have made comparisons between algorithms difficult. For example, several different programming languages have been used, experiments involved a single train/test split and some used normalised data whilst others did not. The relaunch of the archive provides a timely opportunity to thoroughly evaluate algorithms on a larger number of datasets. We have implemented 18 recently proposed algorithms in a common Java framework and compared them against two standard benchmark classifiers (and each other) by performing 100 resampling experiments on each of the 85 datasets. We use these results to test several hypotheses relating to whether the algorithms are significantly more accurate than the benchmarks and each other. Our results indicate that only nine of these algorithms are significantly more accurate than both benchmarks and that one classifier, the collective of transformation ensembles, is significantly more accurate than all of the others. All of our experiments and results are reproducible: we release all of our code, results and experimental details and we hope these experiments form the basis for more robust testing of new algorithms in the future.
format Online
Article
Text
id pubmed-6404674
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-64046742019-03-27 The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances Bagnall, Anthony Lines, Jason Bostrom, Aaron Large, James Keogh, Eamonn Data Min Knowl Discov Article In the last 5 years there have been a large number of new time series classification algorithms proposed in the literature. These algorithms have been evaluated on subsets of the 47 data sets in the University of California, Riverside time series classification archive. The archive has recently been expanded to 85 data sets, over half of which have been donated by researchers at the University of East Anglia. Aspects of previous evaluations have made comparisons between algorithms difficult. For example, several different programming languages have been used, experiments involved a single train/test split and some used normalised data whilst others did not. The relaunch of the archive provides a timely opportunity to thoroughly evaluate algorithms on a larger number of datasets. We have implemented 18 recently proposed algorithms in a common Java framework and compared them against two standard benchmark classifiers (and each other) by performing 100 resampling experiments on each of the 85 datasets. We use these results to test several hypotheses relating to whether the algorithms are significantly more accurate than the benchmarks and each other. Our results indicate that only nine of these algorithms are significantly more accurate than both benchmarks and that one classifier, the collective of transformation ensembles, is significantly more accurate than all of the others. All of our experiments and results are reproducible: we release all of our code, results and experimental details and we hope these experiments form the basis for more robust testing of new algorithms in the future. Springer US 2016-11-23 2017 /pmc/articles/PMC6404674/ /pubmed/30930678 http://dx.doi.org/10.1007/s10618-016-0483-9 Text en © The Author(s) 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Bagnall, Anthony
Lines, Jason
Bostrom, Aaron
Large, James
Keogh, Eamonn
The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_full The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_fullStr The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_full_unstemmed The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_short The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
title_sort great time series classification bake off: a review and experimental evaluation of recent algorithmic advances
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6404674/
https://www.ncbi.nlm.nih.gov/pubmed/30930678
http://dx.doi.org/10.1007/s10618-016-0483-9
work_keys_str_mv AT bagnallanthony thegreattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT linesjason thegreattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT bostromaaron thegreattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT largejames thegreattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT keogheamonn thegreattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT bagnallanthony greattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT linesjason greattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT bostromaaron greattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT largejames greattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances
AT keogheamonn greattimeseriesclassificationbakeoffareviewandexperimentalevaluationofrecentalgorithmicadvances