Cargando…

Confidence-Calibrated Human Activity Recognition

Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used...

Descripción completa

Detalles Bibliográficos
Autores principales: Roy, Debaditya, Girdzijauskas, Sarunas, Socolovschi, Serghei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/
https://www.ncbi.nlm.nih.gov/pubmed/34640886
http://dx.doi.org/10.3390/s21196566
_version_ 1784583033868779520
author Roy, Debaditya
Girdzijauskas, Sarunas
Socolovschi, Serghei
author_facet Roy, Debaditya
Girdzijauskas, Sarunas
Socolovschi, Serghei
author_sort Roy, Debaditya
collection PubMed
description Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset.
format Online
Article
Text
id pubmed-8512601
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85126012021-10-14 Confidence-Calibrated Human Activity Recognition Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei Sensors (Basel) Article Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset. MDPI 2021-09-30 /pmc/articles/PMC8512601/ /pubmed/34640886 http://dx.doi.org/10.3390/s21196566 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Roy, Debaditya
Girdzijauskas, Sarunas
Socolovschi, Serghei
Confidence-Calibrated Human Activity Recognition
title Confidence-Calibrated Human Activity Recognition
title_full Confidence-Calibrated Human Activity Recognition
title_fullStr Confidence-Calibrated Human Activity Recognition
title_full_unstemmed Confidence-Calibrated Human Activity Recognition
title_short Confidence-Calibrated Human Activity Recognition
title_sort confidence-calibrated human activity recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/
https://www.ncbi.nlm.nih.gov/pubmed/34640886
http://dx.doi.org/10.3390/s21196566
work_keys_str_mv AT roydebaditya confidencecalibratedhumanactivityrecognition
AT girdzijauskassarunas confidencecalibratedhumanactivityrecognition
AT socolovschiserghei confidencecalibratedhumanactivityrecognition