Cargando…

Confidence-Calibrated Human Activity Recognition

Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used...

Descripción completa

Detalles Bibliográficos
Autores principales:	Roy, Debaditya, Girdzijauskas, Sarunas, Socolovschi, Serghei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/ https://www.ncbi.nlm.nih.gov/pubmed/34640886 http://dx.doi.org/10.3390/s21196566

_version_	1784583033868779520
author	Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei
author_facet	Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei
author_sort	Roy, Debaditya
collection	PubMed
description	Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset.
format	Online Article Text
id	pubmed-8512601
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-85126012021-10-14 Confidence-Calibrated Human Activity Recognition Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei Sensors (Basel) Article Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset. MDPI 2021-09-30 /pmc/articles/PMC8512601/ /pubmed/34640886 http://dx.doi.org/10.3390/s21196566 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei Confidence-Calibrated Human Activity Recognition
title	Confidence-Calibrated Human Activity Recognition
title_full	Confidence-Calibrated Human Activity Recognition
title_fullStr	Confidence-Calibrated Human Activity Recognition
title_full_unstemmed	Confidence-Calibrated Human Activity Recognition
title_short	Confidence-Calibrated Human Activity Recognition
title_sort	confidence-calibrated human activity recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/ https://www.ncbi.nlm.nih.gov/pubmed/34640886 http://dx.doi.org/10.3390/s21196566
work_keys_str_mv	AT roydebaditya confidencecalibratedhumanactivityrecognition AT girdzijauskassarunas confidencecalibratedhumanactivityrecognition AT socolovschiserghei confidencecalibratedhumanactivityrecognition

Confidence-Calibrated Human Activity Recognition

Ejemplares similares