Cargando…
Confidence-Calibrated Human Activity Recognition
Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/ https://www.ncbi.nlm.nih.gov/pubmed/34640886 http://dx.doi.org/10.3390/s21196566 |
_version_ | 1784583033868779520 |
---|---|
author | Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei |
author_facet | Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei |
author_sort | Roy, Debaditya |
collection | PubMed |
description | Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset. |
format | Online Article Text |
id | pubmed-8512601 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85126012021-10-14 Confidence-Calibrated Human Activity Recognition Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei Sensors (Basel) Article Wearable sensors are widely used in activity recognition (AR) tasks with broad applicability in health and well-being, sports, geriatric care, etc. Deep learning (DL) has been at the forefront of progress in activity classification with wearable sensors. However, most state-of-the-art DL models used for AR are trained to discriminate different activity classes at high accuracy, not considering the confidence calibration of predictive output of those models. This results in probabilistic estimates that might not capture the true likelihood and is thus unreliable. In practice, it tends to produce overconfident estimates. In this paper, the problem is addressed by proposing deep time ensembles, a novel ensembling method capable of producing calibrated confidence estimates from neural network architectures. In particular, the method trains an ensemble of network models with temporal sequences extracted by varying the window size over the input time series and averaging the predictive output. The method is evaluated on four different benchmark HAR datasets and three different neural network architectures. Across all the datasets and architectures, our method shows an improvement in calibration by reducing the expected calibration error (ECE)by at least 40%, thereby providing superior likelihood estimates. In addition to providing reliable predictions our method also outperforms the state-of-the-art classification results in the WISDM, UCI HAR, and PAMAP2 datasets and performs as good as the state-of-the-art in the Skoda dataset. MDPI 2021-09-30 /pmc/articles/PMC8512601/ /pubmed/34640886 http://dx.doi.org/10.3390/s21196566 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Roy, Debaditya Girdzijauskas, Sarunas Socolovschi, Serghei Confidence-Calibrated Human Activity Recognition |
title | Confidence-Calibrated Human Activity Recognition |
title_full | Confidence-Calibrated Human Activity Recognition |
title_fullStr | Confidence-Calibrated Human Activity Recognition |
title_full_unstemmed | Confidence-Calibrated Human Activity Recognition |
title_short | Confidence-Calibrated Human Activity Recognition |
title_sort | confidence-calibrated human activity recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8512601/ https://www.ncbi.nlm.nih.gov/pubmed/34640886 http://dx.doi.org/10.3390/s21196566 |
work_keys_str_mv | AT roydebaditya confidencecalibratedhumanactivityrecognition AT girdzijauskassarunas confidencecalibratedhumanactivityrecognition AT socolovschiserghei confidencecalibratedhumanactivityrecognition |