Cargando…
HARTH: A Human Activity Recognition Dataset for Machine Learning
Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Hu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659926/ https://www.ncbi.nlm.nih.gov/pubmed/34883863 http://dx.doi.org/10.3390/s21237853 |
_version_ | 1784613079684743168 |
---|---|
author | Logacjov, Aleksej Bach, Kerstin Kongsvold, Atle Bårdstu, Hilde Bremseth Mork, Paul Jarle |
author_facet | Logacjov, Aleksej Bach, Kerstin Kongsvold, Atle Bårdstu, Hilde Bremseth Mork, Paul Jarle |
author_sort | Logacjov, Aleksej |
collection | PubMed |
description | Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Human Activity Recognition Trondheim dataset (HARTH). Twenty-two participants were recorded for 90 to 120 min during their regular working hours using two three-axial accelerometers, attached to the thigh and lower back, and a chest-mounted camera. Experts annotated the data independently using the camera’s video signal and achieved high inter-rater agreement (Fleiss’ Kappa [Formula: see text]). They labeled twelve activities. The second contribution of this paper is the training of seven different baseline machine learning models for HAR on our dataset. We used a support vector machine, k-nearest neighbor, random forest, extreme gradient boost, convolutional neural network, bidirectional long short-term memory, and convolutional neural network with multi-resolution blocks. The support vector machine achieved the best results with an F1-score of [Formula: see text] (standard deviation: [Formula: see text]), recall of [Formula: see text] , and precision of [Formula: see text] in a leave-one-subject-out cross-validation. Our highly professional recordings and annotations provide a promising benchmark dataset for researchers to develop innovative machine learning approaches for precise HAR in free living. |
format | Online Article Text |
id | pubmed-8659926 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86599262021-12-10 HARTH: A Human Activity Recognition Dataset for Machine Learning Logacjov, Aleksej Bach, Kerstin Kongsvold, Atle Bårdstu, Hilde Bremseth Mork, Paul Jarle Sensors (Basel) Article Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Human Activity Recognition Trondheim dataset (HARTH). Twenty-two participants were recorded for 90 to 120 min during their regular working hours using two three-axial accelerometers, attached to the thigh and lower back, and a chest-mounted camera. Experts annotated the data independently using the camera’s video signal and achieved high inter-rater agreement (Fleiss’ Kappa [Formula: see text]). They labeled twelve activities. The second contribution of this paper is the training of seven different baseline machine learning models for HAR on our dataset. We used a support vector machine, k-nearest neighbor, random forest, extreme gradient boost, convolutional neural network, bidirectional long short-term memory, and convolutional neural network with multi-resolution blocks. The support vector machine achieved the best results with an F1-score of [Formula: see text] (standard deviation: [Formula: see text]), recall of [Formula: see text] , and precision of [Formula: see text] in a leave-one-subject-out cross-validation. Our highly professional recordings and annotations provide a promising benchmark dataset for researchers to develop innovative machine learning approaches for precise HAR in free living. MDPI 2021-11-25 /pmc/articles/PMC8659926/ /pubmed/34883863 http://dx.doi.org/10.3390/s21237853 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Logacjov, Aleksej Bach, Kerstin Kongsvold, Atle Bårdstu, Hilde Bremseth Mork, Paul Jarle HARTH: A Human Activity Recognition Dataset for Machine Learning |
title | HARTH: A Human Activity Recognition Dataset for Machine Learning |
title_full | HARTH: A Human Activity Recognition Dataset for Machine Learning |
title_fullStr | HARTH: A Human Activity Recognition Dataset for Machine Learning |
title_full_unstemmed | HARTH: A Human Activity Recognition Dataset for Machine Learning |
title_short | HARTH: A Human Activity Recognition Dataset for Machine Learning |
title_sort | harth: a human activity recognition dataset for machine learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659926/ https://www.ncbi.nlm.nih.gov/pubmed/34883863 http://dx.doi.org/10.3390/s21237853 |
work_keys_str_mv | AT logacjovaleksej harthahumanactivityrecognitiondatasetformachinelearning AT bachkerstin harthahumanactivityrecognitiondatasetformachinelearning AT kongsvoldatle harthahumanactivityrecognitiondatasetformachinelearning AT bardstuhildebremseth harthahumanactivityrecognitiondatasetformachinelearning AT morkpauljarle harthahumanactivityrecognitiondatasetformachinelearning |