Cargando…

HARTH: A Human Activity Recognition Dataset for Machine Learning

Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Hu...

Descripción completa

Detalles Bibliográficos
Autores principales: Logacjov, Aleksej, Bach, Kerstin, Kongsvold, Atle, Bårdstu, Hilde Bremseth, Mork, Paul Jarle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659926/
https://www.ncbi.nlm.nih.gov/pubmed/34883863
http://dx.doi.org/10.3390/s21237853
_version_ 1784613079684743168
author Logacjov, Aleksej
Bach, Kerstin
Kongsvold, Atle
Bårdstu, Hilde Bremseth
Mork, Paul Jarle
author_facet Logacjov, Aleksej
Bach, Kerstin
Kongsvold, Atle
Bårdstu, Hilde Bremseth
Mork, Paul Jarle
author_sort Logacjov, Aleksej
collection PubMed
description Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Human Activity Recognition Trondheim dataset (HARTH). Twenty-two participants were recorded for 90 to 120 min during their regular working hours using two three-axial accelerometers, attached to the thigh and lower back, and a chest-mounted camera. Experts annotated the data independently using the camera’s video signal and achieved high inter-rater agreement (Fleiss’ Kappa [Formula: see text]). They labeled twelve activities. The second contribution of this paper is the training of seven different baseline machine learning models for HAR on our dataset. We used a support vector machine, k-nearest neighbor, random forest, extreme gradient boost, convolutional neural network, bidirectional long short-term memory, and convolutional neural network with multi-resolution blocks. The support vector machine achieved the best results with an F1-score of [Formula: see text] (standard deviation: [Formula: see text]), recall of [Formula: see text] , and precision of [Formula: see text] in a leave-one-subject-out cross-validation. Our highly professional recordings and annotations provide a promising benchmark dataset for researchers to develop innovative machine learning approaches for precise HAR in free living.
format Online
Article
Text
id pubmed-8659926
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86599262021-12-10 HARTH: A Human Activity Recognition Dataset for Machine Learning Logacjov, Aleksej Bach, Kerstin Kongsvold, Atle Bårdstu, Hilde Bremseth Mork, Paul Jarle Sensors (Basel) Article Existing accelerometer-based human activity recognition (HAR) benchmark datasets that were recorded during free living suffer from non-fixed sensor placement, the usage of only one sensor, and unreliable annotations. We make two contributions in this work. First, we present the publicly available Human Activity Recognition Trondheim dataset (HARTH). Twenty-two participants were recorded for 90 to 120 min during their regular working hours using two three-axial accelerometers, attached to the thigh and lower back, and a chest-mounted camera. Experts annotated the data independently using the camera’s video signal and achieved high inter-rater agreement (Fleiss’ Kappa [Formula: see text]). They labeled twelve activities. The second contribution of this paper is the training of seven different baseline machine learning models for HAR on our dataset. We used a support vector machine, k-nearest neighbor, random forest, extreme gradient boost, convolutional neural network, bidirectional long short-term memory, and convolutional neural network with multi-resolution blocks. The support vector machine achieved the best results with an F1-score of [Formula: see text] (standard deviation: [Formula: see text]), recall of [Formula: see text] , and precision of [Formula: see text] in a leave-one-subject-out cross-validation. Our highly professional recordings and annotations provide a promising benchmark dataset for researchers to develop innovative machine learning approaches for precise HAR in free living. MDPI 2021-11-25 /pmc/articles/PMC8659926/ /pubmed/34883863 http://dx.doi.org/10.3390/s21237853 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Logacjov, Aleksej
Bach, Kerstin
Kongsvold, Atle
Bårdstu, Hilde Bremseth
Mork, Paul Jarle
HARTH: A Human Activity Recognition Dataset for Machine Learning
title HARTH: A Human Activity Recognition Dataset for Machine Learning
title_full HARTH: A Human Activity Recognition Dataset for Machine Learning
title_fullStr HARTH: A Human Activity Recognition Dataset for Machine Learning
title_full_unstemmed HARTH: A Human Activity Recognition Dataset for Machine Learning
title_short HARTH: A Human Activity Recognition Dataset for Machine Learning
title_sort harth: a human activity recognition dataset for machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659926/
https://www.ncbi.nlm.nih.gov/pubmed/34883863
http://dx.doi.org/10.3390/s21237853
work_keys_str_mv AT logacjovaleksej harthahumanactivityrecognitiondatasetformachinelearning
AT bachkerstin harthahumanactivityrecognitiondatasetformachinelearning
AT kongsvoldatle harthahumanactivityrecognitiondatasetformachinelearning
AT bardstuhildebremseth harthahumanactivityrecognitiondatasetformachinelearning
AT morkpauljarle harthahumanactivityrecognitiondatasetformachinelearning