Cargando…

Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition

Human activity recognition (HAR) is a popular field of study. The outcomes of the projects in this area have the potential to impact on the quality of life of people with conditions such as dementia. HAR is focused primarily on applying machine learning classifiers on data from low level sensors suc...

Descripción completa

Detalles Bibliográficos
Autores principales: Neira-Rodado, Dionicio, Nugent, Chris, Cleland, Ian, Velasquez, Javier, Viloria, Amelec
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7180455/
https://www.ncbi.nlm.nih.gov/pubmed/32230844
http://dx.doi.org/10.3390/s20071858
_version_ 1783525823373901824
author Neira-Rodado, Dionicio
Nugent, Chris
Cleland, Ian
Velasquez, Javier
Viloria, Amelec
author_facet Neira-Rodado, Dionicio
Nugent, Chris
Cleland, Ian
Velasquez, Javier
Viloria, Amelec
author_sort Neira-Rodado, Dionicio
collection PubMed
description Human activity recognition (HAR) is a popular field of study. The outcomes of the projects in this area have the potential to impact on the quality of life of people with conditions such as dementia. HAR is focused primarily on applying machine learning classifiers on data from low level sensors such as accelerometers. The performance of these classifiers can be improved through an adequate training process. In order to improve the training process, multivariate outlier detection was used in order to improve the quality of data in the training set and, subsequently, performance of the classifier. The impact of the technique was evaluated with KNN and random forest (RF) classifiers. In the case of KNN, the performance of the classifier was improved from 55.9% to 63.59%.
format Online
Article
Text
id pubmed-7180455
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-71804552020-05-01 Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition Neira-Rodado, Dionicio Nugent, Chris Cleland, Ian Velasquez, Javier Viloria, Amelec Sensors (Basel) Article Human activity recognition (HAR) is a popular field of study. The outcomes of the projects in this area have the potential to impact on the quality of life of people with conditions such as dementia. HAR is focused primarily on applying machine learning classifiers on data from low level sensors such as accelerometers. The performance of these classifiers can be improved through an adequate training process. In order to improve the training process, multivariate outlier detection was used in order to improve the quality of data in the training set and, subsequently, performance of the classifier. The impact of the technique was evaluated with KNN and random forest (RF) classifiers. In the case of KNN, the performance of the classifier was improved from 55.9% to 63.59%. MDPI 2020-03-27 /pmc/articles/PMC7180455/ /pubmed/32230844 http://dx.doi.org/10.3390/s20071858 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Neira-Rodado, Dionicio
Nugent, Chris
Cleland, Ian
Velasquez, Javier
Viloria, Amelec
Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title_full Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title_fullStr Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title_full_unstemmed Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title_short Evaluating the Impact of a Two-Stage Multivariate Data Cleansing Approach to Improve to the Performance of Machine Learning Classifiers: A Case Study in Human Activity Recognition
title_sort evaluating the impact of a two-stage multivariate data cleansing approach to improve to the performance of machine learning classifiers: a case study in human activity recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7180455/
https://www.ncbi.nlm.nih.gov/pubmed/32230844
http://dx.doi.org/10.3390/s20071858
work_keys_str_mv AT neirarodadodionicio evaluatingtheimpactofatwostagemultivariatedatacleansingapproachtoimprovetotheperformanceofmachinelearningclassifiersacasestudyinhumanactivityrecognition
AT nugentchris evaluatingtheimpactofatwostagemultivariatedatacleansingapproachtoimprovetotheperformanceofmachinelearningclassifiersacasestudyinhumanactivityrecognition
AT clelandian evaluatingtheimpactofatwostagemultivariatedatacleansingapproachtoimprovetotheperformanceofmachinelearningclassifiersacasestudyinhumanactivityrecognition
AT velasquezjavier evaluatingtheimpactofatwostagemultivariatedatacleansingapproachtoimprovetotheperformanceofmachinelearningclassifiersacasestudyinhumanactivityrecognition
AT viloriaamelec evaluatingtheimpactofatwostagemultivariatedatacleansingapproachtoimprovetotheperformanceofmachinelearningclassifiersacasestudyinhumanactivityrecognition