
Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning

Human movements are characterized by highly non-linear and multi-dimensional interactions within the motor system. Therefore, the future of human movement analysis requires procedures that enhance the classification of movement patterns into relevant groups and support practitioners in their decisio...


Bibliographic Details
Main authors:	Burdack, Johannes; Horst, Fabian; Giesselbach, Sven; Hassan, Ibrahim; Daffner, Sabrina; Schöllhorn, Wolfgang I.
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2020
Subjects:	Bioengineering and Biotechnology
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7174559/ https://www.ncbi.nlm.nih.gov/pubmed/32351945 http://dx.doi.org/10.3389/fbioe.2020.00260

_version_	1783524647389626368
author	Burdack, Johannes; Horst, Fabian; Giesselbach, Sven; Hassan, Ibrahim; Daffner, Sabrina; Schöllhorn, Wolfgang I.
author_facet	Burdack, Johannes; Horst, Fabian; Giesselbach, Sven; Hassan, Ibrahim; Daffner, Sabrina; Schöllhorn, Wolfgang I.
author_sort	Burdack, Johannes
collection	PubMed
description	Human movements are characterized by highly non-linear and multi-dimensional interactions within the motor system. Therefore, the future of human movement analysis requires procedures that enhance the classification of movement patterns into relevant groups and support practitioners in their decisions. In this regard, data-driven techniques seem particularly suitable for generating classification models. Recently, an increasing emphasis on machine-learning applications has led to significant contributions, e.g., in increasing classification performance. To ensure the generalizability of machine-learning models, several data preprocessing steps are usually applied to the measured raw data before classification. In the past, various methods have been used for each of these preprocessing steps. However, there are hardly any standard procedures or systematic comparisons of these methods and their impact on classification performance. Therefore, the aim of this analysis is to compare different combinations of commonly applied data preprocessing steps and test their effects on the classification performance of gait patterns. A publicly available dataset on intra-individual changes of gait patterns was used for this analysis. Forty-two healthy participants performed 6 sessions of 15 gait trials on a single day. For each trial, two force plates recorded the three-dimensional ground reaction forces (GRFs). The data were preprocessed with the following steps: GRF filtering, time derivative, time normalization, data reduction, weight normalization, and data scaling. Subsequently, combinations of all methods from each preprocessing step were analyzed by comparing their prediction performance in a six-session classification using Support Vector Machines, Random Forest Classifiers, Multi-Layer Perceptrons, and Convolutional Neural Networks. The results indicate that filtering GRF data and a supervised data reduction (e.g., using Principal Components Analysis) lead to increased prediction performance of the machine-learning classifiers. Interestingly, the weight normalization and the number of data points (above a certain minimum) in the time normalization do not have a substantial effect. In conclusion, the present results provide first domain-specific recommendations for commonly applied data preprocessing methods and might help to build more comparable and more robust classification models based on machine learning that are suitable for practical application.
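As a rough illustration of some of the preprocessing steps named in the description (weight normalization, time normalization, time derivative, and data scaling), the following is a minimal sketch and not the authors' implementation. The function names, the order of the steps, the default of 101 points, the use of body weight in newtons, and min-max scaling to [0, 1] are all assumptions made for this example; the record does not specify them. GRF filtering and data reduction are left out.

```python
import numpy as np


def weight_normalize(signal, body_weight_n):
    """Divide a GRF signal by body weight (assumed here to be given in newtons)."""
    return np.asarray(signal, dtype=float) / body_weight_n


def time_normalize(signal, n_points=101):
    """Resample a 1-D signal to n_points samples by linear interpolation."""
    signal = np.asarray(signal, dtype=float)
    old_t = np.linspace(0.0, 1.0, signal.size)
    new_t = np.linspace(0.0, 1.0, n_points)
    return np.interp(new_t, old_t, signal)


def time_derivative(signal):
    """Finite-difference derivative with respect to the sample index."""
    return np.gradient(np.asarray(signal, dtype=float))


def min_max_scale(train, test):
    """Scale each feature to [0, 1] using the minimum and range of the training set only."""
    lo = train.min(axis=0)
    span = train.max(axis=0) - lo
    span[span == 0] = 1.0  # avoid division by zero for constant features
    return (train - lo) / span, (test - lo) / span


def preprocess_trial(raw_grf, body_weight_n, n_points=101, use_derivative=False):
    """Hypothetical per-trial pipeline: weight-normalize, resample, optionally differentiate."""
    x = weight_normalize(raw_grf, body_weight_n)
    x = time_normalize(x, n_points)
    if use_derivative:
        x = time_derivative(x)
    return x
```

Fitting the scaling parameters on the training trials only, as in `min_max_scale`, is one common way to keep test data from influencing the preprocessing; whether the study did this is not stated in the record.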
format	Online Article Text
id	pubmed-7174559
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-7174559 2020-04-29 Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning Burdack, Johannes; Horst, Fabian; Giesselbach, Sven; Hassan, Ibrahim; Daffner, Sabrina; Schöllhorn, Wolfgang I. Front Bioeng Biotechnol Bioengineering and Biotechnology Frontiers Media S.A. 2020-04-15 /pmc/articles/PMC7174559/ /pubmed/32351945 http://dx.doi.org/10.3389/fbioe.2020.00260 Text en Copyright © 2020 Burdack, Horst, Giesselbach, Hassan, Daffner and Schöllhorn. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title	Systematic Comparison of the Influence of Different Data Preprocessing Methods on the Performance of Gait Classifications Using Machine Learning
title_sort	systematic comparison of the influence of different data preprocessing methods on the performance of gait classifications using machine learning
topic	Bioengineering and Biotechnology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7174559/ https://www.ncbi.nlm.nih.gov/pubmed/32351945 http://dx.doi.org/10.3389/fbioe.2020.00260
