Cargando…

A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition

Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the numbe...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Pufan, Li, Fei, Wang, Haipeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775254/
https://www.ncbi.nlm.nih.gov/pubmed/35051235
http://dx.doi.org/10.1371/journal.pone.0262810
_version_ 1784636541330522112
author Xu, Pufan
Li, Fei
Wang, Haipeng
author_facet Xu, Pufan
Li, Fei
Wang, Haipeng
author_sort Xu, Pufan
collection PubMed
description Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the number of movements. Targeting a large number of gesture classes, more data features such as temporal information should be persisted as much as possible. In the field of sEMG-based recognitions, the recurrent convolutional neural network (RCNN) is an advanced method due to the sequential characteristic of sEMG signals. However, the invariance of the pooling layer damages important temporal information. In the all convolutional neural network (ACNN), because of the feature-mixing convolution operation, a same output can be received from completely different inputs. This paper proposes a concatenate feature fusion (CFF) strategy and a novel concatenate feature fusion recurrent convolutional neural network (CFF-RCNN). In CFF-RCNN, a max-pooling layer and a 2-stride convolutional layer are concatenated together to replace the conventional simple dimensionality reduction layer. The featurewise pooling operation serves as a signal amplitude detector without using any parameter. The feature-mixing convolution operation calculates the contextual information. Complete evaluations are made on both the accuracy and convergence speed of the CFF-RCNN. Experiments are conducted using three sEMG benchmark databases named DB1, DB2 and DB4 from the NinaPro database. With more than 50 gestures, the classification accuracies of the CFF-RCNN are 88.87% on DB1, 99.51% on DB2, and 99.29% on DB4. These accuracies are the highest compared with reported accuracies of machine learnings and other state-of-the-art methods. To achieve accuracies of 86%, 99% and 98% for the RCNN, the training time are 2353.686 s, 816.173 s and 731.771 s, respectively. However, for the CFF-RCNN to reach the same accuracies, it needs only 1727.415 s, 542.245 s and 576.734 s, corresponding to a reduction of 26.61%, 33.56% and 21.19% in training time. We concluded that the CFF-RCNN is an improved method when classifying a large number of hand gestures. The CFF strategy significantly improved model performance with higher accuracy and faster convergence as compared to traditional RCNN.
format Online
Article
Text
id pubmed-8775254
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87752542022-01-21 A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition Xu, Pufan Li, Fei Wang, Haipeng PLoS One Research Article Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the number of movements. Targeting a large number of gesture classes, more data features such as temporal information should be persisted as much as possible. In the field of sEMG-based recognitions, the recurrent convolutional neural network (RCNN) is an advanced method due to the sequential characteristic of sEMG signals. However, the invariance of the pooling layer damages important temporal information. In the all convolutional neural network (ACNN), because of the feature-mixing convolution operation, a same output can be received from completely different inputs. This paper proposes a concatenate feature fusion (CFF) strategy and a novel concatenate feature fusion recurrent convolutional neural network (CFF-RCNN). In CFF-RCNN, a max-pooling layer and a 2-stride convolutional layer are concatenated together to replace the conventional simple dimensionality reduction layer. The featurewise pooling operation serves as a signal amplitude detector without using any parameter. The feature-mixing convolution operation calculates the contextual information. Complete evaluations are made on both the accuracy and convergence speed of the CFF-RCNN. Experiments are conducted using three sEMG benchmark databases named DB1, DB2 and DB4 from the NinaPro database. With more than 50 gestures, the classification accuracies of the CFF-RCNN are 88.87% on DB1, 99.51% on DB2, and 99.29% on DB4. These accuracies are the highest compared with reported accuracies of machine learnings and other state-of-the-art methods. To achieve accuracies of 86%, 99% and 98% for the RCNN, the training time are 2353.686 s, 816.173 s and 731.771 s, respectively. However, for the CFF-RCNN to reach the same accuracies, it needs only 1727.415 s, 542.245 s and 576.734 s, corresponding to a reduction of 26.61%, 33.56% and 21.19% in training time. We concluded that the CFF-RCNN is an improved method when classifying a large number of hand gestures. The CFF strategy significantly improved model performance with higher accuracy and faster convergence as compared to traditional RCNN. Public Library of Science 2022-01-20 /pmc/articles/PMC8775254/ /pubmed/35051235 http://dx.doi.org/10.1371/journal.pone.0262810 Text en © 2022 Xu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Xu, Pufan
Li, Fei
Wang, Haipeng
A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title_full A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title_fullStr A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title_full_unstemmed A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title_short A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
title_sort novel concatenate feature fusion rcnn architecture for semg-based hand gesture recognition
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775254/
https://www.ncbi.nlm.nih.gov/pubmed/35051235
http://dx.doi.org/10.1371/journal.pone.0262810
work_keys_str_mv AT xupufan anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition
AT lifei anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition
AT wanghaipeng anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition
AT xupufan novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition
AT lifei novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition
AT wanghaipeng novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition