Cargando…
A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition
Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the numbe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775254/ https://www.ncbi.nlm.nih.gov/pubmed/35051235 http://dx.doi.org/10.1371/journal.pone.0262810 |
_version_ | 1784636541330522112 |
---|---|
author | Xu, Pufan Li, Fei Wang, Haipeng |
author_facet | Xu, Pufan Li, Fei Wang, Haipeng |
author_sort | Xu, Pufan |
collection | PubMed |
description | Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the number of movements. Targeting a large number of gesture classes, more data features such as temporal information should be persisted as much as possible. In the field of sEMG-based recognitions, the recurrent convolutional neural network (RCNN) is an advanced method due to the sequential characteristic of sEMG signals. However, the invariance of the pooling layer damages important temporal information. In the all convolutional neural network (ACNN), because of the feature-mixing convolution operation, a same output can be received from completely different inputs. This paper proposes a concatenate feature fusion (CFF) strategy and a novel concatenate feature fusion recurrent convolutional neural network (CFF-RCNN). In CFF-RCNN, a max-pooling layer and a 2-stride convolutional layer are concatenated together to replace the conventional simple dimensionality reduction layer. The featurewise pooling operation serves as a signal amplitude detector without using any parameter. The feature-mixing convolution operation calculates the contextual information. Complete evaluations are made on both the accuracy and convergence speed of the CFF-RCNN. Experiments are conducted using three sEMG benchmark databases named DB1, DB2 and DB4 from the NinaPro database. With more than 50 gestures, the classification accuracies of the CFF-RCNN are 88.87% on DB1, 99.51% on DB2, and 99.29% on DB4. These accuracies are the highest compared with reported accuracies of machine learnings and other state-of-the-art methods. To achieve accuracies of 86%, 99% and 98% for the RCNN, the training time are 2353.686 s, 816.173 s and 731.771 s, respectively. However, for the CFF-RCNN to reach the same accuracies, it needs only 1727.415 s, 542.245 s and 576.734 s, corresponding to a reduction of 26.61%, 33.56% and 21.19% in training time. We concluded that the CFF-RCNN is an improved method when classifying a large number of hand gestures. The CFF strategy significantly improved model performance with higher accuracy and faster convergence as compared to traditional RCNN. |
format | Online Article Text |
id | pubmed-8775254 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87752542022-01-21 A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition Xu, Pufan Li, Fei Wang, Haipeng PLoS One Research Article Hand gesture recognition tasks based on surface electromyography (sEMG) are vital in human-computer interaction, speech detection, robot control, and rehabilitation applications. However, existing models, whether traditional machine learnings (ML) or other state-of-the-arts, are limited in the number of movements. Targeting a large number of gesture classes, more data features such as temporal information should be persisted as much as possible. In the field of sEMG-based recognitions, the recurrent convolutional neural network (RCNN) is an advanced method due to the sequential characteristic of sEMG signals. However, the invariance of the pooling layer damages important temporal information. In the all convolutional neural network (ACNN), because of the feature-mixing convolution operation, a same output can be received from completely different inputs. This paper proposes a concatenate feature fusion (CFF) strategy and a novel concatenate feature fusion recurrent convolutional neural network (CFF-RCNN). In CFF-RCNN, a max-pooling layer and a 2-stride convolutional layer are concatenated together to replace the conventional simple dimensionality reduction layer. The featurewise pooling operation serves as a signal amplitude detector without using any parameter. The feature-mixing convolution operation calculates the contextual information. Complete evaluations are made on both the accuracy and convergence speed of the CFF-RCNN. Experiments are conducted using three sEMG benchmark databases named DB1, DB2 and DB4 from the NinaPro database. With more than 50 gestures, the classification accuracies of the CFF-RCNN are 88.87% on DB1, 99.51% on DB2, and 99.29% on DB4. These accuracies are the highest compared with reported accuracies of machine learnings and other state-of-the-art methods. To achieve accuracies of 86%, 99% and 98% for the RCNN, the training time are 2353.686 s, 816.173 s and 731.771 s, respectively. However, for the CFF-RCNN to reach the same accuracies, it needs only 1727.415 s, 542.245 s and 576.734 s, corresponding to a reduction of 26.61%, 33.56% and 21.19% in training time. We concluded that the CFF-RCNN is an improved method when classifying a large number of hand gestures. The CFF strategy significantly improved model performance with higher accuracy and faster convergence as compared to traditional RCNN. Public Library of Science 2022-01-20 /pmc/articles/PMC8775254/ /pubmed/35051235 http://dx.doi.org/10.1371/journal.pone.0262810 Text en © 2022 Xu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Xu, Pufan Li, Fei Wang, Haipeng A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title | A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title_full | A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title_fullStr | A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title_full_unstemmed | A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title_short | A novel concatenate feature fusion RCNN architecture for sEMG-based hand gesture recognition |
title_sort | novel concatenate feature fusion rcnn architecture for semg-based hand gesture recognition |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8775254/ https://www.ncbi.nlm.nih.gov/pubmed/35051235 http://dx.doi.org/10.1371/journal.pone.0262810 |
work_keys_str_mv | AT xupufan anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition AT lifei anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition AT wanghaipeng anovelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition AT xupufan novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition AT lifei novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition AT wanghaipeng novelconcatenatefeaturefusionrcnnarchitectureforsemgbasedhandgesturerecognition |