Cargando…
IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346255/ https://www.ncbi.nlm.nih.gov/pubmed/37447625 http://dx.doi.org/10.3390/s23135775 |
_version_ | 1785073272045436928 |
---|---|
author | Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan |
author_facet | Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan |
author_sort | Wang, Xiangrui |
collection | PubMed |
description | Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives. |
format | Online Article Text |
id | pubmed-10346255 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103462552023-07-15 IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan Sensors (Basel) Article Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives. MDPI 2023-06-21 /pmc/articles/PMC10346255/ /pubmed/37447625 http://dx.doi.org/10.3390/s23135775 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title_full | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title_fullStr | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title_full_unstemmed | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title_short | IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography |
title_sort | irdc-net: an inception network with a residual module and dilated convolution for sign language recognition based on surface electromyography |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346255/ https://www.ncbi.nlm.nih.gov/pubmed/37447625 http://dx.doi.org/10.3390/s23135775 |
work_keys_str_mv | AT wangxiangrui irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT tanglu irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT zhengqibin irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT yangxilin irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT luzhiyuan irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography |