Cargando…

IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography

Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Xiangrui, Tang, Lu, Zheng, Qibin, Yang, Xilin, Lu, Zhiyuan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346255/ https://www.ncbi.nlm.nih.gov/pubmed/37447625 http://dx.doi.org/10.3390/s23135775

_version_	1785073272045436928
author	Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan
author_facet	Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan
author_sort	Wang, Xiangrui
collection	PubMed
description	Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives.
format	Online Article Text
id	pubmed-10346255
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-103462552023-07-15 IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan Sensors (Basel) Article Deaf and hearing-impaired people always face communication barriers. Non-invasive surface electromyography (sEMG) sensor-based sign language recognition (SLR) technology can help them to better integrate into social life. Since the traditional tandem convolutional neural network (CNN) structure used in most CNN-based studies inadequately captures the features of the input data, we propose a novel inception architecture with a residual module and dilated convolution (IRDC-net) to enlarge the receptive fields and enrich the feature maps, applying it to SLR tasks for the first time. This work first transformed the time domain signal into a time–frequency domain using discrete Fourier transformation. Second, an IRDC-net was constructed to recognize ten Chinese sign language signs. Third, the tandem CNN networks VGG-net and ResNet-18 were compared with our proposed parallel structure network, IRDC-net. Finally, the public dataset Ninapro DB1 was utilized to verify the generalization performance of the IRDC-net. The results showed that after transforming the time domain sEMG signal into the time–frequency domain, the classification accuracy (acc) increased from 84.29% to 91.70% when using the IRDC-net on our sign language dataset. Furthermore, for the time–frequency information of the public dataset Ninapro DB1, the classification accuracy reached 89.82%; this value is higher than that achieved in other recent studies. As such, our findings contribute to research into SLR tasks and to improving deaf and hearing-impaired people’s daily lives. MDPI 2023-06-21 /pmc/articles/PMC10346255/ /pubmed/37447625 http://dx.doi.org/10.3390/s23135775 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Xiangrui Tang, Lu Zheng, Qibin Yang, Xilin Lu, Zhiyuan IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title	IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title_full	IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title_fullStr	IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title_full_unstemmed	IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title_short	IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography
title_sort	irdc-net: an inception network with a residual module and dilated convolution for sign language recognition based on surface electromyography
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346255/ https://www.ncbi.nlm.nih.gov/pubmed/37447625 http://dx.doi.org/10.3390/s23135775
work_keys_str_mv	AT wangxiangrui irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT tanglu irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT zhengqibin irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT yangxilin irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography AT luzhiyuan irdcnetaninceptionnetworkwitharesidualmoduleanddilatedconvolutionforsignlanguagerecognitionbasedonsurfaceelectromyography

IRDC-Net: An Inception Network with a Residual Module and Dilated Convolution for Sign Language Recognition Based on Surface Electromyography

Ejemplares similares