Cargando…

Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer

Hand gesture recognition is a challenging topic in the field of computer vision. Multimodal hand gesture recognition based on RGB-D is with higher accuracy than that of only RGB or depth. It is not difficult to conclude that the gain originates from the complementary information existing in the two...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jinghua, Liu, Runze, Kong, Dehui, Wang, Shaofan, Wang, Lichun, Yin, Baocai, Gao, Ronghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8616693/
https://www.ncbi.nlm.nih.gov/pubmed/34840561
http://dx.doi.org/10.1155/2021/5044916
_version_ 1784604399976316928
author Li, Jinghua
Liu, Runze
Kong, Dehui
Wang, Shaofan
Wang, Lichun
Yin, Baocai
Gao, Ronghua
author_facet Li, Jinghua
Liu, Runze
Kong, Dehui
Wang, Shaofan
Wang, Lichun
Yin, Baocai
Gao, Ronghua
author_sort Li, Jinghua
collection PubMed
description Hand gesture recognition is a challenging topic in the field of computer vision. Multimodal hand gesture recognition based on RGB-D is with higher accuracy than that of only RGB or depth. It is not difficult to conclude that the gain originates from the complementary information existing in the two modalities. However, in reality, multimodal data are not always easy to acquire simultaneously, while unimodal RGB or depth hand gesture data are more general. Therefore, one hand gesture system is expected, in which only unimordal RGB or Depth data is supported for testing, while multimodal RGB-D data is available for training so as to attain the complementary information. Fortunately, a kind of method via multimodal training and unimodal testing has been proposed. However, unimodal feature representation and cross-modality transfer still need to be further improved. To this end, this paper proposes a new 3D-Ghost and Spatial Attention Inflated 3D ConvNet (3DGSAI) to extract high-quality features for each modality. The baseline of 3DGSAI network is Inflated 3D ConvNet (I3D), and two main improvements are proposed. One is 3D-Ghost module, and the other is the spatial attention mechanism. The 3D-Ghost module can extract richer features for hand gesture representation, and the spatial attention mechanism makes the network pay more attention to hand region. This paper also proposes an adaptive parameter for positive knowledge transfer, which ensures that the transfer always occurs from the strong modality network to the weak one. Extensive experiments on SKIG, VIVA, and NVGesture datasets demonstrate that our method is competitive with the state of the art. Especially, the performance of our method reaches 97.87% on the SKIG dataset using only RGB, which is the current best result.
format Online
Article
Text
id pubmed-8616693
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-86166932021-11-26 Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer Li, Jinghua Liu, Runze Kong, Dehui Wang, Shaofan Wang, Lichun Yin, Baocai Gao, Ronghua Comput Intell Neurosci Research Article Hand gesture recognition is a challenging topic in the field of computer vision. Multimodal hand gesture recognition based on RGB-D is with higher accuracy than that of only RGB or depth. It is not difficult to conclude that the gain originates from the complementary information existing in the two modalities. However, in reality, multimodal data are not always easy to acquire simultaneously, while unimodal RGB or depth hand gesture data are more general. Therefore, one hand gesture system is expected, in which only unimordal RGB or Depth data is supported for testing, while multimodal RGB-D data is available for training so as to attain the complementary information. Fortunately, a kind of method via multimodal training and unimodal testing has been proposed. However, unimodal feature representation and cross-modality transfer still need to be further improved. To this end, this paper proposes a new 3D-Ghost and Spatial Attention Inflated 3D ConvNet (3DGSAI) to extract high-quality features for each modality. The baseline of 3DGSAI network is Inflated 3D ConvNet (I3D), and two main improvements are proposed. One is 3D-Ghost module, and the other is the spatial attention mechanism. The 3D-Ghost module can extract richer features for hand gesture representation, and the spatial attention mechanism makes the network pay more attention to hand region. This paper also proposes an adaptive parameter for positive knowledge transfer, which ensures that the transfer always occurs from the strong modality network to the weak one. Extensive experiments on SKIG, VIVA, and NVGesture datasets demonstrate that our method is competitive with the state of the art. Especially, the performance of our method reaches 97.87% on the SKIG dataset using only RGB, which is the current best result. Hindawi 2021-11-18 /pmc/articles/PMC8616693/ /pubmed/34840561 http://dx.doi.org/10.1155/2021/5044916 Text en Copyright © 2021 Jinghua Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Jinghua
Liu, Runze
Kong, Dehui
Wang, Shaofan
Wang, Lichun
Yin, Baocai
Gao, Ronghua
Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title_full Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title_fullStr Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title_full_unstemmed Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title_short Attentive 3D-Ghost Module for Dynamic Hand Gesture Recognition with Positive Knowledge Transfer
title_sort attentive 3d-ghost module for dynamic hand gesture recognition with positive knowledge transfer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8616693/
https://www.ncbi.nlm.nih.gov/pubmed/34840561
http://dx.doi.org/10.1155/2021/5044916
work_keys_str_mv AT lijinghua attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT liurunze attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT kongdehui attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT wangshaofan attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT wanglichun attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT yinbaocai attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer
AT gaoronghua attentive3dghostmodulefordynamichandgesturerecognitionwithpositiveknowledgetransfer