Cargando…

Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition

Hand gestures are useful tools for many applications in the human-computer interaction community. Here, the objective is to track the movement of the hand irrespective of the shape, size and color of the hand. And, for this, a motion template guided by optical flow (OFMT) is proposed. OFMT is a comp...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sarma, Debajit, Kavyasree, V., Bhuyan, M. K.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer London 2022
Materias:	S.I. : Low Resource Machine Learning Algorithms (LR-MLA)
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420673/ https://www.ncbi.nlm.nih.gov/pubmed/36060497 http://dx.doi.org/10.1007/s11334-022-00477-z

_version_	1784777440341524480
author	Sarma, Debajit Kavyasree, V. Bhuyan, M. K.
author_facet	Sarma, Debajit Kavyasree, V. Bhuyan, M. K.
author_sort	Sarma, Debajit
collection	PubMed
description	Hand gestures are useful tools for many applications in the human-computer interaction community. Here, the objective is to track the movement of the hand irrespective of the shape, size and color of the hand. And, for this, a motion template guided by optical flow (OFMT) is proposed. OFMT is a compact representation of the motion information of a gesture encoded into a single image. Recently, deep networks have shown impressive improvements as compared to conventional hand-crafted feature-based techniques. Moreover, it is seen that the use of different streams with informative input data helps to increase the recognition performance. This work basically proposes a two-stream fusion model for hand gesture recognition. The two-stream network consists of two layers—a 3D convolutional neural network (C3D) that takes gesture videos as input and a 2D-CNN that takes OFMT images as input. C3D has shown its efficiency in capturing spatiotemporal information of a video, whereas OFMT helps to eliminate irrelevant gestures providing additional motion information. Though each stream can work independently, they are combined with a fusion scheme to boost the recognition results. We have shown the efficiency of the proposed two-stream network on two databases.
format	Online Article Text
id	pubmed-9420673
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer London
record_format	MEDLINE/PubMed
spelling	pubmed-94206732022-08-30 Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition Sarma, Debajit Kavyasree, V. Bhuyan, M. K. Innov Syst Softw Eng S.I. : Low Resource Machine Learning Algorithms (LR-MLA) Hand gestures are useful tools for many applications in the human-computer interaction community. Here, the objective is to track the movement of the hand irrespective of the shape, size and color of the hand. And, for this, a motion template guided by optical flow (OFMT) is proposed. OFMT is a compact representation of the motion information of a gesture encoded into a single image. Recently, deep networks have shown impressive improvements as compared to conventional hand-crafted feature-based techniques. Moreover, it is seen that the use of different streams with informative input data helps to increase the recognition performance. This work basically proposes a two-stream fusion model for hand gesture recognition. The two-stream network consists of two layers—a 3D convolutional neural network (C3D) that takes gesture videos as input and a 2D-CNN that takes OFMT images as input. C3D has shown its efficiency in capturing spatiotemporal information of a video, whereas OFMT helps to eliminate irrelevant gestures providing additional motion information. Though each stream can work independently, they are combined with a fusion scheme to boost the recognition results. We have shown the efficiency of the proposed two-stream network on two databases. Springer London 2022-08-29 /pmc/articles/PMC9420673/ /pubmed/36060497 http://dx.doi.org/10.1007/s11334-022-00477-z Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	S.I. : Low Resource Machine Learning Algorithms (LR-MLA) Sarma, Debajit Kavyasree, V. Bhuyan, M. K. Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title	Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title_full	Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title_fullStr	Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title_full_unstemmed	Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title_short	Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition
title_sort	two-stream fusion model using 3d-cnn and 2d-cnn via video-frames and optical flow motion templates for hand gesture recognition
topic	S.I. : Low Resource Machine Learning Algorithms (LR-MLA)
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9420673/ https://www.ncbi.nlm.nih.gov/pubmed/36060497 http://dx.doi.org/10.1007/s11334-022-00477-z
work_keys_str_mv	AT sarmadebajit twostreamfusionmodelusing3dcnnand2dcnnviavideoframesandopticalflowmotiontemplatesforhandgesturerecognition AT kavyasreev twostreamfusionmodelusing3dcnnand2dcnnviavideoframesandopticalflowmotiontemplatesforhandgesturerecognition AT bhuyanmk twostreamfusionmodelusing3dcnnand2dcnnviavideoframesandopticalflowmotiontemplatesforhandgesturerecognition

Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition

Ejemplares similares