Cargando…

Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model

With the recent growth of Smart TV technology, the demand for unique and beneficial applications motivates the study of a unique gesture-based system for a smart TV-like environment. Combining movie recommendation, social media platform, call a friend application, weather updates, chatting app, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Hakim, Noorkholis Luthfil, Shih, Timothy K., Kasthuri Arachchi, Sandeli Priyanwada, Aditya, Wisnu, Chen, Yi-Cheng, Lin, Chih-Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6961023/
https://www.ncbi.nlm.nih.gov/pubmed/31835404
http://dx.doi.org/10.3390/s19245429
_version_ 1783487904533708800
author Hakim, Noorkholis Luthfil
Shih, Timothy K.
Kasthuri Arachchi, Sandeli Priyanwada
Aditya, Wisnu
Chen, Yi-Cheng
Lin, Chih-Yang
author_facet Hakim, Noorkholis Luthfil
Shih, Timothy K.
Kasthuri Arachchi, Sandeli Priyanwada
Aditya, Wisnu
Chen, Yi-Cheng
Lin, Chih-Yang
author_sort Hakim, Noorkholis Luthfil
collection PubMed
description With the recent growth of Smart TV technology, the demand for unique and beneficial applications motivates the study of a unique gesture-based system for a smart TV-like environment. Combining movie recommendation, social media platform, call a friend application, weather updates, chatting app, and tourism platform into a single system regulated by natural-like gesture controller is proposed to allow the ease of use and natural interaction. Gesture recognition problem solving was designed through 24 gestures of 13 static and 11 dynamic gestures that suit to the environment. Dataset of a sequence of RGB and depth images were collected, preprocessed, and trained in the proposed deep learning architecture. Combination of three-dimensional Convolutional Neural Network (3DCNN) followed by Long Short-Term Memory (LSTM) model was used to extract the spatio-temporal features. At the end of the classification, Finite State Machine (FSM) communicates the model to control the class decision results based on application context. The result suggested the combination data of depth and RGB to hold 97.8% of accuracy rate on eight selected gestures, while the FSM has improved the recognition rate from 89% to 91% in a real-time performance.
format Online
Article
Text
id pubmed-6961023
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-69610232020-01-24 Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model Hakim, Noorkholis Luthfil Shih, Timothy K. Kasthuri Arachchi, Sandeli Priyanwada Aditya, Wisnu Chen, Yi-Cheng Lin, Chih-Yang Sensors (Basel) Article With the recent growth of Smart TV technology, the demand for unique and beneficial applications motivates the study of a unique gesture-based system for a smart TV-like environment. Combining movie recommendation, social media platform, call a friend application, weather updates, chatting app, and tourism platform into a single system regulated by natural-like gesture controller is proposed to allow the ease of use and natural interaction. Gesture recognition problem solving was designed through 24 gestures of 13 static and 11 dynamic gestures that suit to the environment. Dataset of a sequence of RGB and depth images were collected, preprocessed, and trained in the proposed deep learning architecture. Combination of three-dimensional Convolutional Neural Network (3DCNN) followed by Long Short-Term Memory (LSTM) model was used to extract the spatio-temporal features. At the end of the classification, Finite State Machine (FSM) communicates the model to control the class decision results based on application context. The result suggested the combination data of depth and RGB to hold 97.8% of accuracy rate on eight selected gestures, while the FSM has improved the recognition rate from 89% to 91% in a real-time performance. MDPI 2019-12-09 /pmc/articles/PMC6961023/ /pubmed/31835404 http://dx.doi.org/10.3390/s19245429 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hakim, Noorkholis Luthfil
Shih, Timothy K.
Kasthuri Arachchi, Sandeli Priyanwada
Aditya, Wisnu
Chen, Yi-Cheng
Lin, Chih-Yang
Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title_full Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title_fullStr Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title_full_unstemmed Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title_short Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model
title_sort dynamic hand gesture recognition using 3dcnn and lstm with fsm context-aware model
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6961023/
https://www.ncbi.nlm.nih.gov/pubmed/31835404
http://dx.doi.org/10.3390/s19245429
work_keys_str_mv AT hakimnoorkholisluthfil dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel
AT shihtimothyk dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel
AT kasthuriarachchisandelipriyanwada dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel
AT adityawisnu dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel
AT chenyicheng dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel
AT linchihyang dynamichandgesturerecognitionusing3dcnnandlstmwithfsmcontextawaremodel