Cargando…

Hand Pose Recognition Using Parallel Multi Stream CNN

Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of...

Descripción completa

Detalles Bibliográficos
Autores principales: Noreen, Iram, Hamid, Muhammad, Akram, Uzma, Malik, Saadia, Saleem, Muhammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708730/
https://www.ncbi.nlm.nih.gov/pubmed/34960562
http://dx.doi.org/10.3390/s21248469
_version_ 1784622758404030464
author Noreen, Iram
Hamid, Muhammad
Akram, Uzma
Malik, Saadia
Saleem, Muhammad
author_facet Noreen, Iram
Hamid, Muhammad
Akram, Uzma
Malik, Saadia
Saleem, Muhammad
author_sort Noreen, Iram
collection PubMed
description Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of hand-pose-based applications, new challenges in this domain have also emerged. Support vector machines and neural networks have been extensively used in this domain using conventional RGB data, which are not very effective for adequate performance. Recently, depth data have become popular due to better understating of posture attributes. In this study, a multiple parallel stream 2D CNN (two-dimensional convolution neural network) model is proposed to recognize the hand postures. The proposed model comprises multiple steps and layers to detect hand poses from image maps obtained from depth data. The hyper parameters of the proposed model are tuned through experimental analysis. Three publicly available benchmark datasets: Kaggle, First Person, and Dexter, are used independently to train and test the proposed approach. The accuracy of the proposed method is 99.99%, 99.48%, and 98% using the Kaggle hand posture dataset, First Person hand posture dataset, and Dexter dataset, respectively. Further, the results obtained for F1 and AUC scores are also near-optimal. Comparative analysis with state-of-the-art shows that the proposed model outperforms the previous methods.
format Online
Article
Text
id pubmed-8708730
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87087302021-12-25 Hand Pose Recognition Using Parallel Multi Stream CNN Noreen, Iram Hamid, Muhammad Akram, Uzma Malik, Saadia Saleem, Muhammad Sensors (Basel) Article Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of hand-pose-based applications, new challenges in this domain have also emerged. Support vector machines and neural networks have been extensively used in this domain using conventional RGB data, which are not very effective for adequate performance. Recently, depth data have become popular due to better understating of posture attributes. In this study, a multiple parallel stream 2D CNN (two-dimensional convolution neural network) model is proposed to recognize the hand postures. The proposed model comprises multiple steps and layers to detect hand poses from image maps obtained from depth data. The hyper parameters of the proposed model are tuned through experimental analysis. Three publicly available benchmark datasets: Kaggle, First Person, and Dexter, are used independently to train and test the proposed approach. The accuracy of the proposed method is 99.99%, 99.48%, and 98% using the Kaggle hand posture dataset, First Person hand posture dataset, and Dexter dataset, respectively. Further, the results obtained for F1 and AUC scores are also near-optimal. Comparative analysis with state-of-the-art shows that the proposed model outperforms the previous methods. MDPI 2021-12-18 /pmc/articles/PMC8708730/ /pubmed/34960562 http://dx.doi.org/10.3390/s21248469 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Noreen, Iram
Hamid, Muhammad
Akram, Uzma
Malik, Saadia
Saleem, Muhammad
Hand Pose Recognition Using Parallel Multi Stream CNN
title Hand Pose Recognition Using Parallel Multi Stream CNN
title_full Hand Pose Recognition Using Parallel Multi Stream CNN
title_fullStr Hand Pose Recognition Using Parallel Multi Stream CNN
title_full_unstemmed Hand Pose Recognition Using Parallel Multi Stream CNN
title_short Hand Pose Recognition Using Parallel Multi Stream CNN
title_sort hand pose recognition using parallel multi stream cnn
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708730/
https://www.ncbi.nlm.nih.gov/pubmed/34960562
http://dx.doi.org/10.3390/s21248469
work_keys_str_mv AT noreeniram handposerecognitionusingparallelmultistreamcnn
AT hamidmuhammad handposerecognitionusingparallelmultistreamcnn
AT akramuzma handposerecognitionusingparallelmultistreamcnn
AT maliksaadia handposerecognitionusingparallelmultistreamcnn
AT saleemmuhammad handposerecognitionusingparallelmultistreamcnn