Cargando…
Hand Pose Recognition Using Parallel Multi Stream CNN
Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708730/ https://www.ncbi.nlm.nih.gov/pubmed/34960562 http://dx.doi.org/10.3390/s21248469 |
_version_ | 1784622758404030464 |
---|---|
author | Noreen, Iram Hamid, Muhammad Akram, Uzma Malik, Saadia Saleem, Muhammad |
author_facet | Noreen, Iram Hamid, Muhammad Akram, Uzma Malik, Saadia Saleem, Muhammad |
author_sort | Noreen, Iram |
collection | PubMed |
description | Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of hand-pose-based applications, new challenges in this domain have also emerged. Support vector machines and neural networks have been extensively used in this domain using conventional RGB data, which are not very effective for adequate performance. Recently, depth data have become popular due to better understating of posture attributes. In this study, a multiple parallel stream 2D CNN (two-dimensional convolution neural network) model is proposed to recognize the hand postures. The proposed model comprises multiple steps and layers to detect hand poses from image maps obtained from depth data. The hyper parameters of the proposed model are tuned through experimental analysis. Three publicly available benchmark datasets: Kaggle, First Person, and Dexter, are used independently to train and test the proposed approach. The accuracy of the proposed method is 99.99%, 99.48%, and 98% using the Kaggle hand posture dataset, First Person hand posture dataset, and Dexter dataset, respectively. Further, the results obtained for F1 and AUC scores are also near-optimal. Comparative analysis with state-of-the-art shows that the proposed model outperforms the previous methods. |
format | Online Article Text |
id | pubmed-8708730 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-87087302021-12-25 Hand Pose Recognition Using Parallel Multi Stream CNN Noreen, Iram Hamid, Muhammad Akram, Uzma Malik, Saadia Saleem, Muhammad Sensors (Basel) Article Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of hand-pose-based applications, new challenges in this domain have also emerged. Support vector machines and neural networks have been extensively used in this domain using conventional RGB data, which are not very effective for adequate performance. Recently, depth data have become popular due to better understating of posture attributes. In this study, a multiple parallel stream 2D CNN (two-dimensional convolution neural network) model is proposed to recognize the hand postures. The proposed model comprises multiple steps and layers to detect hand poses from image maps obtained from depth data. The hyper parameters of the proposed model are tuned through experimental analysis. Three publicly available benchmark datasets: Kaggle, First Person, and Dexter, are used independently to train and test the proposed approach. The accuracy of the proposed method is 99.99%, 99.48%, and 98% using the Kaggle hand posture dataset, First Person hand posture dataset, and Dexter dataset, respectively. Further, the results obtained for F1 and AUC scores are also near-optimal. Comparative analysis with state-of-the-art shows that the proposed model outperforms the previous methods. MDPI 2021-12-18 /pmc/articles/PMC8708730/ /pubmed/34960562 http://dx.doi.org/10.3390/s21248469 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Noreen, Iram Hamid, Muhammad Akram, Uzma Malik, Saadia Saleem, Muhammad Hand Pose Recognition Using Parallel Multi Stream CNN |
title | Hand Pose Recognition Using Parallel Multi Stream CNN |
title_full | Hand Pose Recognition Using Parallel Multi Stream CNN |
title_fullStr | Hand Pose Recognition Using Parallel Multi Stream CNN |
title_full_unstemmed | Hand Pose Recognition Using Parallel Multi Stream CNN |
title_short | Hand Pose Recognition Using Parallel Multi Stream CNN |
title_sort | hand pose recognition using parallel multi stream cnn |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8708730/ https://www.ncbi.nlm.nih.gov/pubmed/34960562 http://dx.doi.org/10.3390/s21248469 |
work_keys_str_mv | AT noreeniram handposerecognitionusingparallelmultistreamcnn AT hamidmuhammad handposerecognitionusingparallelmultistreamcnn AT akramuzma handposerecognitionusingparallelmultistreamcnn AT maliksaadia handposerecognitionusingparallelmultistreamcnn AT saleemmuhammad handposerecognitionusingparallelmultistreamcnn |