Cargando…

A PointNet-Based Solution for 3D Hand Gesture Recognition

Gesture recognition is an intensively researched area for several reasons. One of the most important reasons is because of this technology’s numerous application in various domains (e.g., robotics, games, medicine, automotive, etc.) Additionally, the introduction of three-dimensional (3D) image acqu...

Descripción completa

Detalles Bibliográficos
Autores principales: Mirsu, Radu, Simion, Georgiana, Caleanu, Catalin Daniel, Pop-Calimanu, Ioana Monica
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308972/
https://www.ncbi.nlm.nih.gov/pubmed/32517141
http://dx.doi.org/10.3390/s20113226
_version_ 1783549115807825920
author Mirsu, Radu
Simion, Georgiana
Caleanu, Catalin Daniel
Pop-Calimanu, Ioana Monica
author_facet Mirsu, Radu
Simion, Georgiana
Caleanu, Catalin Daniel
Pop-Calimanu, Ioana Monica
author_sort Mirsu, Radu
collection PubMed
description Gesture recognition is an intensively researched area for several reasons. One of the most important reasons is because of this technology’s numerous application in various domains (e.g., robotics, games, medicine, automotive, etc.) Additionally, the introduction of three-dimensional (3D) image acquisition techniques (e.g., stereovision, projected-light, time-of-flight, etc.) overcomes the limitations of traditional two-dimensional (2D) approaches. Combined with the larger availability of 3D sensors (e.g., Microsoft Kinect, Intel RealSense, photonic mixer device (PMD), CamCube, etc.), recent interest in this domain has sparked. Moreover, in many computer vision tasks, the traditional statistic top approaches were outperformed by deep neural network-based solutions. In view of these considerations, we proposed a deep neural network solution by employing PointNet architecture for the problem of hand gesture recognition using depth data produced by a time of flight (ToF) sensor. We created a custom hand gesture dataset, then proposed a multistage hand segmentation by designing filtering, clustering, and finding the hand in the volume of interest and hand-forearm segmentation. For comparison purpose, two equivalent datasets were tested: a 3D point cloud dataset and a 2D image dataset, both obtained from the same stream. Besides the advantages of the 3D technology, the accuracy of the 3D method using PointNet is proven to outperform the 2D method in all circumstances, even the 2D method that employs a deep neural network.
format Online
Article
Text
id pubmed-7308972
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73089722020-06-25 A PointNet-Based Solution for 3D Hand Gesture Recognition Mirsu, Radu Simion, Georgiana Caleanu, Catalin Daniel Pop-Calimanu, Ioana Monica Sensors (Basel) Article Gesture recognition is an intensively researched area for several reasons. One of the most important reasons is because of this technology’s numerous application in various domains (e.g., robotics, games, medicine, automotive, etc.) Additionally, the introduction of three-dimensional (3D) image acquisition techniques (e.g., stereovision, projected-light, time-of-flight, etc.) overcomes the limitations of traditional two-dimensional (2D) approaches. Combined with the larger availability of 3D sensors (e.g., Microsoft Kinect, Intel RealSense, photonic mixer device (PMD), CamCube, etc.), recent interest in this domain has sparked. Moreover, in many computer vision tasks, the traditional statistic top approaches were outperformed by deep neural network-based solutions. In view of these considerations, we proposed a deep neural network solution by employing PointNet architecture for the problem of hand gesture recognition using depth data produced by a time of flight (ToF) sensor. We created a custom hand gesture dataset, then proposed a multistage hand segmentation by designing filtering, clustering, and finding the hand in the volume of interest and hand-forearm segmentation. For comparison purpose, two equivalent datasets were tested: a 3D point cloud dataset and a 2D image dataset, both obtained from the same stream. Besides the advantages of the 3D technology, the accuracy of the 3D method using PointNet is proven to outperform the 2D method in all circumstances, even the 2D method that employs a deep neural network. MDPI 2020-06-05 /pmc/articles/PMC7308972/ /pubmed/32517141 http://dx.doi.org/10.3390/s20113226 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mirsu, Radu
Simion, Georgiana
Caleanu, Catalin Daniel
Pop-Calimanu, Ioana Monica
A PointNet-Based Solution for 3D Hand Gesture Recognition
title A PointNet-Based Solution for 3D Hand Gesture Recognition
title_full A PointNet-Based Solution for 3D Hand Gesture Recognition
title_fullStr A PointNet-Based Solution for 3D Hand Gesture Recognition
title_full_unstemmed A PointNet-Based Solution for 3D Hand Gesture Recognition
title_short A PointNet-Based Solution for 3D Hand Gesture Recognition
title_sort pointnet-based solution for 3d hand gesture recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308972/
https://www.ncbi.nlm.nih.gov/pubmed/32517141
http://dx.doi.org/10.3390/s20113226
work_keys_str_mv AT mirsuradu apointnetbasedsolutionfor3dhandgesturerecognition
AT simiongeorgiana apointnetbasedsolutionfor3dhandgesturerecognition
AT caleanucatalindaniel apointnetbasedsolutionfor3dhandgesturerecognition
AT popcalimanuioanamonica apointnetbasedsolutionfor3dhandgesturerecognition
AT mirsuradu pointnetbasedsolutionfor3dhandgesturerecognition
AT simiongeorgiana pointnetbasedsolutionfor3dhandgesturerecognition
AT caleanucatalindaniel pointnetbasedsolutionfor3dhandgesturerecognition
AT popcalimanuioanamonica pointnetbasedsolutionfor3dhandgesturerecognition