Cargando…

SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation

Hand pose estimation (HPE) plays an important role during the functional assessment of the hand and in potential rehabilitation. It is a challenge to predict the pose of the hand conveniently and accurately during functional tasks, and this limits the application of HPE. In this paper, we propose a...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Chenfei, Hu, Boce, Chen, Jiawei, Ai, Xupeng, Agrawal, Sunil K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9952393/
https://www.ncbi.nlm.nih.gov/pubmed/36829620
http://dx.doi.org/10.3390/bioengineering10020126
_version_ 1784893620421132288
author Zhu, Chenfei
Hu, Boce
Chen, Jiawei
Ai, Xupeng
Agrawal, Sunil K.
author_facet Zhu, Chenfei
Hu, Boce
Chen, Jiawei
Ai, Xupeng
Agrawal, Sunil K.
author_sort Zhu, Chenfei
collection PubMed
description Hand pose estimation (HPE) plays an important role during the functional assessment of the hand and in potential rehabilitation. It is a challenge to predict the pose of the hand conveniently and accurately during functional tasks, and this limits the application of HPE. In this paper, we propose a novel architecture of a shifted attention regression network (SARN) to perform HPE. Given a depth image, SARN first predicts the spatial relationships between points in the depth image and a group of hand keypoints that determine the pose of the hand. Then, SARN uses these spatial relationships to infer the 3D position of each hand keypoint. To verify the effectiveness of the proposed method, we conducted experiments on three open-source datasets of 3D hand poses: NYU, ICVL, and MSRA. The proposed method achieved state-of-the-art performance with 7.32 mm, 5.91 mm, and 7.17 mm of mean error at the hand keypoints, i.e., mean Euclidean distance between the predicted and ground-truth hand keypoint positions. Additionally, to test the feasibility of SARN in hand movement recognition, a hand movement dataset of 26K depth images from 17 healthy subjects was constructed based on the finger tapping test, an important component of neurological exams administered to Parkinson’s patients. Each image was annotated with the tips of the index finger and the thumb. For this dataset, the proposed method achieved a mean error of 2.99 mm at the hand keypoints and comparable performance on three task-specific metrics: the distance, velocity, and acceleration of the relative movement of the two fingertips. Results on the open-source datasets demonstrated the effectiveness of the proposed method, and results on our finger tapping dataset validated its potential for applications in functional task characterization.
format Online
Article
Text
id pubmed-9952393
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99523932023-02-25 SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation Zhu, Chenfei Hu, Boce Chen, Jiawei Ai, Xupeng Agrawal, Sunil K. Bioengineering (Basel) Article Hand pose estimation (HPE) plays an important role during the functional assessment of the hand and in potential rehabilitation. It is a challenge to predict the pose of the hand conveniently and accurately during functional tasks, and this limits the application of HPE. In this paper, we propose a novel architecture of a shifted attention regression network (SARN) to perform HPE. Given a depth image, SARN first predicts the spatial relationships between points in the depth image and a group of hand keypoints that determine the pose of the hand. Then, SARN uses these spatial relationships to infer the 3D position of each hand keypoint. To verify the effectiveness of the proposed method, we conducted experiments on three open-source datasets of 3D hand poses: NYU, ICVL, and MSRA. The proposed method achieved state-of-the-art performance with 7.32 mm, 5.91 mm, and 7.17 mm of mean error at the hand keypoints, i.e., mean Euclidean distance between the predicted and ground-truth hand keypoint positions. Additionally, to test the feasibility of SARN in hand movement recognition, a hand movement dataset of 26K depth images from 17 healthy subjects was constructed based on the finger tapping test, an important component of neurological exams administered to Parkinson’s patients. Each image was annotated with the tips of the index finger and the thumb. For this dataset, the proposed method achieved a mean error of 2.99 mm at the hand keypoints and comparable performance on three task-specific metrics: the distance, velocity, and acceleration of the relative movement of the two fingertips. Results on the open-source datasets demonstrated the effectiveness of the proposed method, and results on our finger tapping dataset validated its potential for applications in functional task characterization. MDPI 2023-01-17 /pmc/articles/PMC9952393/ /pubmed/36829620 http://dx.doi.org/10.3390/bioengineering10020126 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhu, Chenfei
Hu, Boce
Chen, Jiawei
Ai, Xupeng
Agrawal, Sunil K.
SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title_full SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title_fullStr SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title_full_unstemmed SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title_short SARN: Shifted Attention Regression Network for 3D Hand Pose Estimation
title_sort sarn: shifted attention regression network for 3d hand pose estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9952393/
https://www.ncbi.nlm.nih.gov/pubmed/36829620
http://dx.doi.org/10.3390/bioengineering10020126
work_keys_str_mv AT zhuchenfei sarnshiftedattentionregressionnetworkfor3dhandposeestimation
AT huboce sarnshiftedattentionregressionnetworkfor3dhandposeestimation
AT chenjiawei sarnshiftedattentionregressionnetworkfor3dhandposeestimation
AT aixupeng sarnshiftedattentionregressionnetworkfor3dhandposeestimation
AT agrawalsunilk sarnshiftedattentionregressionnetworkfor3dhandposeestimation