Cargando…

Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network

The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object re...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Maomao, Li, Ao, Liu, Honglei, Wang, Minghui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/ https://www.ncbi.nlm.nih.gov/pubmed/34884096 http://dx.doi.org/10.3390/s21238092

_version_	1784613428467335168
author	Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui
author_facet	Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui
author_sort	Zhang, Maomao
collection	PubMed
description	The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets.
format	Online Article Text
id	pubmed-8662406
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-86624062021-12-11 Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui Sensors (Basel) Article The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets. MDPI 2021-12-03 /pmc/articles/PMC8662406/ /pubmed/34884096 http://dx.doi.org/10.3390/s21238092 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title	Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_full	Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_fullStr	Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_full_unstemmed	Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_short	Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_sort	coarse-to-fine hand–object pose estimation with interaction-aware graph convolutional network
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/ https://www.ncbi.nlm.nih.gov/pubmed/34884096 http://dx.doi.org/10.3390/s21238092
work_keys_str_mv	AT zhangmaomao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT liao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT liuhonglei coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT wangminghui coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork

Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network

Ejemplares similares