Cargando…

Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network

The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object re...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Maomao, Li, Ao, Liu, Honglei, Wang, Minghui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/
https://www.ncbi.nlm.nih.gov/pubmed/34884096
http://dx.doi.org/10.3390/s21238092
_version_ 1784613428467335168
author Zhang, Maomao
Li, Ao
Liu, Honglei
Wang, Minghui
author_facet Zhang, Maomao
Li, Ao
Liu, Honglei
Wang, Minghui
author_sort Zhang, Maomao
collection PubMed
description The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets.
format Online
Article
Text
id pubmed-8662406
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86624062021-12-11 Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui Sensors (Basel) Article The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets. MDPI 2021-12-03 /pmc/articles/PMC8662406/ /pubmed/34884096 http://dx.doi.org/10.3390/s21238092 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Maomao
Li, Ao
Liu, Honglei
Wang, Minghui
Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_full Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_fullStr Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_full_unstemmed Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_short Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
title_sort coarse-to-fine hand–object pose estimation with interaction-aware graph convolutional network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/
https://www.ncbi.nlm.nih.gov/pubmed/34884096
http://dx.doi.org/10.3390/s21238092
work_keys_str_mv AT zhangmaomao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork
AT liao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork
AT liuhonglei coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork
AT wangminghui coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork