Cargando…
Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network
The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object re...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/ https://www.ncbi.nlm.nih.gov/pubmed/34884096 http://dx.doi.org/10.3390/s21238092 |
_version_ | 1784613428467335168 |
---|---|
author | Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui |
author_facet | Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui |
author_sort | Zhang, Maomao |
collection | PubMed |
description | The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets. |
format | Online Article Text |
id | pubmed-8662406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86624062021-12-11 Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui Sensors (Basel) Article The analysis of hand–object poses from RGB images is important for understanding and imitating human behavior and acts as a key factor in various applications. In this paper, we propose a novel coarse-to-fine two-stage framework for hand–object pose estimation, which explicitly models hand–object relations in 3D pose refinement rather than in the process of converting 2D poses to 3D poses. Specifically, in the coarse stage, 2D heatmaps of hand and object keypoints are obtained from RGB image and subsequently fed into pose regressor to derive coarse 3D poses. As for the fine stage, an interaction-aware graph convolutional network called InterGCN is introduced to perform pose refinement by fully leveraging the hand–object relations in 3D context. One major challenge in 3D pose refinement lies in the fact that relations between hand and object change dynamically according to different HOI scenarios. In response to this issue, we leverage both general and interaction-specific relation graphs to significantly enhance the capacity of the network to cover variations of HOI scenarios for successful 3D pose refinement. Extensive experiments demonstrate state-of-the-art performance of our approach on benchmark hand–object datasets. MDPI 2021-12-03 /pmc/articles/PMC8662406/ /pubmed/34884096 http://dx.doi.org/10.3390/s21238092 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhang, Maomao Li, Ao Liu, Honglei Wang, Minghui Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title | Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title_full | Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title_fullStr | Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title_full_unstemmed | Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title_short | Coarse-to-Fine Hand–Object Pose Estimation with Interaction-Aware Graph Convolutional Network |
title_sort | coarse-to-fine hand–object pose estimation with interaction-aware graph convolutional network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662406/ https://www.ncbi.nlm.nih.gov/pubmed/34884096 http://dx.doi.org/10.3390/s21238092 |
work_keys_str_mv | AT zhangmaomao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT liao coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT liuhonglei coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork AT wangminghui coarsetofinehandobjectposeestimationwithinteractionawaregraphconvolutionalnetwork |