Cargando…

Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such a...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Chi, Zhou, Jun, Cai, Wendi, Jiang, Yunkai, Li, Yongbo, Liu, Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664645/
https://www.ncbi.nlm.nih.gov/pubmed/33171831
http://dx.doi.org/10.3390/s20216360
_version_ 1783609874845794304
author Xu, Chi
Zhou, Jun
Cai, Wendi
Jiang, Yunkai
Li, Yongbo
Liu, Yi
author_facet Xu, Chi
Zhou, Jun
Cai, Wendi
Jiang, Yunkai
Li, Yongbo
Liu, Yi
author_sort Xu, Chi
collection PubMed
description Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy.
format Online
Article
Text
id pubmed-7664645
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-76646452020-11-14 Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments Xu, Chi Zhou, Jun Cai, Wendi Jiang, Yunkai Li, Yongbo Liu, Yi Sensors (Basel) Article Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy. MDPI 2020-11-07 /pmc/articles/PMC7664645/ /pubmed/33171831 http://dx.doi.org/10.3390/s20216360 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xu, Chi
Zhou, Jun
Cai, Wendi
Jiang, Yunkai
Li, Yongbo
Liu, Yi
Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title_full Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title_fullStr Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title_full_unstemmed Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title_short Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
title_sort robust 3d hand detection from a single rgb-d image in unconstrained environments
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664645/
https://www.ncbi.nlm.nih.gov/pubmed/33171831
http://dx.doi.org/10.3390/s20216360
work_keys_str_mv AT xuchi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments
AT zhoujun robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments
AT caiwendi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments
AT jiangyunkai robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments
AT liyongbo robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments
AT liuyi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments