Cargando…
Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments
Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such a...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664645/ https://www.ncbi.nlm.nih.gov/pubmed/33171831 http://dx.doi.org/10.3390/s20216360 |
_version_ | 1783609874845794304 |
---|---|
author | Xu, Chi Zhou, Jun Cai, Wendi Jiang, Yunkai Li, Yongbo Liu, Yi |
author_facet | Xu, Chi Zhou, Jun Cai, Wendi Jiang, Yunkai Li, Yongbo Liu, Yi |
author_sort | Xu, Chi |
collection | PubMed |
description | Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy. |
format | Online Article Text |
id | pubmed-7664645 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-76646452020-11-14 Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments Xu, Chi Zhou, Jun Cai, Wendi Jiang, Yunkai Li, Yongbo Liu, Yi Sensors (Basel) Article Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy. MDPI 2020-11-07 /pmc/articles/PMC7664645/ /pubmed/33171831 http://dx.doi.org/10.3390/s20216360 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xu, Chi Zhou, Jun Cai, Wendi Jiang, Yunkai Li, Yongbo Liu, Yi Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title | Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title_full | Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title_fullStr | Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title_full_unstemmed | Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title_short | Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments |
title_sort | robust 3d hand detection from a single rgb-d image in unconstrained environments |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7664645/ https://www.ncbi.nlm.nih.gov/pubmed/33171831 http://dx.doi.org/10.3390/s20216360 |
work_keys_str_mv | AT xuchi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments AT zhoujun robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments AT caiwendi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments AT jiangyunkai robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments AT liyongbo robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments AT liuyi robust3dhanddetectionfromasinglergbdimageinunconstrainedenvironments |