Cargando…

Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data

Recently, 6D pose estimation methods have shown robust performance on highly cluttered scenes and different illumination conditions. However, occlusions are still challenging, with recognition rates decreasing to less than 10% for half-visible objects in some datasets. In this paper, we propose to u...

Descripción completa

Detalles Bibliográficos
Autores principales: Vidal, Joel, Lin, Chyi-Yeu, Martí, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662424/
https://www.ncbi.nlm.nih.gov/pubmed/34884094
http://dx.doi.org/10.3390/s21238090
_version_ 1784613433167052800
author Vidal, Joel
Lin, Chyi-Yeu
Martí, Robert
author_facet Vidal, Joel
Lin, Chyi-Yeu
Martí, Robert
author_sort Vidal, Joel
collection PubMed
description Recently, 6D pose estimation methods have shown robust performance on highly cluttered scenes and different illumination conditions. However, occlusions are still challenging, with recognition rates decreasing to less than 10% for half-visible objects in some datasets. In this paper, we propose to use top-down visual attention and color cues to boost performance of a state-of-the-art method on occluded scenarios. More specifically, color information is employed to detect potential points in the scene, improve feature-matching, and compute more precise fitting scores. The proposed method is evaluated on the Linemod occluded (LM-O), TUD light (TUD-L), Tejani (IC-MI) and Doumanoglou (IC-BIN) datasets, as part of the SiSo BOP benchmark, which includes challenging highly occluded cases, illumination changing scenarios, and multiple instances. The method is analyzed and discussed for different parameters, color spaces and metrics. The presented results show the validity of the proposed approach and their robustness against illumination changes and multiple instance scenarios, specially boosting the performance on relatively high occluded cases. The proposed solution provides an absolute improvement of up to 30% for levels of occlusion between 40% to 50%, outperforming other approaches with a best overall recall of 71% for the LM-O, 92% for TUD-L, 99.3% for IC-MI and 97.5% for IC-BIN.
format Online
Article
Text
id pubmed-8662424
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86624242021-12-11 Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data Vidal, Joel Lin, Chyi-Yeu Martí, Robert Sensors (Basel) Article Recently, 6D pose estimation methods have shown robust performance on highly cluttered scenes and different illumination conditions. However, occlusions are still challenging, with recognition rates decreasing to less than 10% for half-visible objects in some datasets. In this paper, we propose to use top-down visual attention and color cues to boost performance of a state-of-the-art method on occluded scenarios. More specifically, color information is employed to detect potential points in the scene, improve feature-matching, and compute more precise fitting scores. The proposed method is evaluated on the Linemod occluded (LM-O), TUD light (TUD-L), Tejani (IC-MI) and Doumanoglou (IC-BIN) datasets, as part of the SiSo BOP benchmark, which includes challenging highly occluded cases, illumination changing scenarios, and multiple instances. The method is analyzed and discussed for different parameters, color spaces and metrics. The presented results show the validity of the proposed approach and their robustness against illumination changes and multiple instance scenarios, specially boosting the performance on relatively high occluded cases. The proposed solution provides an absolute improvement of up to 30% for levels of occlusion between 40% to 50%, outperforming other approaches with a best overall recall of 71% for the LM-O, 92% for TUD-L, 99.3% for IC-MI and 97.5% for IC-BIN. MDPI 2021-12-03 /pmc/articles/PMC8662424/ /pubmed/34884094 http://dx.doi.org/10.3390/s21238090 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vidal, Joel
Lin, Chyi-Yeu
Martí, Robert
Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title_full Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title_fullStr Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title_full_unstemmed Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title_short Visual Attention and Color Cues for 6D Pose Estimation on Occluded Scenarios Using RGB-D Data
title_sort visual attention and color cues for 6d pose estimation on occluded scenarios using rgb-d data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662424/
https://www.ncbi.nlm.nih.gov/pubmed/34884094
http://dx.doi.org/10.3390/s21238090
work_keys_str_mv AT vidaljoel visualattentionandcolorcuesfor6dposeestimationonoccludedscenariosusingrgbddata
AT linchyiyeu visualattentionandcolorcuesfor6dposeestimationonoccludedscenariosusingrgbddata
AT martirobert visualattentionandcolorcuesfor6dposeestimationonoccludedscenariosusingrgbddata