Cargando…

Uncertainty Prediction for Monocular 3D Object Detection

For object detection, capturing the scale of uncertainty is as important as accurate localization. Without understanding uncertainties, self-driving vehicles cannot plan a safe path. Many studies have focused on improving object detection, but relatively little attention has been paid to uncertainty...

Descripción completa

Detalles Bibliográficos
Autores principales: Mun, Junghwan, Choi, Hyukdoo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10304882/
https://www.ncbi.nlm.nih.gov/pubmed/37420563
http://dx.doi.org/10.3390/s23125395
_version_ 1785065606358237184
author Mun, Junghwan
Choi, Hyukdoo
author_facet Mun, Junghwan
Choi, Hyukdoo
author_sort Mun, Junghwan
collection PubMed
description For object detection, capturing the scale of uncertainty is as important as accurate localization. Without understanding uncertainties, self-driving vehicles cannot plan a safe path. Many studies have focused on improving object detection, but relatively little attention has been paid to uncertainty estimation. We present an uncertainty model to predict the standard deviation of bounding box parameters for a monocular 3D object detection model. The uncertainty model is a small, multi-layer perceptron (MLP) that is trained to predict uncertainty for each detected object. In addition, we observe that occlusion information helps predict uncertainty accurately. A new monocular detection model is designed to classify occlusion levels as well as to detect objects. An input vector to the uncertainty model contains bounding box parameters, class probabilities, and occlusion probabilities. To validate predicted uncertainties, actual uncertainties are estimated at the specific predicted uncertainties. The accuracy of the predicted values is evaluated using these estimated actual values. We find that the mean uncertainty error is reduced by 7.1% using the occlusion information. The uncertainty model directly estimates total uncertainty at the absolute scale, which is critical to self-driving systems. Our approach is validated through the KITTI object detection benchmark.
format Online
Article
Text
id pubmed-10304882
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103048822023-06-29 Uncertainty Prediction for Monocular 3D Object Detection Mun, Junghwan Choi, Hyukdoo Sensors (Basel) Article For object detection, capturing the scale of uncertainty is as important as accurate localization. Without understanding uncertainties, self-driving vehicles cannot plan a safe path. Many studies have focused on improving object detection, but relatively little attention has been paid to uncertainty estimation. We present an uncertainty model to predict the standard deviation of bounding box parameters for a monocular 3D object detection model. The uncertainty model is a small, multi-layer perceptron (MLP) that is trained to predict uncertainty for each detected object. In addition, we observe that occlusion information helps predict uncertainty accurately. A new monocular detection model is designed to classify occlusion levels as well as to detect objects. An input vector to the uncertainty model contains bounding box parameters, class probabilities, and occlusion probabilities. To validate predicted uncertainties, actual uncertainties are estimated at the specific predicted uncertainties. The accuracy of the predicted values is evaluated using these estimated actual values. We find that the mean uncertainty error is reduced by 7.1% using the occlusion information. The uncertainty model directly estimates total uncertainty at the absolute scale, which is critical to self-driving systems. Our approach is validated through the KITTI object detection benchmark. MDPI 2023-06-07 /pmc/articles/PMC10304882/ /pubmed/37420563 http://dx.doi.org/10.3390/s23125395 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mun, Junghwan
Choi, Hyukdoo
Uncertainty Prediction for Monocular 3D Object Detection
title Uncertainty Prediction for Monocular 3D Object Detection
title_full Uncertainty Prediction for Monocular 3D Object Detection
title_fullStr Uncertainty Prediction for Monocular 3D Object Detection
title_full_unstemmed Uncertainty Prediction for Monocular 3D Object Detection
title_short Uncertainty Prediction for Monocular 3D Object Detection
title_sort uncertainty prediction for monocular 3d object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10304882/
https://www.ncbi.nlm.nih.gov/pubmed/37420563
http://dx.doi.org/10.3390/s23125395
work_keys_str_mv AT munjunghwan uncertaintypredictionformonocular3dobjectdetection
AT choihyukdoo uncertaintypredictionformonocular3dobjectdetection