Cargando…

Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments

Estimating the distance to objects is crucial for autonomous vehicles, but cost, weight or power constraints sometimes prevent the use of dedicated depth sensors. In this case, the distance has to be estimated from on-board mounted RGB cameras, which is a complex task especially for environments suc...

Descripción completa

Detalles Bibliográficos
Autores principales: Fonder, Michaël, Ernst, Damien, Van Droogenbroeck, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9741046/
https://www.ncbi.nlm.nih.gov/pubmed/36502073
http://dx.doi.org/10.3390/s22239374
_version_ 1784848220287926272
author Fonder, Michaël
Ernst, Damien
Van Droogenbroeck, Marc
author_facet Fonder, Michaël
Ernst, Damien
Van Droogenbroeck, Marc
author_sort Fonder, Michaël
collection PubMed
description Estimating the distance to objects is crucial for autonomous vehicles, but cost, weight or power constraints sometimes prevent the use of dedicated depth sensors. In this case, the distance has to be estimated from on-board mounted RGB cameras, which is a complex task especially for environments such as natural outdoor landscapes. In this paper, we present a new depth estimation method suitable for use in such landscapes. First, we establish a bijective relationship between depth and the visual parallax of two consecutive frames and show how to exploit it to perform motion-invariant pixel-wise depth estimation. Then, we detail our architecture which is based on a pyramidal convolutional neural network where each level refines an input parallax map estimate by using two customized cost volumes. We use these cost volumes to leverage the visual spatio-temporal constraints imposed by motion and make the network robust for varied scenes. We benchmarked our approach both in test and generalization modes on public datasets featuring synthetic camera trajectories recorded in a wide variety of outdoor scenes. Results show that our network outperforms the state of the art on these datasets, while also performing well on a standard depth estimation benchmark.
format Online
Article
Text
id pubmed-9741046
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97410462022-12-11 Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments Fonder, Michaël Ernst, Damien Van Droogenbroeck, Marc Sensors (Basel) Article Estimating the distance to objects is crucial for autonomous vehicles, but cost, weight or power constraints sometimes prevent the use of dedicated depth sensors. In this case, the distance has to be estimated from on-board mounted RGB cameras, which is a complex task especially for environments such as natural outdoor landscapes. In this paper, we present a new depth estimation method suitable for use in such landscapes. First, we establish a bijective relationship between depth and the visual parallax of two consecutive frames and show how to exploit it to perform motion-invariant pixel-wise depth estimation. Then, we detail our architecture which is based on a pyramidal convolutional neural network where each level refines an input parallax map estimate by using two customized cost volumes. We use these cost volumes to leverage the visual spatio-temporal constraints imposed by motion and make the network robust for varied scenes. We benchmarked our approach both in test and generalization modes on public datasets featuring synthetic camera trajectories recorded in a wide variety of outdoor scenes. Results show that our network outperforms the state of the art on these datasets, while also performing well on a standard depth estimation benchmark. MDPI 2022-12-01 /pmc/articles/PMC9741046/ /pubmed/36502073 http://dx.doi.org/10.3390/s22239374 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fonder, Michaël
Ernst, Damien
Van Droogenbroeck, Marc
Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title_full Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title_fullStr Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title_full_unstemmed Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title_short Parallax Inference for Robust Temporal Monocular Depth Estimation in Unstructured Environments
title_sort parallax inference for robust temporal monocular depth estimation in unstructured environments
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9741046/
https://www.ncbi.nlm.nih.gov/pubmed/36502073
http://dx.doi.org/10.3390/s22239374
work_keys_str_mv AT fondermichael parallaxinferenceforrobusttemporalmonoculardepthestimationinunstructuredenvironments
AT ernstdamien parallaxinferenceforrobusttemporalmonoculardepthestimationinunstructuredenvironments
AT vandroogenbroeckmarc parallaxinferenceforrobusttemporalmonoculardepthestimationinunstructuredenvironments