Cargando…

Improving Depth Estimation by Embedding Semantic Segmentation: A Hybrid CNN Model

Single image depth estimation works fail to separate foreground elements because they can easily be confounded with the background. To alleviate this problem, we propose the use of a semantic segmentation procedure that adds information to a depth estimator, in this case, a 3D Convolutional Neural N...

Descripción completa

Detalles Bibliográficos
Autores principales: Valdez-Rodríguez, José E., Calvo, Hiram, Felipe-Riverón, Edgardo, Moreno-Armendáriz, Marco A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8875167/
https://www.ncbi.nlm.nih.gov/pubmed/35214571
http://dx.doi.org/10.3390/s22041669
Descripción
Sumario:Single image depth estimation works fail to separate foreground elements because they can easily be confounded with the background. To alleviate this problem, we propose the use of a semantic segmentation procedure that adds information to a depth estimator, in this case, a 3D Convolutional Neural Network (CNN)—segmentation is coded as one-hot planes representing categories of objects. We explore 2D and 3D models. Particularly, we propose a hybrid 2D–3D CNN architecture capable of obtaining semantic segmentation and depth estimation at the same time. We tested our procedure on the SYNTHIA-AL dataset and obtained [Formula: see text] , which is an improvement of 0.14 points (compared with the state of the art of [Formula: see text]) by using manual segmentation, and [Formula: see text] using automatic semantic segmentation, proving that depth estimation is improved when the shape and position of objects in a scene are known.