Cargando…

Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance

Accurately sensing the surrounding 3D scene is indispensable for drones or robots to execute path planning and navigation. In this paper, a novel monocular depth estimation method was proposed that primarily utilizes a lighter-weight Convolutional Neural Network (CNN) structure for coarse depth pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Kang, Qu, Xingtian, Chen, Shouqian, Chen, Zhen, Zhang, Wang, Qi, Haogang, Zhao, Fengshang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506624/
https://www.ncbi.nlm.nih.gov/pubmed/32867293
http://dx.doi.org/10.3390/s20174856
_version_ 1783585056393003008
author Huang, Kang
Qu, Xingtian
Chen, Shouqian
Chen, Zhen
Zhang, Wang
Qi, Haogang
Zhao, Fengshang
author_facet Huang, Kang
Qu, Xingtian
Chen, Shouqian
Chen, Zhen
Zhang, Wang
Qi, Haogang
Zhao, Fengshang
author_sort Huang, Kang
collection PubMed
description Accurately sensing the surrounding 3D scene is indispensable for drones or robots to execute path planning and navigation. In this paper, a novel monocular depth estimation method was proposed that primarily utilizes a lighter-weight Convolutional Neural Network (CNN) structure for coarse depth prediction and then refines the coarse depth images by combining surface normal guidance. Specifically, the coarse depth prediction network is designed as pre-trained encoder–decoder architecture for describing the 3D structure. When it comes to surface normal estimation, the deep learning network was designed as a two-stream encoder–decoder structure, which hierarchically merges red-green-blue-depth (RGB-D) images for capturing more accurate geometric boundaries. Relying on fewer network parameters and simpler learning structure, better detailed depth maps are produced than the existing states. Moreover, 3D point cloud maps reconstructed from depth prediction images confirm that our framework can be conveniently adopted as components of a monocular simultaneous localization and mapping (SLAM) paradigm.
format Online
Article
Text
id pubmed-7506624
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75066242020-09-26 Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance Huang, Kang Qu, Xingtian Chen, Shouqian Chen, Zhen Zhang, Wang Qi, Haogang Zhao, Fengshang Sensors (Basel) Article Accurately sensing the surrounding 3D scene is indispensable for drones or robots to execute path planning and navigation. In this paper, a novel monocular depth estimation method was proposed that primarily utilizes a lighter-weight Convolutional Neural Network (CNN) structure for coarse depth prediction and then refines the coarse depth images by combining surface normal guidance. Specifically, the coarse depth prediction network is designed as pre-trained encoder–decoder architecture for describing the 3D structure. When it comes to surface normal estimation, the deep learning network was designed as a two-stream encoder–decoder structure, which hierarchically merges red-green-blue-depth (RGB-D) images for capturing more accurate geometric boundaries. Relying on fewer network parameters and simpler learning structure, better detailed depth maps are produced than the existing states. Moreover, 3D point cloud maps reconstructed from depth prediction images confirm that our framework can be conveniently adopted as components of a monocular simultaneous localization and mapping (SLAM) paradigm. MDPI 2020-08-27 /pmc/articles/PMC7506624/ /pubmed/32867293 http://dx.doi.org/10.3390/s20174856 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Huang, Kang
Qu, Xingtian
Chen, Shouqian
Chen, Zhen
Zhang, Wang
Qi, Haogang
Zhao, Fengshang
Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title_full Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title_fullStr Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title_full_unstemmed Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title_short Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance
title_sort superb monocular depth estimation based on transfer learning and surface normal guidance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7506624/
https://www.ncbi.nlm.nih.gov/pubmed/32867293
http://dx.doi.org/10.3390/s20174856
work_keys_str_mv AT huangkang superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT quxingtian superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT chenshouqian superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT chenzhen superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT zhangwang superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT qihaogang superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance
AT zhaofengshang superbmonoculardepthestimationbasedontransferlearningandsurfacenormalguidance