Cargando…

SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches

Monocular depth estimation based on unsupervised learning has attracted great attention due to the rising demand for lightweight monocular vision sensors. Inspired by multi-task learning, semantic information has been used to improve the monocular depth estimation models. However, multi-task learnin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Rui, Zou, Jialing, Wen, James Zhiqing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398641/ https://www.ncbi.nlm.nih.gov/pubmed/34450917 http://dx.doi.org/10.3390/s21165476

_version_	1783744888251088896
author	Wang, Rui Zou, Jialing Wen, James Zhiqing
author_facet	Wang, Rui Zou, Jialing Wen, James Zhiqing
author_sort	Wang, Rui
collection	PubMed
description	Monocular depth estimation based on unsupervised learning has attracted great attention due to the rising demand for lightweight monocular vision sensors. Inspired by multi-task learning, semantic information has been used to improve the monocular depth estimation models. However, multi-task learning is still limited by multi-type annotations. As far as we know, there are scarcely any large public datasets that provide all the necessary information. Therefore, we propose a novel network architecture Semantic-Feature-Aided Monocular Depth Estimation Network (SFA-MDEN) to extract multi-resolution depth features and semantic features, which are merged and fed into the decoder, with the goal of predicting depth with the support of semantics. Instead of using loss functions to relate the semantics and depth, the fusion of feature maps for semantics and depth is employed to predict the monocular depth. Therefore, two accessible datasets with similar topics for depth estimation and semantic segmentation can meet the requirements of SFA-MDEN for training sets. We explored the performance of the proposed SFA-MDEN with experiments on different datasets, including KITTI, Make3D, and our own dataset BHDE-v1. The experimental results demonstrate that SFA-MDEN achieves competitive accuracy and generalization capacity compared to state-of-the-art methods.
format	Online Article Text
id	pubmed-8398641
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-83986412021-08-29 SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches Wang, Rui Zou, Jialing Wen, James Zhiqing Sensors (Basel) Article Monocular depth estimation based on unsupervised learning has attracted great attention due to the rising demand for lightweight monocular vision sensors. Inspired by multi-task learning, semantic information has been used to improve the monocular depth estimation models. However, multi-task learning is still limited by multi-type annotations. As far as we know, there are scarcely any large public datasets that provide all the necessary information. Therefore, we propose a novel network architecture Semantic-Feature-Aided Monocular Depth Estimation Network (SFA-MDEN) to extract multi-resolution depth features and semantic features, which are merged and fed into the decoder, with the goal of predicting depth with the support of semantics. Instead of using loss functions to relate the semantics and depth, the fusion of feature maps for semantics and depth is employed to predict the monocular depth. Therefore, two accessible datasets with similar topics for depth estimation and semantic segmentation can meet the requirements of SFA-MDEN for training sets. We explored the performance of the proposed SFA-MDEN with experiments on different datasets, including KITTI, Make3D, and our own dataset BHDE-v1. The experimental results demonstrate that SFA-MDEN achieves competitive accuracy and generalization capacity compared to state-of-the-art methods. MDPI 2021-08-13 /pmc/articles/PMC8398641/ /pubmed/34450917 http://dx.doi.org/10.3390/s21165476 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Rui Zou, Jialing Wen, James Zhiqing SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title	SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title_full	SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title_fullStr	SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title_full_unstemmed	SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title_short	SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches
title_sort	sfa-mden: semantic-feature-aided monocular depth estimation network using dual branches
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8398641/ https://www.ncbi.nlm.nih.gov/pubmed/34450917 http://dx.doi.org/10.3390/s21165476
work_keys_str_mv	AT wangrui sfamdensemanticfeatureaidedmonoculardepthestimationnetworkusingdualbranches AT zoujialing sfamdensemanticfeatureaidedmonoculardepthestimationnetworkusingdualbranches AT wenjameszhiqing sfamdensemanticfeatureaidedmonoculardepthestimationnetworkusingdualbranches

SFA-MDEN: Semantic-Feature-Aided Monocular Depth Estimation Network Using Dual Branches

Ejemplares similares