Cargando…

Semantic Segmentation Leveraging Simultaneous Depth Estimation

Semantic segmentation is one of the most widely studied problems in computer vision communities, which makes a great contribution to a variety of applications. A lot of learning-based approaches, such as Convolutional Neural Network (CNN), have made a vast contribution to this problem. While rich co...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Wenbo, Gao, Zhi, Cui, Jinqiang, Ramesh, Bharath, Zhang, Bin, Li, Ziyao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7864030/
https://www.ncbi.nlm.nih.gov/pubmed/33498358
http://dx.doi.org/10.3390/s21030690
_version_ 1783647591071744000
author Sun, Wenbo
Gao, Zhi
Cui, Jinqiang
Ramesh, Bharath
Zhang, Bin
Li, Ziyao
author_facet Sun, Wenbo
Gao, Zhi
Cui, Jinqiang
Ramesh, Bharath
Zhang, Bin
Li, Ziyao
author_sort Sun, Wenbo
collection PubMed
description Semantic segmentation is one of the most widely studied problems in computer vision communities, which makes a great contribution to a variety of applications. A lot of learning-based approaches, such as Convolutional Neural Network (CNN), have made a vast contribution to this problem. While rich context information of the input images can be learned from multi-scale receptive fields by convolutions with deep layers, traditional CNNs have great difficulty in learning the geometrical relationship and distribution of objects in the RGB image due to the lack of depth information, which may lead to an inferior segmentation quality. To solve this problem, we propose a method that improves segmentation quality with depth estimation on RGB images. Specifically, we estimate depth information on RGB images via a depth estimation network, and then feed the depth map into the CNN which is able to guide the semantic segmentation. Furthermore, in order to parse the depth map and RGB images simultaneously, we construct a multi-branch encoder–decoder network and fuse the RGB and depth features step by step. Extensive experimental evaluation on four baseline networks demonstrates that our proposed method can enhance the segmentation quality considerably and obtain better performance compared to other segmentation networks.
format Online
Article
Text
id pubmed-7864030
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-78640302021-02-06 Semantic Segmentation Leveraging Simultaneous Depth Estimation Sun, Wenbo Gao, Zhi Cui, Jinqiang Ramesh, Bharath Zhang, Bin Li, Ziyao Sensors (Basel) Article Semantic segmentation is one of the most widely studied problems in computer vision communities, which makes a great contribution to a variety of applications. A lot of learning-based approaches, such as Convolutional Neural Network (CNN), have made a vast contribution to this problem. While rich context information of the input images can be learned from multi-scale receptive fields by convolutions with deep layers, traditional CNNs have great difficulty in learning the geometrical relationship and distribution of objects in the RGB image due to the lack of depth information, which may lead to an inferior segmentation quality. To solve this problem, we propose a method that improves segmentation quality with depth estimation on RGB images. Specifically, we estimate depth information on RGB images via a depth estimation network, and then feed the depth map into the CNN which is able to guide the semantic segmentation. Furthermore, in order to parse the depth map and RGB images simultaneously, we construct a multi-branch encoder–decoder network and fuse the RGB and depth features step by step. Extensive experimental evaluation on four baseline networks demonstrates that our proposed method can enhance the segmentation quality considerably and obtain better performance compared to other segmentation networks. MDPI 2021-01-20 /pmc/articles/PMC7864030/ /pubmed/33498358 http://dx.doi.org/10.3390/s21030690 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sun, Wenbo
Gao, Zhi
Cui, Jinqiang
Ramesh, Bharath
Zhang, Bin
Li, Ziyao
Semantic Segmentation Leveraging Simultaneous Depth Estimation
title Semantic Segmentation Leveraging Simultaneous Depth Estimation
title_full Semantic Segmentation Leveraging Simultaneous Depth Estimation
title_fullStr Semantic Segmentation Leveraging Simultaneous Depth Estimation
title_full_unstemmed Semantic Segmentation Leveraging Simultaneous Depth Estimation
title_short Semantic Segmentation Leveraging Simultaneous Depth Estimation
title_sort semantic segmentation leveraging simultaneous depth estimation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7864030/
https://www.ncbi.nlm.nih.gov/pubmed/33498358
http://dx.doi.org/10.3390/s21030690
work_keys_str_mv AT sunwenbo semanticsegmentationleveragingsimultaneousdepthestimation
AT gaozhi semanticsegmentationleveragingsimultaneousdepthestimation
AT cuijinqiang semanticsegmentationleveragingsimultaneousdepthestimation
AT rameshbharath semanticsegmentationleveragingsimultaneousdepthestimation
AT zhangbin semanticsegmentationleveragingsimultaneousdepthestimation
AT liziyao semanticsegmentationleveragingsimultaneousdepthestimation