Cargando…

Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image

This paper proposes a novel 3D representation, namely, a latent 3D volume, for joint depth estimation and semantic segmentation. Most previous studies encoded an input scene (typically given as a 2D image) into a set of feature vectors arranged over a 2D plane. However, considering the real world is...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ito, Seiya, Kaneko, Naoshi, Sumi, Kazuhiko
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601744/ https://www.ncbi.nlm.nih.gov/pubmed/33053692 http://dx.doi.org/10.3390/s20205765

_version_	1783603504775954432
author	Ito, Seiya Kaneko, Naoshi Sumi, Kazuhiko
author_facet	Ito, Seiya Kaneko, Naoshi Sumi, Kazuhiko
author_sort	Ito, Seiya
collection	PubMed
description	This paper proposes a novel 3D representation, namely, a latent 3D volume, for joint depth estimation and semantic segmentation. Most previous studies encoded an input scene (typically given as a 2D image) into a set of feature vectors arranged over a 2D plane. However, considering the real world is three-dimensional, this 2D arrangement reduces one dimension and may limit the capacity of feature representation. In contrast, we examine the idea of arranging the feature vectors in 3D space rather than in a 2D plane. We refer to this 3D volumetric arrangement as a latent 3D volume. We will show that the latent 3D volume is beneficial to the tasks of depth estimation and semantic segmentation because these tasks require an understanding of the 3D structure of the scene. Our network first constructs an initial 3D volume using image features and then generates latent 3D volume by passing the initial 3D volume through several 3D convolutional layers. We apply depth regression and semantic segmentation by projecting the latent 3D volume onto a 2D plane. The evaluation results show that our method outperforms previous approaches on the NYU Depth v2 dataset.
format	Online Article Text
id	pubmed-7601744
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-76017442020-11-01 Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image Ito, Seiya Kaneko, Naoshi Sumi, Kazuhiko Sensors (Basel) Article This paper proposes a novel 3D representation, namely, a latent 3D volume, for joint depth estimation and semantic segmentation. Most previous studies encoded an input scene (typically given as a 2D image) into a set of feature vectors arranged over a 2D plane. However, considering the real world is three-dimensional, this 2D arrangement reduces one dimension and may limit the capacity of feature representation. In contrast, we examine the idea of arranging the feature vectors in 3D space rather than in a 2D plane. We refer to this 3D volumetric arrangement as a latent 3D volume. We will show that the latent 3D volume is beneficial to the tasks of depth estimation and semantic segmentation because these tasks require an understanding of the 3D structure of the scene. Our network first constructs an initial 3D volume using image features and then generates latent 3D volume by passing the initial 3D volume through several 3D convolutional layers. We apply depth regression and semantic segmentation by projecting the latent 3D volume onto a 2D plane. The evaluation results show that our method outperforms previous approaches on the NYU Depth v2 dataset. MDPI 2020-10-12 /pmc/articles/PMC7601744/ /pubmed/33053692 http://dx.doi.org/10.3390/s20205765 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ito, Seiya Kaneko, Naoshi Sumi, Kazuhiko Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title	Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title_full	Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title_fullStr	Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title_full_unstemmed	Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title_short	Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image
title_sort	latent 3d volume for joint depth estimation and semantic segmentation from a single image
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7601744/ https://www.ncbi.nlm.nih.gov/pubmed/33053692 http://dx.doi.org/10.3390/s20205765
work_keys_str_mv	AT itoseiya latent3dvolumeforjointdepthestimationandsemanticsegmentationfromasingleimage AT kanekonaoshi latent3dvolumeforjointdepthestimationandsemanticsegmentationfromasingleimage AT sumikazuhiko latent3dvolumeforjointdepthestimationandsemanticsegmentationfromasingleimage

Latent 3D Volume for Joint Depth Estimation and Semantic Segmentation from a Single Image

Ejemplares similares