Cargando…

Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network

In this paper, a novel Pixel-Voxel network is proposed for dense 3D semantic mapping, which can perform dense 3D mapping while simultaneously recognizing and labelling the semantic category each point in the 3D map. In our approach, we fully leverage the advantages of different modalities. That is,...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Cheng, Sun, Li, Purkait, Pulak, Duckett, Tom, Stolkin, Rustam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6164553/
https://www.ncbi.nlm.nih.gov/pubmed/30223501
http://dx.doi.org/10.3390/s18093099
_version_ 1783359627613700096
author Zhao, Cheng
Sun, Li
Purkait, Pulak
Duckett, Tom
Stolkin, Rustam
author_facet Zhao, Cheng
Sun, Li
Purkait, Pulak
Duckett, Tom
Stolkin, Rustam
author_sort Zhao, Cheng
collection PubMed
description In this paper, a novel Pixel-Voxel network is proposed for dense 3D semantic mapping, which can perform dense 3D mapping while simultaneously recognizing and labelling the semantic category each point in the 3D map. In our approach, we fully leverage the advantages of different modalities. That is, the PixelNet can learn the high-level contextual information from 2D RGB images, and the VoxelNet can learn 3D geometrical shapes from the 3D point cloud. Unlike the existing architecture that fuses score maps from different modalities with equal weights, we propose a softmax weighted fusion stack that adaptively learns the varying contributions of PixelNet and VoxelNet and fuses the score maps according to their respective confidence levels. Our approach achieved competitive results on both the SUN RGB-D and NYU V2 benchmarks, while the runtime of the proposed system is boosted to around 13 Hz, enabling near-real-time performance using an i7 eight-cores PC with a single Titan X GPU.
format Online
Article
Text
id pubmed-6164553
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-61645532018-10-10 Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network Zhao, Cheng Sun, Li Purkait, Pulak Duckett, Tom Stolkin, Rustam Sensors (Basel) Article In this paper, a novel Pixel-Voxel network is proposed for dense 3D semantic mapping, which can perform dense 3D mapping while simultaneously recognizing and labelling the semantic category each point in the 3D map. In our approach, we fully leverage the advantages of different modalities. That is, the PixelNet can learn the high-level contextual information from 2D RGB images, and the VoxelNet can learn 3D geometrical shapes from the 3D point cloud. Unlike the existing architecture that fuses score maps from different modalities with equal weights, we propose a softmax weighted fusion stack that adaptively learns the varying contributions of PixelNet and VoxelNet and fuses the score maps according to their respective confidence levels. Our approach achieved competitive results on both the SUN RGB-D and NYU V2 benchmarks, while the runtime of the proposed system is boosted to around 13 Hz, enabling near-real-time performance using an i7 eight-cores PC with a single Titan X GPU. MDPI 2018-09-14 /pmc/articles/PMC6164553/ /pubmed/30223501 http://dx.doi.org/10.3390/s18093099 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Cheng
Sun, Li
Purkait, Pulak
Duckett, Tom
Stolkin, Rustam
Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title_full Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title_fullStr Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title_full_unstemmed Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title_short Dense RGB-D Semantic Mapping with Pixel-Voxel Neural Network
title_sort dense rgb-d semantic mapping with pixel-voxel neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6164553/
https://www.ncbi.nlm.nih.gov/pubmed/30223501
http://dx.doi.org/10.3390/s18093099
work_keys_str_mv AT zhaocheng densergbdsemanticmappingwithpixelvoxelneuralnetwork
AT sunli densergbdsemanticmappingwithpixelvoxelneuralnetwork
AT purkaitpulak densergbdsemanticmappingwithpixelvoxelneuralnetwork
AT ducketttom densergbdsemanticmappingwithpixelvoxelneuralnetwork
AT stolkinrustam densergbdsemanticmappingwithpixelvoxelneuralnetwork