Cargando…

Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception

Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abst...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Li, Li, Ruifeng, Shi, Hezi, Sun, Jingwen, Zhao, Lijun, Seah, Hock Soon, Quah, Chee Kwang, Tandianus, Budianto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412344/
https://www.ncbi.nlm.nih.gov/pubmed/30795507
http://dx.doi.org/10.3390/s19040893
_version_ 1783402583628447744
author Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Seah, Hock Soon
Quah, Chee Kwang
Tandianus, Budianto
author_facet Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Seah, Hock Soon
Quah, Chee Kwang
Tandianus, Budianto
author_sort Wang, Li
collection PubMed
description Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot.
format Online
Article
Text
id pubmed-6412344
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64123442019-04-03 Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Seah, Hock Soon Quah, Chee Kwang Tandianus, Budianto Sensors (Basel) Article Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot. MDPI 2019-02-21 /pmc/articles/PMC6412344/ /pubmed/30795507 http://dx.doi.org/10.3390/s19040893 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Li
Li, Ruifeng
Shi, Hezi
Sun, Jingwen
Zhao, Lijun
Seah, Hock Soon
Quah, Chee Kwang
Tandianus, Budianto
Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title_full Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title_fullStr Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title_full_unstemmed Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title_short Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
title_sort multi-channel convolutional neural network based 3d object detection for indoor robot environmental perception
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412344/
https://www.ncbi.nlm.nih.gov/pubmed/30795507
http://dx.doi.org/10.3390/s19040893
work_keys_str_mv AT wangli multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT liruifeng multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT shihezi multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT sunjingwen multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT zhaolijun multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT seahhocksoon multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT quahcheekwang multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception
AT tandianusbudianto multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception