Cargando…
Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception
Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abst...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412344/ https://www.ncbi.nlm.nih.gov/pubmed/30795507 http://dx.doi.org/10.3390/s19040893 |
_version_ | 1783402583628447744 |
---|---|
author | Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Seah, Hock Soon Quah, Chee Kwang Tandianus, Budianto |
author_facet | Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Seah, Hock Soon Quah, Chee Kwang Tandianus, Budianto |
author_sort | Wang, Li |
collection | PubMed |
description | Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot. |
format | Online Article Text |
id | pubmed-6412344 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-64123442019-04-03 Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Seah, Hock Soon Quah, Chee Kwang Tandianus, Budianto Sensors (Basel) Article Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot’s operation. In this paper, we focus on the 3D object detection to regress the object’s category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird’s eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot. MDPI 2019-02-21 /pmc/articles/PMC6412344/ /pubmed/30795507 http://dx.doi.org/10.3390/s19040893 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Li Li, Ruifeng Shi, Hezi Sun, Jingwen Zhao, Lijun Seah, Hock Soon Quah, Chee Kwang Tandianus, Budianto Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title | Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title_full | Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title_fullStr | Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title_full_unstemmed | Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title_short | Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception |
title_sort | multi-channel convolutional neural network based 3d object detection for indoor robot environmental perception |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6412344/ https://www.ncbi.nlm.nih.gov/pubmed/30795507 http://dx.doi.org/10.3390/s19040893 |
work_keys_str_mv | AT wangli multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT liruifeng multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT shihezi multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT sunjingwen multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT zhaolijun multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT seahhocksoon multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT quahcheekwang multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception AT tandianusbudianto multichannelconvolutionalneuralnetworkbased3dobjectdetectionforindoorrobotenvironmentalperception |