Cargando…

One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection

Three-dimensional (3D) object detection has important applications in robotics, automatic loading, automatic driving and other scenarios. With the improvement of devices, people can collect multi-sensor/multimodal data from a variety of sensors such as Lidar and cameras. In order to make full use of...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Minle, Hu, Yihua, Zhao, Nanxiang, Qian, Qishu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471046/ https://www.ncbi.nlm.nih.gov/pubmed/30909582 http://dx.doi.org/10.3390/s19061434

_version_	1783411936425148416
author	Li, Minle Hu, Yihua Zhao, Nanxiang Qian, Qishu
author_facet	Li, Minle Hu, Yihua Zhao, Nanxiang Qian, Qishu
author_sort	Li, Minle
collection	PubMed
description	Three-dimensional (3D) object detection has important applications in robotics, automatic loading, automatic driving and other scenarios. With the improvement of devices, people can collect multi-sensor/multimodal data from a variety of sensors such as Lidar and cameras. In order to make full use of various information advantages and improve the performance of object detection, we proposed a Complex-Retina network, a convolution neural network for 3D object detection based on multi-sensor data fusion. Firstly, a unified architecture with two feature extraction networks was designed, and the feature extraction of point clouds and images from different sensors realized synchronously. Then, we set a series of 3D anchors and projected them to the feature maps, which were cropped into 2D anchors with the same size and fused together. Finally, the object classification and 3D bounding box regression were carried out on the multipath of fully connected layers. The proposed network is a one-stage convolution neural network, which achieves the balance between the accuracy and speed of object detection. The experiments on KITTI datasets show that the proposed network is superior to the contrast algorithms in average precision (AP) and time consumption, which shows the effectiveness of the proposed network.
format	Online Article Text
id	pubmed-6471046
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-64710462019-04-26 One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection Li, Minle Hu, Yihua Zhao, Nanxiang Qian, Qishu Sensors (Basel) Article Three-dimensional (3D) object detection has important applications in robotics, automatic loading, automatic driving and other scenarios. With the improvement of devices, people can collect multi-sensor/multimodal data from a variety of sensors such as Lidar and cameras. In order to make full use of various information advantages and improve the performance of object detection, we proposed a Complex-Retina network, a convolution neural network for 3D object detection based on multi-sensor data fusion. Firstly, a unified architecture with two feature extraction networks was designed, and the feature extraction of point clouds and images from different sensors realized synchronously. Then, we set a series of 3D anchors and projected them to the feature maps, which were cropped into 2D anchors with the same size and fused together. Finally, the object classification and 3D bounding box regression were carried out on the multipath of fully connected layers. The proposed network is a one-stage convolution neural network, which achieves the balance between the accuracy and speed of object detection. The experiments on KITTI datasets show that the proposed network is superior to the contrast algorithms in average precision (AP) and time consumption, which shows the effectiveness of the proposed network. MDPI 2019-03-23 /pmc/articles/PMC6471046/ /pubmed/30909582 http://dx.doi.org/10.3390/s19061434 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Li, Minle Hu, Yihua Zhao, Nanxiang Qian, Qishu One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title	One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title_full	One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title_fullStr	One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title_full_unstemmed	One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title_short	One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection
title_sort	one-stage multi-sensor data fusion convolutional neural network for 3d object detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6471046/ https://www.ncbi.nlm.nih.gov/pubmed/30909582 http://dx.doi.org/10.3390/s19061434
work_keys_str_mv	AT liminle onestagemultisensordatafusionconvolutionalneuralnetworkfor3dobjectdetection AT huyihua onestagemultisensordatafusionconvolutionalneuralnetworkfor3dobjectdetection AT zhaonanxiang onestagemultisensordatafusionconvolutionalneuralnetworkfor3dobjectdetection AT qianqishu onestagemultisensordatafusionconvolutionalneuralnetworkfor3dobjectdetection

One-Stage Multi-Sensor Data Fusion Convolutional Neural Network for 3D Object Detection

Ejemplares similares