Cargando…

SECOND: Sparsely Embedded Convolutional Detection

LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yan, Yan, Mao, Yuxing, Li, Bo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210968/ https://www.ncbi.nlm.nih.gov/pubmed/30301196 http://dx.doi.org/10.3390/s18103337

_version_	1783367236266754048
author	Yan, Yan Mao, Yuxing Li, Bo
author_facet	Yan, Yan Mao, Yuxing Li, Bo
author_sort	Yan, Yan
collection	PubMed
description	LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed.
format	Online Article Text
id	pubmed-6210968
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-62109682018-11-02 SECOND: Sparsely Embedded Convolutional Detection Yan, Yan Mao, Yuxing Li, Bo Sensors (Basel) Article LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed. MDPI 2018-10-06 /pmc/articles/PMC6210968/ /pubmed/30301196 http://dx.doi.org/10.3390/s18103337 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Yan, Yan Mao, Yuxing Li, Bo SECOND: Sparsely Embedded Convolutional Detection
title	SECOND: Sparsely Embedded Convolutional Detection
title_full	SECOND: Sparsely Embedded Convolutional Detection
title_fullStr	SECOND: Sparsely Embedded Convolutional Detection
title_full_unstemmed	SECOND: Sparsely Embedded Convolutional Detection
title_short	SECOND: Sparsely Embedded Convolutional Detection
title_sort	second: sparsely embedded convolutional detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210968/ https://www.ncbi.nlm.nih.gov/pubmed/30301196 http://dx.doi.org/10.3390/s18103337
work_keys_str_mv	AT yanyan secondsparselyembeddedconvolutionaldetection AT maoyuxing secondsparselyembeddedconvolutionaldetection AT libo secondsparselyembeddedconvolutionaldetection

SECOND: Sparsely Embedded Convolutional Detection

Ejemplares similares