Cargando…

SECOND: Sparsely Embedded Convolutional Detection

LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain,...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Yan, Mao, Yuxing, Li, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210968/
https://www.ncbi.nlm.nih.gov/pubmed/30301196
http://dx.doi.org/10.3390/s18103337
_version_ 1783367236266754048
author Yan, Yan
Mao, Yuxing
Li, Bo
author_facet Yan, Yan
Mao, Yuxing
Li, Bo
author_sort Yan, Yan
collection PubMed
description LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed.
format Online
Article
Text
id pubmed-6210968
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62109682018-11-02 SECOND: Sparsely Embedded Convolutional Detection Yan, Yan Mao, Yuxing Li, Bo Sensors (Basel) Article LiDAR-based or RGB-D-based object detection is used in numerous applications, ranging from autonomous driving to robot vision. Voxel-based 3D convolutional networks have been used for some time to enhance the retention of information when processing point cloud LiDAR data. However, problems remain, including a slow inference speed and low orientation estimation performance. We therefore investigate an improved sparse convolution method for such networks, which significantly increases the speed of both training and inference. We also introduce a new form of angle loss regression to improve the orientation estimation performance and a new data augmentation approach that can enhance the convergence speed and performance. The proposed network produces state-of-the-art results on the KITTI 3D object detection benchmarks while maintaining a fast inference speed. MDPI 2018-10-06 /pmc/articles/PMC6210968/ /pubmed/30301196 http://dx.doi.org/10.3390/s18103337 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yan, Yan
Mao, Yuxing
Li, Bo
SECOND: Sparsely Embedded Convolutional Detection
title SECOND: Sparsely Embedded Convolutional Detection
title_full SECOND: Sparsely Embedded Convolutional Detection
title_fullStr SECOND: Sparsely Embedded Convolutional Detection
title_full_unstemmed SECOND: Sparsely Embedded Convolutional Detection
title_short SECOND: Sparsely Embedded Convolutional Detection
title_sort second: sparsely embedded convolutional detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6210968/
https://www.ncbi.nlm.nih.gov/pubmed/30301196
http://dx.doi.org/10.3390/s18103337
work_keys_str_mv AT yanyan secondsparselyembeddedconvolutionaldetection
AT maoyuxing secondsparselyembeddedconvolutionaldetection
AT libo secondsparselyembeddedconvolutionaldetection