Cargando…
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods wo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/ https://www.ncbi.nlm.nih.gov/pubmed/34201390 http://dx.doi.org/10.3390/s21123964 |
_version_ | 1783713186972696576 |
---|---|
author | Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin |
author_facet | Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin |
author_sort | Imad, Muhammad |
collection | PubMed |
description | Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS). |
format | Online Article Text |
id | pubmed-8230345 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-82303452021-06-26 Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin Sensors (Basel) Article Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS). MDPI 2021-06-08 /pmc/articles/PMC8230345/ /pubmed/34201390 http://dx.doi.org/10.3390/s21123964 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title | Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title_full | Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title_fullStr | Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title_full_unstemmed | Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title_short | Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud |
title_sort | transfer learning based semantic segmentation for 3d object detection from point cloud |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/ https://www.ncbi.nlm.nih.gov/pubmed/34201390 http://dx.doi.org/10.3390/s21123964 |
work_keys_str_mv | AT imadmuhammad transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud AT doukhioualid transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud AT leedeokjin transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud |