Cargando…

General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor

Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This wo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Xianjia, Salimpour, Sahar, Queralta, Jorge Peña, Westerlund, Tomi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10058223/ https://www.ncbi.nlm.nih.gov/pubmed/36991648 http://dx.doi.org/10.3390/s23062936

_version_	1785016573624320000
author	Yu, Xianjia Salimpour, Sahar Queralta, Jorge Peña Westerlund, Tomi
author_facet	Yu, Xianjia Salimpour, Sahar Queralta, Jorge Peña Westerlund, Tomi
author_sort	Yu, Xianjia
collection	PubMed
description	Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explored the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with a 360° field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We showed that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provided both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to their much wider availability and maturity compared to point cloud-based perception.
format	Online Article Text
id	pubmed-10058223
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100582232023-03-30 General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor Yu, Xianjia Salimpour, Sahar Queralta, Jorge Peña Westerlund, Tomi Sensors (Basel) Communication Over the last decade, robotic perception algorithms have significantly benefited from the rapid advances in deep learning (DL). Indeed, a significant amount of the autonomy stack of different commercial and research platforms relies on DL for situational awareness, especially vision sensors. This work explored the potential of general-purpose DL perception algorithms, specifically detection and segmentation neural networks, for processing image-like outputs of advanced lidar sensors. Rather than processing the three-dimensional point cloud data, this is, to the best of our knowledge, the first work to focus on low-resolution images with a 360° field of view obtained with lidar sensors by encoding either depth, reflectivity, or near-infrared light in the image pixels. We showed that with adequate preprocessing, general-purpose DL models can process these images, opening the door to their usage in environmental conditions where vision sensors present inherent limitations. We provided both a qualitative and quantitative analysis of the performance of a variety of neural network architectures. We believe that using DL models built for visual cameras offers significant advantages due to their much wider availability and maturity compared to point cloud-based perception. MDPI 2023-03-08 /pmc/articles/PMC10058223/ /pubmed/36991648 http://dx.doi.org/10.3390/s23062936 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Yu, Xianjia Salimpour, Sahar Queralta, Jorge Peña Westerlund, Tomi General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title_full	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title_fullStr	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title_full_unstemmed	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title_short	General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor
title_sort	general-purpose deep learning detection and segmentation models for images from a lidar-based camera sensor
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10058223/ https://www.ncbi.nlm.nih.gov/pubmed/36991648 http://dx.doi.org/10.3390/s23062936
work_keys_str_mv	AT yuxianjia generalpurposedeeplearningdetectionandsegmentationmodelsforimagesfromalidarbasedcamerasensor AT salimpoursahar generalpurposedeeplearningdetectionandsegmentationmodelsforimagesfromalidarbasedcamerasensor AT queraltajorgepena generalpurposedeeplearningdetectionandsegmentationmodelsforimagesfromalidarbasedcamerasensor AT westerlundtomi generalpurposedeeplearningdetectionandsegmentationmodelsforimagesfromalidarbasedcamerasensor

General-Purpose Deep Learning Detection and Segmentation Models for Images from a Lidar-Based Camera Sensor

Ejemplares similares