Cargando…

RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance

An essential component for the autonomous flight or air-to-ground surveillance of a UAV is an object detection device. It must possess a high detection accuracy and requires real-time data processing to be employed for various tasks such as search and rescue, object tracking and disaster analysis. W...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jongwon, Cho, Jeongho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7957492/
https://www.ncbi.nlm.nih.gov/pubmed/33804364
http://dx.doi.org/10.3390/s21051677
_version_ 1783664660904411136
author Kim, Jongwon
Cho, Jeongho
author_facet Kim, Jongwon
Cho, Jeongho
author_sort Kim, Jongwon
collection PubMed
description An essential component for the autonomous flight or air-to-ground surveillance of a UAV is an object detection device. It must possess a high detection accuracy and requires real-time data processing to be employed for various tasks such as search and rescue, object tracking and disaster analysis. With the recent advancements in multimodal data-based object detection architectures, autonomous driving technology has significantly improved, and the latest algorithm has achieved an average precision of up to 96%. However, these remarkable advances may be unsuitable for the image processing of UAV aerial data directly onboard for object detection because of the following major problems: (1) Objects in aerial views generally have a smaller size than in an image and they are uneven and sparsely distributed throughout an image; (2) Objects are exposed to various environmental changes, such as occlusion and background interference; and (3) The payload weight of a UAV is limited. Thus, we propose employing a new real-time onboard object detection architecture, an RGB aerial image and a point cloud data (PCD) depth map image network (RGDiNet). A faster region-based convolutional neural network was used as the baseline detection network and an RGD, an integration of the RGB aerial image and the depth map reconstructed by the light detection and ranging PCD, was utilized as an input for computational efficiency. Performance tests and evaluation of the proposed RGDiNet were conducted under various operating conditions using hand-labeled aerial datasets. Consequently, it was shown that the proposed method has a superior performance for the detection of vehicles and pedestrians than conventional vision-based methods.
format Online
Article
Text
id pubmed-7957492
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79574922021-03-16 RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance Kim, Jongwon Cho, Jeongho Sensors (Basel) Article An essential component for the autonomous flight or air-to-ground surveillance of a UAV is an object detection device. It must possess a high detection accuracy and requires real-time data processing to be employed for various tasks such as search and rescue, object tracking and disaster analysis. With the recent advancements in multimodal data-based object detection architectures, autonomous driving technology has significantly improved, and the latest algorithm has achieved an average precision of up to 96%. However, these remarkable advances may be unsuitable for the image processing of UAV aerial data directly onboard for object detection because of the following major problems: (1) Objects in aerial views generally have a smaller size than in an image and they are uneven and sparsely distributed throughout an image; (2) Objects are exposed to various environmental changes, such as occlusion and background interference; and (3) The payload weight of a UAV is limited. Thus, we propose employing a new real-time onboard object detection architecture, an RGB aerial image and a point cloud data (PCD) depth map image network (RGDiNet). A faster region-based convolutional neural network was used as the baseline detection network and an RGD, an integration of the RGB aerial image and the depth map reconstructed by the light detection and ranging PCD, was utilized as an input for computational efficiency. Performance tests and evaluation of the proposed RGDiNet were conducted under various operating conditions using hand-labeled aerial datasets. Consequently, it was shown that the proposed method has a superior performance for the detection of vehicles and pedestrians than conventional vision-based methods. MDPI 2021-03-01 /pmc/articles/PMC7957492/ /pubmed/33804364 http://dx.doi.org/10.3390/s21051677 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kim, Jongwon
Cho, Jeongho
RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title_full RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title_fullStr RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title_full_unstemmed RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title_short RGDiNet: Efficient Onboard Object Detection with Faster R-CNN for Air-to-Ground Surveillance
title_sort rgdinet: efficient onboard object detection with faster r-cnn for air-to-ground surveillance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7957492/
https://www.ncbi.nlm.nih.gov/pubmed/33804364
http://dx.doi.org/10.3390/s21051677
work_keys_str_mv AT kimjongwon rgdinetefficientonboardobjectdetectionwithfasterrcnnforairtogroundsurveillance
AT chojeongho rgdinetefficientonboardobjectdetectionwithfasterrcnnforairtogroundsurveillance