Cargando…

Object Detection for UAV Aerial Scenarios Based on Vectorized IOU

Object detection in unmanned aerial vehicle (UAV) images is an extremely challenging task and involves problems such as multi-scale objects, a high proportion of small objects, and high overlap between objects. To address these issues, first, we design a Vectorized Intersection Over Union (VIOU) los...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Shun, Lu, Hanyu, Dong, Jun, Wu, Shuang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054878/
https://www.ncbi.nlm.nih.gov/pubmed/36991772
http://dx.doi.org/10.3390/s23063061
_version_ 1785015777880965120
author Lu, Shun
Lu, Hanyu
Dong, Jun
Wu, Shuang
author_facet Lu, Shun
Lu, Hanyu
Dong, Jun
Wu, Shuang
author_sort Lu, Shun
collection PubMed
description Object detection in unmanned aerial vehicle (UAV) images is an extremely challenging task and involves problems such as multi-scale objects, a high proportion of small objects, and high overlap between objects. To address these issues, first, we design a Vectorized Intersection Over Union (VIOU) loss based on YOLOv5s. This loss uses the width and height of the bounding box as a vector to construct a cosine function that corresponds to the size of the box and the aspect ratio and directly compares the center point value of the box to improve the accuracy of the bounding box regression. Second, we propose a Progressive Feature Fusion Network (PFFN) that addresses the issue of insufficient semantic extraction of shallow features by Panet. This allows each node of the network to fuse semantic information from deep layers with features from the current layer, thus significantly improving the detection ability of small objects in multi-scale scenes. Finally, we propose an Asymmetric Decoupled (AD) head, which separates the classification network from the regression network and improves the classification and regression capabilities of the network. Our proposed method results in significant improvements on two benchmark datasets compared to YOLOv5s. On the VisDrone 2019 dataset, the performance increased by 9.7% from 34.9% to 44.6%, and on the DOTA dataset, the performance increased by 2.1%.
format Online
Article
Text
id pubmed-10054878
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100548782023-03-30 Object Detection for UAV Aerial Scenarios Based on Vectorized IOU Lu, Shun Lu, Hanyu Dong, Jun Wu, Shuang Sensors (Basel) Article Object detection in unmanned aerial vehicle (UAV) images is an extremely challenging task and involves problems such as multi-scale objects, a high proportion of small objects, and high overlap between objects. To address these issues, first, we design a Vectorized Intersection Over Union (VIOU) loss based on YOLOv5s. This loss uses the width and height of the bounding box as a vector to construct a cosine function that corresponds to the size of the box and the aspect ratio and directly compares the center point value of the box to improve the accuracy of the bounding box regression. Second, we propose a Progressive Feature Fusion Network (PFFN) that addresses the issue of insufficient semantic extraction of shallow features by Panet. This allows each node of the network to fuse semantic information from deep layers with features from the current layer, thus significantly improving the detection ability of small objects in multi-scale scenes. Finally, we propose an Asymmetric Decoupled (AD) head, which separates the classification network from the regression network and improves the classification and regression capabilities of the network. Our proposed method results in significant improvements on two benchmark datasets compared to YOLOv5s. On the VisDrone 2019 dataset, the performance increased by 9.7% from 34.9% to 44.6%, and on the DOTA dataset, the performance increased by 2.1%. MDPI 2023-03-13 /pmc/articles/PMC10054878/ /pubmed/36991772 http://dx.doi.org/10.3390/s23063061 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lu, Shun
Lu, Hanyu
Dong, Jun
Wu, Shuang
Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title_full Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title_fullStr Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title_full_unstemmed Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title_short Object Detection for UAV Aerial Scenarios Based on Vectorized IOU
title_sort object detection for uav aerial scenarios based on vectorized iou
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054878/
https://www.ncbi.nlm.nih.gov/pubmed/36991772
http://dx.doi.org/10.3390/s23063061
work_keys_str_mv AT lushun objectdetectionforuavaerialscenariosbasedonvectorizediou
AT luhanyu objectdetectionforuavaerialscenariosbasedonvectorizediou
AT dongjun objectdetectionforuavaerialscenariosbasedonvectorizediou
AT wushuang objectdetectionforuavaerialscenariosbasedonvectorizediou