Cargando…

Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images

The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitabl...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Mengzi, Li, Ziyang, Yu, Jiong, Wan, Xueqiang, Tan, Haotian, Lin, Zeyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385816/
https://www.ncbi.nlm.nih.gov/pubmed/37514717
http://dx.doi.org/10.3390/s23146423
_version_ 1785081504111525888
author Hu, Mengzi
Li, Ziyang
Yu, Jiong
Wan, Xueqiang
Tan, Haotian
Lin, Zeyu
author_facet Hu, Mengzi
Li, Ziyang
Yu, Jiong
Wan, Xueqiang
Tan, Haotian
Lin, Zeyu
author_sort Hu, Mengzi
collection PubMed
description The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitable for aerial image object-detection scenarios oriented toward low-end GPUs. To address this technical challenge, we propose efficient-lightweight You Only Look Once (EL-YOLO), an innovative model that overcomes the limitations of existing detectors and low-end GPU orientation. EL-YOLO surpasses the baseline models in three key areas. Firstly, we design and scrutinize three model architectures to intensify the model’s focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to augment the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. Our proposed EL-YOLO method demonstrates a strong generalization and robustness for the small-object detection problem in aerial images. The experimental results show that, with the model parameters maintained below 10 M while the input image size was unified at 640 × 640 pixels, the AP(S) of the EL-YOLOv5 reached 10.8% and 10.7% and enhanced the APs by 1.9% and 2.2% compared to YOLOv5 on two challenging aerial image datasets, DIOR and VisDrone, respectively.
format Online
Article
Text
id pubmed-10385816
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103858162023-07-30 Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images Hu, Mengzi Li, Ziyang Yu, Jiong Wan, Xueqiang Tan, Haotian Lin, Zeyu Sensors (Basel) Article The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitable for aerial image object-detection scenarios oriented toward low-end GPUs. To address this technical challenge, we propose efficient-lightweight You Only Look Once (EL-YOLO), an innovative model that overcomes the limitations of existing detectors and low-end GPU orientation. EL-YOLO surpasses the baseline models in three key areas. Firstly, we design and scrutinize three model architectures to intensify the model’s focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to augment the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. Our proposed EL-YOLO method demonstrates a strong generalization and robustness for the small-object detection problem in aerial images. The experimental results show that, with the model parameters maintained below 10 M while the input image size was unified at 640 × 640 pixels, the AP(S) of the EL-YOLOv5 reached 10.8% and 10.7% and enhanced the APs by 1.9% and 2.2% compared to YOLOv5 on two challenging aerial image datasets, DIOR and VisDrone, respectively. MDPI 2023-07-15 /pmc/articles/PMC10385816/ /pubmed/37514717 http://dx.doi.org/10.3390/s23146423 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hu, Mengzi
Li, Ziyang
Yu, Jiong
Wan, Xueqiang
Tan, Haotian
Lin, Zeyu
Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title_full Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title_fullStr Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title_full_unstemmed Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title_short Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
title_sort efficient-lightweight yolo: improving small object detection in yolo for aerial images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385816/
https://www.ncbi.nlm.nih.gov/pubmed/37514717
http://dx.doi.org/10.3390/s23146423
work_keys_str_mv AT humengzi efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages
AT liziyang efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages
AT yujiong efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages
AT wanxueqiang efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages
AT tanhaotian efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages
AT linzeyu efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages