Cargando…
Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images
The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitabl...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385816/ https://www.ncbi.nlm.nih.gov/pubmed/37514717 http://dx.doi.org/10.3390/s23146423 |
_version_ | 1785081504111525888 |
---|---|
author | Hu, Mengzi Li, Ziyang Yu, Jiong Wan, Xueqiang Tan, Haotian Lin, Zeyu |
author_facet | Hu, Mengzi Li, Ziyang Yu, Jiong Wan, Xueqiang Tan, Haotian Lin, Zeyu |
author_sort | Hu, Mengzi |
collection | PubMed |
description | The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitable for aerial image object-detection scenarios oriented toward low-end GPUs. To address this technical challenge, we propose efficient-lightweight You Only Look Once (EL-YOLO), an innovative model that overcomes the limitations of existing detectors and low-end GPU orientation. EL-YOLO surpasses the baseline models in three key areas. Firstly, we design and scrutinize three model architectures to intensify the model’s focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to augment the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. Our proposed EL-YOLO method demonstrates a strong generalization and robustness for the small-object detection problem in aerial images. The experimental results show that, with the model parameters maintained below 10 M while the input image size was unified at 640 × 640 pixels, the AP(S) of the EL-YOLOv5 reached 10.8% and 10.7% and enhanced the APs by 1.9% and 2.2% compared to YOLOv5 on two challenging aerial image datasets, DIOR and VisDrone, respectively. |
format | Online Article Text |
id | pubmed-10385816 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103858162023-07-30 Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images Hu, Mengzi Li, Ziyang Yu, Jiong Wan, Xueqiang Tan, Haotian Lin, Zeyu Sensors (Basel) Article The most significant technical challenges of current aerial image object-detection tasks are the extremely low accuracy for detecting small objects that are densely distributed within a scene and the lack of semantic information. Moreover, existing detectors with large parameter scales are unsuitable for aerial image object-detection scenarios oriented toward low-end GPUs. To address this technical challenge, we propose efficient-lightweight You Only Look Once (EL-YOLO), an innovative model that overcomes the limitations of existing detectors and low-end GPU orientation. EL-YOLO surpasses the baseline models in three key areas. Firstly, we design and scrutinize three model architectures to intensify the model’s focus on small objects and identify the most effective network structure. Secondly, we design efficient spatial pyramid pooling (ESPP) to augment the representation of small-object features in aerial images. Lastly, we introduce the alpha-complete intersection over union (α-CIoU) loss function to tackle the imbalance between positive and negative samples in aerial images. Our proposed EL-YOLO method demonstrates a strong generalization and robustness for the small-object detection problem in aerial images. The experimental results show that, with the model parameters maintained below 10 M while the input image size was unified at 640 × 640 pixels, the AP(S) of the EL-YOLOv5 reached 10.8% and 10.7% and enhanced the APs by 1.9% and 2.2% compared to YOLOv5 on two challenging aerial image datasets, DIOR and VisDrone, respectively. MDPI 2023-07-15 /pmc/articles/PMC10385816/ /pubmed/37514717 http://dx.doi.org/10.3390/s23146423 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Hu, Mengzi Li, Ziyang Yu, Jiong Wan, Xueqiang Tan, Haotian Lin, Zeyu Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title | Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title_full | Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title_fullStr | Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title_full_unstemmed | Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title_short | Efficient-Lightweight YOLO: Improving Small Object Detection in YOLO for Aerial Images |
title_sort | efficient-lightweight yolo: improving small object detection in yolo for aerial images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10385816/ https://www.ncbi.nlm.nih.gov/pubmed/37514717 http://dx.doi.org/10.3390/s23146423 |
work_keys_str_mv | AT humengzi efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages AT liziyang efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages AT yujiong efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages AT wanxueqiang efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages AT tanhaotian efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages AT linzeyu efficientlightweightyoloimprovingsmallobjectdetectioninyoloforaerialimages |