Cargando…
LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images
You Only Look Once (YOLO) series detectors are suitable for aerial image object detection because of their excellent real-time ability and performance. Their high performance depends heavily on the anchor generated by clustering the training set. However, the effectiveness of the general Anchor Gene...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144023/ https://www.ncbi.nlm.nih.gov/pubmed/35632300 http://dx.doi.org/10.3390/s22103891 |
_version_ | 1784715948032262144 |
---|---|
author | Wan, Xueqiang Yu, Jiong Tan, Haotian Wang, Junjie |
author_facet | Wan, Xueqiang Yu, Jiong Tan, Haotian Wang, Junjie |
author_sort | Wan, Xueqiang |
collection | PubMed |
description | You Only Look Once (YOLO) series detectors are suitable for aerial image object detection because of their excellent real-time ability and performance. Their high performance depends heavily on the anchor generated by clustering the training set. However, the effectiveness of the general Anchor Generation algorithm is limited by the unique data distribution of the aerial image dataset. The divergence in the distribution of the number of objects with different sizes can cause the anchors to overfit some objects or be assigned to suboptimal layers because anchors of each layer are generated uniformly and affected by the overall data distribution. In this paper, we are inspired by experiments under different anchors settings and proposed the Layered Anchor Generation (LAG) algorithm. In the LAG, objects are layered by their diagonals, and then anchors of each layer are generated by analyzing the diagonals and aspect ratio of objects of the corresponding layer. In this way, anchors of each layer can better match the detection range of each layer. Experiment results showed that our algorithm is of good generality that significantly uprises the performance of You Only Look Once version 3 (YOLOv3), You Only Look Once version 5 (YOLOv5), You Only Learn One Representation (YOLOR), and Cascade Regions with CNN features (Cascade R-CNN) on the Vision Meets Drone (VisDrone) dataset and the object DetectIon in Optical Remote sensing images (DIOR) dataset, and these improvements are cost-free. |
format | Online Article Text |
id | pubmed-9144023 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91440232022-05-29 LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images Wan, Xueqiang Yu, Jiong Tan, Haotian Wang, Junjie Sensors (Basel) Article You Only Look Once (YOLO) series detectors are suitable for aerial image object detection because of their excellent real-time ability and performance. Their high performance depends heavily on the anchor generated by clustering the training set. However, the effectiveness of the general Anchor Generation algorithm is limited by the unique data distribution of the aerial image dataset. The divergence in the distribution of the number of objects with different sizes can cause the anchors to overfit some objects or be assigned to suboptimal layers because anchors of each layer are generated uniformly and affected by the overall data distribution. In this paper, we are inspired by experiments under different anchors settings and proposed the Layered Anchor Generation (LAG) algorithm. In the LAG, objects are layered by their diagonals, and then anchors of each layer are generated by analyzing the diagonals and aspect ratio of objects of the corresponding layer. In this way, anchors of each layer can better match the detection range of each layer. Experiment results showed that our algorithm is of good generality that significantly uprises the performance of You Only Look Once version 3 (YOLOv3), You Only Look Once version 5 (YOLOv5), You Only Learn One Representation (YOLOR), and Cascade Regions with CNN features (Cascade R-CNN) on the Vision Meets Drone (VisDrone) dataset and the object DetectIon in Optical Remote sensing images (DIOR) dataset, and these improvements are cost-free. MDPI 2022-05-20 /pmc/articles/PMC9144023/ /pubmed/35632300 http://dx.doi.org/10.3390/s22103891 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wan, Xueqiang Yu, Jiong Tan, Haotian Wang, Junjie LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title | LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title_full | LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title_fullStr | LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title_full_unstemmed | LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title_short | LAG: Layered Objects to Generate Better Anchors for Object Detection in Aerial Images |
title_sort | lag: layered objects to generate better anchors for object detection in aerial images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144023/ https://www.ncbi.nlm.nih.gov/pubmed/35632300 http://dx.doi.org/10.3390/s22103891 |
work_keys_str_mv | AT wanxueqiang laglayeredobjectstogeneratebetteranchorsforobjectdetectioninaerialimages AT yujiong laglayeredobjectstogeneratebetteranchorsforobjectdetectioninaerialimages AT tanhaotian laglayeredobjectstogeneratebetteranchorsforobjectdetectioninaerialimages AT wangjunjie laglayeredobjectstogeneratebetteranchorsforobjectdetectioninaerialimages |