Cargando…

NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection

To address the problems of tiny objects and high resolution of object detection in remote sensing imagery, the methods with coarse-grained image cropping have been widely studied. However, these methods are always inefficient and complex due to the two-stage architecture and the huge computation for...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yukuan, He, Guanglin, Wang, Zehu, Li, Weizhe, Huang, Hongfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269754/
https://www.ncbi.nlm.nih.gov/pubmed/35808445
http://dx.doi.org/10.3390/s22134953
_version_ 1784744298578706432
author Liu, Yukuan
He, Guanglin
Wang, Zehu
Li, Weizhe
Huang, Hongfei
author_facet Liu, Yukuan
He, Guanglin
Wang, Zehu
Li, Weizhe
Huang, Hongfei
author_sort Liu, Yukuan
collection PubMed
description To address the problems of tiny objects and high resolution of object detection in remote sensing imagery, the methods with coarse-grained image cropping have been widely studied. However, these methods are always inefficient and complex due to the two-stage architecture and the huge computation for split images. For these reasons, this article employs YOLO and presents an improved architecture, NRT-YOLO. Specifically, the improvements can be summarized as: extra prediction head and related feature fusion layers; novel nested residual Transformer module, C3NRT; nested residual attention module, C3NRA; and multi-scale testing. The C3NRT module presented in this paper could boost accuracy and reduce complexity of the network at the same time. Moreover, the effectiveness of the proposed method is demonstrated by three kinds of experiments. NRT-YOLO achieves 56.9% mAP(0.5) with only 38.1 M parameters in the DOTA dataset, exceeding YOLOv5l by 4.5%. Also, the results of different classifications show its excellent ability to detect small sample objects. As for the C3NRT module, the ablation study and comparison experiment verified that it has the largest contribution to accuracy increment (2.7% in mAP(0.5)) among the improvements. In conclusion, NRT-YOLO has excellent performance in accuracy improvement and parameter reduction, which is suitable for tiny remote sensing object detection.
format Online
Article
Text
id pubmed-9269754
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92697542022-07-09 NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection Liu, Yukuan He, Guanglin Wang, Zehu Li, Weizhe Huang, Hongfei Sensors (Basel) Article To address the problems of tiny objects and high resolution of object detection in remote sensing imagery, the methods with coarse-grained image cropping have been widely studied. However, these methods are always inefficient and complex due to the two-stage architecture and the huge computation for split images. For these reasons, this article employs YOLO and presents an improved architecture, NRT-YOLO. Specifically, the improvements can be summarized as: extra prediction head and related feature fusion layers; novel nested residual Transformer module, C3NRT; nested residual attention module, C3NRA; and multi-scale testing. The C3NRT module presented in this paper could boost accuracy and reduce complexity of the network at the same time. Moreover, the effectiveness of the proposed method is demonstrated by three kinds of experiments. NRT-YOLO achieves 56.9% mAP(0.5) with only 38.1 M parameters in the DOTA dataset, exceeding YOLOv5l by 4.5%. Also, the results of different classifications show its excellent ability to detect small sample objects. As for the C3NRT module, the ablation study and comparison experiment verified that it has the largest contribution to accuracy increment (2.7% in mAP(0.5)) among the improvements. In conclusion, NRT-YOLO has excellent performance in accuracy improvement and parameter reduction, which is suitable for tiny remote sensing object detection. MDPI 2022-06-30 /pmc/articles/PMC9269754/ /pubmed/35808445 http://dx.doi.org/10.3390/s22134953 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Yukuan
He, Guanglin
Wang, Zehu
Li, Weizhe
Huang, Hongfei
NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title_full NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title_fullStr NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title_full_unstemmed NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title_short NRT-YOLO: Improved YOLOv5 Based on Nested Residual Transformer for Tiny Remote Sensing Object Detection
title_sort nrt-yolo: improved yolov5 based on nested residual transformer for tiny remote sensing object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269754/
https://www.ncbi.nlm.nih.gov/pubmed/35808445
http://dx.doi.org/10.3390/s22134953
work_keys_str_mv AT liuyukuan nrtyoloimprovedyolov5basedonnestedresidualtransformerfortinyremotesensingobjectdetection
AT heguanglin nrtyoloimprovedyolov5basedonnestedresidualtransformerfortinyremotesensingobjectdetection
AT wangzehu nrtyoloimprovedyolov5basedonnestedresidualtransformerfortinyremotesensingobjectdetection
AT liweizhe nrtyoloimprovedyolov5basedonnestedresidualtransformerfortinyremotesensingobjectdetection
AT huanghongfei nrtyoloimprovedyolov5basedonnestedresidualtransformerfortinyremotesensingobjectdetection