Cargando…

AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection

Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Bingnan, Li, Jiaxin, Yang, Zhaozhao, Zhang, Xinpeng, Hao, Xiaolong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658690/
https://www.ncbi.nlm.nih.gov/pubmed/36365919
http://dx.doi.org/10.3390/s22218221
_version_ 1784830013567139840
author Yan, Bingnan
Li, Jiaxin
Yang, Zhaozhao
Zhang, Xinpeng
Hao, Xiaolong
author_facet Yan, Bingnan
Li, Jiaxin
Yang, Zhaozhao
Zhang, Xinpeng
Hao, Xiaolong
author_sort Yan, Bingnan
collection PubMed
description Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss, but it cannot restore the information that has been lost. To this end, an auxiliary information-enhanced YOLO is proposed to improve the sensitivity and detection performance of YOLOv5 to small objects. Firstly, a context enhancement module containing a receptive field size of 21×21 is proposed, which captures the global and local information of the image by fusing multi-scale receptive fields, and introduces an attention branch to enhance the expressive ability of key features and suppress background noise. To further enhance the feature expression ability of small objects, we introduce the high- and low-frequency information decomposed by wavelet transform into PANet to participate in multi-scale feature fusion, so as to solve the problem that the features of small objects gradually disappear after multiple downsampling and pooling operations. Experiments on the challenging dataset Tsinghua–Tencent 100 K show that the mean average precision of the proposed model is 9.5% higher than that of the original YOLOv5 while maintaining the real-time speed, which is better than the mainstream object detection models.
format Online
Article
Text
id pubmed-9658690
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96586902022-11-15 AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection Yan, Bingnan Li, Jiaxin Yang, Zhaozhao Zhang, Xinpeng Hao, Xiaolong Sensors (Basel) Article Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss, but it cannot restore the information that has been lost. To this end, an auxiliary information-enhanced YOLO is proposed to improve the sensitivity and detection performance of YOLOv5 to small objects. Firstly, a context enhancement module containing a receptive field size of 21×21 is proposed, which captures the global and local information of the image by fusing multi-scale receptive fields, and introduces an attention branch to enhance the expressive ability of key features and suppress background noise. To further enhance the feature expression ability of small objects, we introduce the high- and low-frequency information decomposed by wavelet transform into PANet to participate in multi-scale feature fusion, so as to solve the problem that the features of small objects gradually disappear after multiple downsampling and pooling operations. Experiments on the challenging dataset Tsinghua–Tencent 100 K show that the mean average precision of the proposed model is 9.5% higher than that of the original YOLOv5 while maintaining the real-time speed, which is better than the mainstream object detection models. MDPI 2022-10-27 /pmc/articles/PMC9658690/ /pubmed/36365919 http://dx.doi.org/10.3390/s22218221 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yan, Bingnan
Li, Jiaxin
Yang, Zhaozhao
Zhang, Xinpeng
Hao, Xiaolong
AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title_full AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title_fullStr AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title_full_unstemmed AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title_short AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
title_sort aie-yolo: auxiliary information enhanced yolo for small object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658690/
https://www.ncbi.nlm.nih.gov/pubmed/36365919
http://dx.doi.org/10.3390/s22218221
work_keys_str_mv AT yanbingnan aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection
AT lijiaxin aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection
AT yangzhaozhao aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection
AT zhangxinpeng aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection
AT haoxiaolong aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection