Cargando…
AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection
Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658690/ https://www.ncbi.nlm.nih.gov/pubmed/36365919 http://dx.doi.org/10.3390/s22218221 |
_version_ | 1784830013567139840 |
---|---|
author | Yan, Bingnan Li, Jiaxin Yang, Zhaozhao Zhang, Xinpeng Hao, Xiaolong |
author_facet | Yan, Bingnan Li, Jiaxin Yang, Zhaozhao Zhang, Xinpeng Hao, Xiaolong |
author_sort | Yan, Bingnan |
collection | PubMed |
description | Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss, but it cannot restore the information that has been lost. To this end, an auxiliary information-enhanced YOLO is proposed to improve the sensitivity and detection performance of YOLOv5 to small objects. Firstly, a context enhancement module containing a receptive field size of 21×21 is proposed, which captures the global and local information of the image by fusing multi-scale receptive fields, and introduces an attention branch to enhance the expressive ability of key features and suppress background noise. To further enhance the feature expression ability of small objects, we introduce the high- and low-frequency information decomposed by wavelet transform into PANet to participate in multi-scale feature fusion, so as to solve the problem that the features of small objects gradually disappear after multiple downsampling and pooling operations. Experiments on the challenging dataset Tsinghua–Tencent 100 K show that the mean average precision of the proposed model is 9.5% higher than that of the original YOLOv5 while maintaining the real-time speed, which is better than the mainstream object detection models. |
format | Online Article Text |
id | pubmed-9658690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96586902022-11-15 AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection Yan, Bingnan Li, Jiaxin Yang, Zhaozhao Zhang, Xinpeng Hao, Xiaolong Sensors (Basel) Article Small object detection is one of the key challenges in the current computer vision field due to the low amount of information carried and the information loss caused by feature extraction. You Only Look Once v5 (YOLOv5) adopts the Path Aggregation Network to alleviate the problem of information loss, but it cannot restore the information that has been lost. To this end, an auxiliary information-enhanced YOLO is proposed to improve the sensitivity and detection performance of YOLOv5 to small objects. Firstly, a context enhancement module containing a receptive field size of 21×21 is proposed, which captures the global and local information of the image by fusing multi-scale receptive fields, and introduces an attention branch to enhance the expressive ability of key features and suppress background noise. To further enhance the feature expression ability of small objects, we introduce the high- and low-frequency information decomposed by wavelet transform into PANet to participate in multi-scale feature fusion, so as to solve the problem that the features of small objects gradually disappear after multiple downsampling and pooling operations. Experiments on the challenging dataset Tsinghua–Tencent 100 K show that the mean average precision of the proposed model is 9.5% higher than that of the original YOLOv5 while maintaining the real-time speed, which is better than the mainstream object detection models. MDPI 2022-10-27 /pmc/articles/PMC9658690/ /pubmed/36365919 http://dx.doi.org/10.3390/s22218221 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yan, Bingnan Li, Jiaxin Yang, Zhaozhao Zhang, Xinpeng Hao, Xiaolong AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title | AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title_full | AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title_fullStr | AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title_full_unstemmed | AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title_short | AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection |
title_sort | aie-yolo: auxiliary information enhanced yolo for small object detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9658690/ https://www.ncbi.nlm.nih.gov/pubmed/36365919 http://dx.doi.org/10.3390/s22218221 |
work_keys_str_mv | AT yanbingnan aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection AT lijiaxin aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection AT yangzhaozhao aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection AT zhangxinpeng aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection AT haoxiaolong aieyoloauxiliaryinformationenhancedyoloforsmallobjectdetection |