Cargando…

Single-Shot Object Detection via Feature Enhancement and Channel Attention

Features play a critical role in computer vision tasks. Deep learning methods have resulted in significant breakthroughs in the field of object detection, but it is still an extremely challenging obstacle when an object is very small. In this work, we propose a feature-enhancement- and channel-atten...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Yi, Wang, Lingna, Wang, Zeji
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503941/ https://www.ncbi.nlm.nih.gov/pubmed/36146207 http://dx.doi.org/10.3390/s22186857

_version_	1784796090983251968
author	Li, Yi Wang, Lingna Wang, Zeji
author_facet	Li, Yi Wang, Lingna Wang, Zeji
author_sort	Li, Yi
collection	PubMed
description	Features play a critical role in computer vision tasks. Deep learning methods have resulted in significant breakthroughs in the field of object detection, but it is still an extremely challenging obstacle when an object is very small. In this work, we propose a feature-enhancement- and channel-attention-guided single-shot detector called the FCSSD with four modules to improve object detection performance. Specifically, inspired by the structure of atrous convolution, we built an efficient feature-extraction module (EFM) in order to explore contextual information along the spatial dimension, and then pyramidal aggregation module (PAM) is presented to explore the semantic features of deep layers, thus reducing the semantic gap between multi-scale features. Furthermore, we construct an effective feature pyramid refinement fusion (FPRF) to refine the multi-scale features and create benefits for richer object knowledge. Finally, an attention-guided module (AGM) is developed to balance the channel weights and optimize the final integrated features on each level; this alleviates the aliasing effects of the FPN with negligible computational costs. The FCSSD exploits richer information of shallow layers and higher layers by using our designed modules, thus accomplishing excellent detection performance for multi-scale object detection and reaching a better tradeoff between accuracy and inference time. Experiments on PASCAL VOC and MS COCO datasets were conducted to evaluate the performance, showing that our FCSSD achieves competitive detection performance compared with existing mainstream object detection methods.
format	Online Article Text
id	pubmed-9503941
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-95039412022-09-24 Single-Shot Object Detection via Feature Enhancement and Channel Attention Li, Yi Wang, Lingna Wang, Zeji Sensors (Basel) Article Features play a critical role in computer vision tasks. Deep learning methods have resulted in significant breakthroughs in the field of object detection, but it is still an extremely challenging obstacle when an object is very small. In this work, we propose a feature-enhancement- and channel-attention-guided single-shot detector called the FCSSD with four modules to improve object detection performance. Specifically, inspired by the structure of atrous convolution, we built an efficient feature-extraction module (EFM) in order to explore contextual information along the spatial dimension, and then pyramidal aggregation module (PAM) is presented to explore the semantic features of deep layers, thus reducing the semantic gap between multi-scale features. Furthermore, we construct an effective feature pyramid refinement fusion (FPRF) to refine the multi-scale features and create benefits for richer object knowledge. Finally, an attention-guided module (AGM) is developed to balance the channel weights and optimize the final integrated features on each level; this alleviates the aliasing effects of the FPN with negligible computational costs. The FCSSD exploits richer information of shallow layers and higher layers by using our designed modules, thus accomplishing excellent detection performance for multi-scale object detection and reaching a better tradeoff between accuracy and inference time. Experiments on PASCAL VOC and MS COCO datasets were conducted to evaluate the performance, showing that our FCSSD achieves competitive detection performance compared with existing mainstream object detection methods. MDPI 2022-09-10 /pmc/articles/PMC9503941/ /pubmed/36146207 http://dx.doi.org/10.3390/s22186857 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Li, Yi Wang, Lingna Wang, Zeji Single-Shot Object Detection via Feature Enhancement and Channel Attention
title	Single-Shot Object Detection via Feature Enhancement and Channel Attention
title_full	Single-Shot Object Detection via Feature Enhancement and Channel Attention
title_fullStr	Single-Shot Object Detection via Feature Enhancement and Channel Attention
title_full_unstemmed	Single-Shot Object Detection via Feature Enhancement and Channel Attention
title_short	Single-Shot Object Detection via Feature Enhancement and Channel Attention
title_sort	single-shot object detection via feature enhancement and channel attention
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503941/ https://www.ncbi.nlm.nih.gov/pubmed/36146207 http://dx.doi.org/10.3390/s22186857
work_keys_str_mv	AT liyi singleshotobjectdetectionviafeatureenhancementandchannelattention AT wanglingna singleshotobjectdetectionviafeatureenhancementandchannelattention AT wangzeji singleshotobjectdetectionviafeatureenhancementandchannelattention

Single-Shot Object Detection via Feature Enhancement and Channel Attention

Ejemplares similares