Cargando…

MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning

Few-shot object detection (FSOD) is proposed to solve the application problem of traditional detectors in scenarios lacking training samples. The meta-learning methods have attracted the researchers’ attention for their excellent generalization performance. They usually select the same class of supp...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Tianzhao, Sun, Ruoxi, Wan, Yong, Zhang, Fuping, Wei, Jianming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099036/ https://www.ncbi.nlm.nih.gov/pubmed/37050671 http://dx.doi.org/10.3390/s23073609

_version_	1785024961087275008
author	Zhang, Tianzhao Sun, Ruoxi Wan, Yong Zhang, Fuping Wei, Jianming
author_facet	Zhang, Tianzhao Sun, Ruoxi Wan, Yong Zhang, Fuping Wei, Jianming
author_sort	Zhang, Tianzhao
collection	PubMed
description	Few-shot object detection (FSOD) is proposed to solve the application problem of traditional detectors in scenarios lacking training samples. The meta-learning methods have attracted the researchers’ attention for their excellent generalization performance. They usually select the same class of support features according to the query labels to weight the query features. However, the model cannot possess the ability of active identification only by using the same category support features, and feature selection causes difficulties in the testing process without labels. The single-scale feature of the model also leads to poor performance in small object detection. In addition, the hard samples in the support branch impact the backbone’s representation of the support features, thus impacting the feature weighting process. To overcome these problems, we propose a multi-scale feature fusion and attentive learning (MSFFAL) framework for few-shot object detection. We first design the backbone with multi-scale feature fusion and channel attention mechanism to improve the model’s detection accuracy on small objects and the representation of hard support samples. Based on this, we propose an attention loss to replace the feature weighting module. The loss allows the model to consistently represent the objects of the same category in the two branches and realizes the active recognition of the model. The model no longer depends on query labels to select features when testing, optimizing the model testing process. The experiments show that MSFFAL outperforms the state-of-the-art (SOTA) by 0.7–7.8% on the Pascal VOC and exhibits 1.61 times the result of the baseline model in MS COCO’s small objects detection.
format	Online Article Text
id	pubmed-10099036
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-100990362023-04-14 MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning Zhang, Tianzhao Sun, Ruoxi Wan, Yong Zhang, Fuping Wei, Jianming Sensors (Basel) Article Few-shot object detection (FSOD) is proposed to solve the application problem of traditional detectors in scenarios lacking training samples. The meta-learning methods have attracted the researchers’ attention for their excellent generalization performance. They usually select the same class of support features according to the query labels to weight the query features. However, the model cannot possess the ability of active identification only by using the same category support features, and feature selection causes difficulties in the testing process without labels. The single-scale feature of the model also leads to poor performance in small object detection. In addition, the hard samples in the support branch impact the backbone’s representation of the support features, thus impacting the feature weighting process. To overcome these problems, we propose a multi-scale feature fusion and attentive learning (MSFFAL) framework for few-shot object detection. We first design the backbone with multi-scale feature fusion and channel attention mechanism to improve the model’s detection accuracy on small objects and the representation of hard support samples. Based on this, we propose an attention loss to replace the feature weighting module. The loss allows the model to consistently represent the objects of the same category in the two branches and realizes the active recognition of the model. The model no longer depends on query labels to select features when testing, optimizing the model testing process. The experiments show that MSFFAL outperforms the state-of-the-art (SOTA) by 0.7–7.8% on the Pascal VOC and exhibits 1.61 times the result of the baseline model in MS COCO’s small objects detection. MDPI 2023-03-30 /pmc/articles/PMC10099036/ /pubmed/37050671 http://dx.doi.org/10.3390/s23073609 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Tianzhao Sun, Ruoxi Wan, Yong Zhang, Fuping Wei, Jianming MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title	MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title_full	MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title_fullStr	MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title_full_unstemmed	MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title_short	MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning
title_sort	msffal: few-shot object detection via multi-scale feature fusion and attentive learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10099036/ https://www.ncbi.nlm.nih.gov/pubmed/37050671 http://dx.doi.org/10.3390/s23073609
work_keys_str_mv	AT zhangtianzhao msffalfewshotobjectdetectionviamultiscalefeaturefusionandattentivelearning AT sunruoxi msffalfewshotobjectdetectionviamultiscalefeaturefusionandattentivelearning AT wanyong msffalfewshotobjectdetectionviamultiscalefeaturefusionandattentivelearning AT zhangfuping msffalfewshotobjectdetectionviamultiscalefeaturefusionandattentivelearning AT weijianming msffalfewshotobjectdetectionviamultiscalefeaturefusionandattentivelearning

MSFFAL: Few-Shot Object Detection via Multi-Scale Feature Fusion and Attentive Learning

Ejemplares similares