Cargando…

A Multi-Layer Feature Fusion Method for Few-Shot Image Classification

In image classification, few-shot learning deals with recognizing visual categories from a few tagged examples. The degree of expressiveness of the encoded features in this scenario is a crucial question that needs to be addressed in the models being trained. Recent approaches have achieved encourag...

Descripción completa

Detalles Bibliográficos
Autores principales: Gomes, Jacó C., Borges, Lurdineide de A. B., Borges, Díbio L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422596/
https://www.ncbi.nlm.nih.gov/pubmed/37571662
http://dx.doi.org/10.3390/s23156880
_version_ 1785089249883717632
author Gomes, Jacó C.
Borges, Lurdineide de A. B.
Borges, Díbio L.
author_facet Gomes, Jacó C.
Borges, Lurdineide de A. B.
Borges, Díbio L.
author_sort Gomes, Jacó C.
collection PubMed
description In image classification, few-shot learning deals with recognizing visual categories from a few tagged examples. The degree of expressiveness of the encoded features in this scenario is a crucial question that needs to be addressed in the models being trained. Recent approaches have achieved encouraging results in improving few-shot models in deep learning, but designing a competitive and simple architecture is challenging, especially considering its requirement in many practical applications. This work proposes an improved few-shot model based on a multi-layer feature fusion (FMLF) method. The presented approach includes extended feature extraction and fusion mechanisms in the Convolutional Neural Network (CNN) backbone, as well as an effective metric to compute the divergences in the end. In order to evaluate the proposed method, a challenging visual classification problem, maize crop insect classification with specific pests and beneficial categories, is addressed, serving both as a test of our model and as a means to propose a novel dataset. Experiments were carried out to compare the results with ResNet50, VGG16, and MobileNetv2, used as feature extraction backbones, and the FMLF method demonstrated higher accuracy with fewer parameters. The proposed FMLF method improved accuracy scores by up to 3.62% in one-shot and 2.82% in five-shot classification tasks compared to a traditional backbone, which uses only global image features.
format Online
Article
Text
id pubmed-10422596
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104225962023-08-13 A Multi-Layer Feature Fusion Method for Few-Shot Image Classification Gomes, Jacó C. Borges, Lurdineide de A. B. Borges, Díbio L. Sensors (Basel) Article In image classification, few-shot learning deals with recognizing visual categories from a few tagged examples. The degree of expressiveness of the encoded features in this scenario is a crucial question that needs to be addressed in the models being trained. Recent approaches have achieved encouraging results in improving few-shot models in deep learning, but designing a competitive and simple architecture is challenging, especially considering its requirement in many practical applications. This work proposes an improved few-shot model based on a multi-layer feature fusion (FMLF) method. The presented approach includes extended feature extraction and fusion mechanisms in the Convolutional Neural Network (CNN) backbone, as well as an effective metric to compute the divergences in the end. In order to evaluate the proposed method, a challenging visual classification problem, maize crop insect classification with specific pests and beneficial categories, is addressed, serving both as a test of our model and as a means to propose a novel dataset. Experiments were carried out to compare the results with ResNet50, VGG16, and MobileNetv2, used as feature extraction backbones, and the FMLF method demonstrated higher accuracy with fewer parameters. The proposed FMLF method improved accuracy scores by up to 3.62% in one-shot and 2.82% in five-shot classification tasks compared to a traditional backbone, which uses only global image features. MDPI 2023-08-03 /pmc/articles/PMC10422596/ /pubmed/37571662 http://dx.doi.org/10.3390/s23156880 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gomes, Jacó C.
Borges, Lurdineide de A. B.
Borges, Díbio L.
A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title_full A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title_fullStr A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title_full_unstemmed A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title_short A Multi-Layer Feature Fusion Method for Few-Shot Image Classification
title_sort multi-layer feature fusion method for few-shot image classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422596/
https://www.ncbi.nlm.nih.gov/pubmed/37571662
http://dx.doi.org/10.3390/s23156880
work_keys_str_mv AT gomesjacoc amultilayerfeaturefusionmethodforfewshotimageclassification
AT borgeslurdineidedeab amultilayerfeaturefusionmethodforfewshotimageclassification
AT borgesdibiol amultilayerfeaturefusionmethodforfewshotimageclassification
AT gomesjacoc multilayerfeaturefusionmethodforfewshotimageclassification
AT borgeslurdineidedeab multilayerfeaturefusionmethodforfewshotimageclassification
AT borgesdibiol multilayerfeaturefusionmethodforfewshotimageclassification