Cargando…

Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion

The zero-shot image classification (ZSIC) is designed to solve the classification problem when the sample is very small, or the category is missing. A common method is to use attribute or word vectors as a priori category features (auxiliary information) and complete the domain transfer from trainin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Yaru, Feng, Lilong, Song, Xiaoke, Xu, Dawei, Zhai, Yongjie
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9966441/ https://www.ncbi.nlm.nih.gov/pubmed/36850908 http://dx.doi.org/10.3390/s23042311

_version_	1784897017915375616
author	Wang, Yaru Feng, Lilong Song, Xiaoke Xu, Dawei Zhai, Yongjie
author_facet	Wang, Yaru Feng, Lilong Song, Xiaoke Xu, Dawei Zhai, Yongjie
author_sort	Wang, Yaru
collection	PubMed
description	The zero-shot image classification (ZSIC) is designed to solve the classification problem when the sample is very small, or the category is missing. A common method is to use attribute or word vectors as a priori category features (auxiliary information) and complete the domain transfer from training of seen classes to recognition of unseen classes by building a mapping between image features and a priori category features. However, feature extraction of the whole image lacks discrimination, and the amount of information of single attribute features or word vector features of categories is insufficient, which makes the matching degree between image features and prior class features not high and affects the accuracy of the ZSIC model. To this end, a spatial attention mechanism is designed, and an image feature extraction module based on this attention mechanism is constructed to screen critical features with discrimination. A semantic information fusion method based on matrix decomposition is proposed, which first decomposes the attribute features and then fuses them with the extracted word vector features of a dataset to achieve information expansion. Through the above two improvement measures, the classification accuracy of the ZSIC model for unseen images is improved. The experimental results on public datasets verify the effect and superiority of the proposed methods.
format	Online Article Text
id	pubmed-9966441
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99664412023-02-26 Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion Wang, Yaru Feng, Lilong Song, Xiaoke Xu, Dawei Zhai, Yongjie Sensors (Basel) Article The zero-shot image classification (ZSIC) is designed to solve the classification problem when the sample is very small, or the category is missing. A common method is to use attribute or word vectors as a priori category features (auxiliary information) and complete the domain transfer from training of seen classes to recognition of unseen classes by building a mapping between image features and a priori category features. However, feature extraction of the whole image lacks discrimination, and the amount of information of single attribute features or word vector features of categories is insufficient, which makes the matching degree between image features and prior class features not high and affects the accuracy of the ZSIC model. To this end, a spatial attention mechanism is designed, and an image feature extraction module based on this attention mechanism is constructed to screen critical features with discrimination. A semantic information fusion method based on matrix decomposition is proposed, which first decomposes the attribute features and then fuses them with the extracted word vector features of a dataset to achieve information expansion. Through the above two improvement measures, the classification accuracy of the ZSIC model for unseen images is improved. The experimental results on public datasets verify the effect and superiority of the proposed methods. MDPI 2023-02-19 /pmc/articles/PMC9966441/ /pubmed/36850908 http://dx.doi.org/10.3390/s23042311 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Yaru Feng, Lilong Song, Xiaoke Xu, Dawei Zhai, Yongjie Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title	Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title_full	Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title_fullStr	Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title_full_unstemmed	Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title_short	Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion
title_sort	zero-shot image classification method based on attention mechanism and semantic information fusion
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9966441/ https://www.ncbi.nlm.nih.gov/pubmed/36850908 http://dx.doi.org/10.3390/s23042311
work_keys_str_mv	AT wangyaru zeroshotimageclassificationmethodbasedonattentionmechanismandsemanticinformationfusion AT fenglilong zeroshotimageclassificationmethodbasedonattentionmechanismandsemanticinformationfusion AT songxiaoke zeroshotimageclassificationmethodbasedonattentionmechanismandsemanticinformationfusion AT xudawei zeroshotimageclassificationmethodbasedonattentionmechanismandsemanticinformationfusion AT zhaiyongjie zeroshotimageclassificationmethodbasedonattentionmechanismandsemanticinformationfusion

Zero-Shot Image Classification Method Based on Attention Mechanism and Semantic Information Fusion

Ejemplares similares