Cargando…

Zero-Shot Image Classification Based on a Learnable Deep Metric

The supervised model based on deep learning has made great achievements in the field of image classification after training with a large number of labeled samples. However, there are many categories without or only with a few labeled training samples in practice, and some categories even have no tra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Jingyi, Shi, Caijuan, Tu, Dongjing, Shi, Ze, Liu, Yazhi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124744/ https://www.ncbi.nlm.nih.gov/pubmed/34067100 http://dx.doi.org/10.3390/s21093241

_version_	1783693297250729984
author	Liu, Jingyi Shi, Caijuan Tu, Dongjing Shi, Ze Liu, Yazhi
author_facet	Liu, Jingyi Shi, Caijuan Tu, Dongjing Shi, Ze Liu, Yazhi
author_sort	Liu, Jingyi
collection	PubMed
description	The supervised model based on deep learning has made great achievements in the field of image classification after training with a large number of labeled samples. However, there are many categories without or only with a few labeled training samples in practice, and some categories even have no training samples at all. The proposed zero-shot learning greatly reduces the dependence on labeled training samples for image classification models. Nevertheless, there are limitations in learning the similarity of visual features and semantic features with a predefined fixed metric (e.g., as Euclidean distance), as well as the problem of semantic gap in the mapping process. To address these problems, a new zero-shot image classification method based on an end-to-end learnable deep metric is proposed in this paper. First, the common space embedding is adopted to map the visual features and semantic features into a common space. Second, an end-to-end learnable deep metric, that is, the relation network is utilized to learn the similarity of visual features and semantic features. Finally, the invisible images are classified, according to the similarity score. Extensive experiments are carried out on four datasets and the results indicate the effectiveness of the proposed method.
format	Online Article Text
id	pubmed-8124744
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-81247442021-05-17 Zero-Shot Image Classification Based on a Learnable Deep Metric Liu, Jingyi Shi, Caijuan Tu, Dongjing Shi, Ze Liu, Yazhi Sensors (Basel) Article The supervised model based on deep learning has made great achievements in the field of image classification after training with a large number of labeled samples. However, there are many categories without or only with a few labeled training samples in practice, and some categories even have no training samples at all. The proposed zero-shot learning greatly reduces the dependence on labeled training samples for image classification models. Nevertheless, there are limitations in learning the similarity of visual features and semantic features with a predefined fixed metric (e.g., as Euclidean distance), as well as the problem of semantic gap in the mapping process. To address these problems, a new zero-shot image classification method based on an end-to-end learnable deep metric is proposed in this paper. First, the common space embedding is adopted to map the visual features and semantic features into a common space. Second, an end-to-end learnable deep metric, that is, the relation network is utilized to learn the similarity of visual features and semantic features. Finally, the invisible images are classified, according to the similarity score. Extensive experiments are carried out on four datasets and the results indicate the effectiveness of the proposed method. MDPI 2021-05-07 /pmc/articles/PMC8124744/ /pubmed/34067100 http://dx.doi.org/10.3390/s21093241 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Jingyi Shi, Caijuan Tu, Dongjing Shi, Ze Liu, Yazhi Zero-Shot Image Classification Based on a Learnable Deep Metric
title	Zero-Shot Image Classification Based on a Learnable Deep Metric
title_full	Zero-Shot Image Classification Based on a Learnable Deep Metric
title_fullStr	Zero-Shot Image Classification Based on a Learnable Deep Metric
title_full_unstemmed	Zero-Shot Image Classification Based on a Learnable Deep Metric
title_short	Zero-Shot Image Classification Based on a Learnable Deep Metric
title_sort	zero-shot image classification based on a learnable deep metric
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8124744/ https://www.ncbi.nlm.nih.gov/pubmed/34067100 http://dx.doi.org/10.3390/s21093241
work_keys_str_mv	AT liujingyi zeroshotimageclassificationbasedonalearnabledeepmetric AT shicaijuan zeroshotimageclassificationbasedonalearnabledeepmetric AT tudongjing zeroshotimageclassificationbasedonalearnabledeepmetric AT shize zeroshotimageclassificationbasedonalearnabledeepmetric AT liuyazhi zeroshotimageclassificationbasedonalearnabledeepmetric

Zero-Shot Image Classification Based on a Learnable Deep Metric

Ejemplares similares