Cargando…

Discrete Infomax Codes for Supervised Representation Learning

For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encode...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Yoonho, Kim, Wonjae, Park, Wonpyo, Choi, Seungjin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9030705/
https://www.ncbi.nlm.nih.gov/pubmed/35455164
http://dx.doi.org/10.3390/e24040501
_version_ 1784692207378235392
author Lee, Yoonho
Kim, Wonjae
Park, Wonpyo
Choi, Seungjin
author_facet Lee, Yoonho
Kim, Wonjae
Park, Wonpyo
Choi, Seungjin
author_sort Lee, Yoonho
collection PubMed
description For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encoder that yields k-way d-dimensional codes associated with input data. Our model maximizes the mutual information between codes and ground-truth class labels, with a regularization which encourages entries of a codeword to be statistically independent. In this context, we show that the infomax principle also justifies existing loss functions, such as cross-entropy as its special cases. Our analysis also shows that using shorter codes reduces overfitting in the context of few-shot classification, and our various experiments show this implicit task-level regularization effect of DIMCO. Furthermore, we show that the codes learned by DIMCO are efficient in terms of both memory and retrieval time compared to prior methods.
format Online
Article
Text
id pubmed-9030705
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90307052022-04-23 Discrete Infomax Codes for Supervised Representation Learning Lee, Yoonho Kim, Wonjae Park, Wonpyo Choi, Seungjin Entropy (Basel) Article For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encoder that yields k-way d-dimensional codes associated with input data. Our model maximizes the mutual information between codes and ground-truth class labels, with a regularization which encourages entries of a codeword to be statistically independent. In this context, we show that the infomax principle also justifies existing loss functions, such as cross-entropy as its special cases. Our analysis also shows that using shorter codes reduces overfitting in the context of few-shot classification, and our various experiments show this implicit task-level regularization effect of DIMCO. Furthermore, we show that the codes learned by DIMCO are efficient in terms of both memory and retrieval time compared to prior methods. MDPI 2022-04-02 /pmc/articles/PMC9030705/ /pubmed/35455164 http://dx.doi.org/10.3390/e24040501 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lee, Yoonho
Kim, Wonjae
Park, Wonpyo
Choi, Seungjin
Discrete Infomax Codes for Supervised Representation Learning
title Discrete Infomax Codes for Supervised Representation Learning
title_full Discrete Infomax Codes for Supervised Representation Learning
title_fullStr Discrete Infomax Codes for Supervised Representation Learning
title_full_unstemmed Discrete Infomax Codes for Supervised Representation Learning
title_short Discrete Infomax Codes for Supervised Representation Learning
title_sort discrete infomax codes for supervised representation learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9030705/
https://www.ncbi.nlm.nih.gov/pubmed/35455164
http://dx.doi.org/10.3390/e24040501
work_keys_str_mv AT leeyoonho discreteinfomaxcodesforsupervisedrepresentationlearning
AT kimwonjae discreteinfomaxcodesforsupervisedrepresentationlearning
AT parkwonpyo discreteinfomaxcodesforsupervisedrepresentationlearning
AT choiseungjin discreteinfomaxcodesforsupervisedrepresentationlearning