Discrete Infomax Codes for Supervised Representation Learning
For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encoder that yields k-way d-dimensional codes associated with input data. Our model maximizes the mutual information between codes and ground-truth class labels, with a regularization which encourages entries of a codeword to be statistically independent. In this context, we show that the infomax principle also justifies existing loss functions, such as cross-entropy, as special cases. Our analysis also shows that using shorter codes reduces overfitting in the context of few-shot classification, and our various experiments show this implicit task-level regularization effect of DIMCO. Furthermore, we show that the codes learned by DIMCO are efficient in terms of both memory and retrieval time compared to prior methods.
Main Authors: Lee, Yoonho; Kim, Wonjae; Park, Wonpyo; Choi, Seungjin
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects: Article
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9030705/
https://www.ncbi.nlm.nih.gov/pubmed/35455164
http://dx.doi.org/10.3390/e24040501
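The abstract above describes an encoder trained to emit k-way, d-dimensional discrete codes while maximizing the mutual information between the codes and class labels. As a minimal sketch of that idea only (not the authors' implementation: the toy linear encoder, the array shapes, and the plug-in count-based entropy estimates are assumptions for illustration), the Python snippet below produces hard codes from d independent k-way heads and estimates I(c_j; y) for each code entry from empirical co-occurrence frequencies.

```python
import numpy as np


def empirical_mutual_information(codes, labels, k, n_classes):
    """Plug-in estimate of I(c_j; y) for each of the d code entries (in nats).

    codes:  (n, d) integer array with entries in {0, ..., k-1}
    labels: (n,)   integer array with entries in {0, ..., n_classes-1}
    """
    n, d = codes.shape
    mi = np.zeros(d)
    for j in range(d):
        # Joint distribution p(c_j, y) from co-occurrence counts.
        joint = np.zeros((k, n_classes))
        for c, y in zip(codes[:, j], labels):
            joint[c, y] += 1.0
        joint /= n
        pc = joint.sum(axis=1, keepdims=True)  # marginal p(c_j), shape (k, 1)
        py = joint.sum(axis=0, keepdims=True)  # marginal p(y),   shape (1, n_classes)
        nz = joint > 0                         # mask to avoid log(0)
        mi[j] = np.sum(joint[nz] * np.log(joint[nz] / (pc @ py)[nz]))
    return mi


def toy_encoder(x, weights, k, d):
    """Hypothetical stand-in encoder: d heads, each choosing one of k symbols.

    x:       (n, p) inputs
    weights: (p, d * k) projection (in practice, a trained network)
    Returns hard codes of shape (n, d), the arg-max symbol of each head.
    """
    logits = (x @ weights).reshape(len(x), d, k)  # one k-way head per entry
    return logits.argmax(axis=-1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, p, k, d, n_classes = 512, 32, 8, 4, 10
    x = rng.normal(size=(n, p))
    y = rng.integers(0, n_classes, size=n)
    codes = toy_encoder(x, rng.normal(size=(p, d * k)), k, d)
    print(empirical_mutual_information(codes, y, k, n_classes))
```

Because I(c_j; y) = H(y) − H(y | c_j) and H(y) is fixed by the dataset, increasing this estimate is equivalent to decreasing the conditional entropy of labels given codes, which is one way to read the abstract's remark that the infomax principle recovers cross-entropy-style losses as special cases.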
Field | Value |
---|---|
_version_ | 1784692207378235392 |
author | Lee, Yoonho; Kim, Wonjae; Park, Wonpyo; Choi, Seungjin |
author_facet | Lee, Yoonho; Kim, Wonjae; Park, Wonpyo; Choi, Seungjin |
author_sort | Lee, Yoonho |
collection | PubMed |
description | For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encoder that yields k-way d-dimensional codes associated with input data. Our model maximizes the mutual information between codes and ground-truth class labels, with a regularization which encourages entries of a codeword to be statistically independent. In this context, we show that the infomax principle also justifies existing loss functions, such as cross-entropy as its special cases. Our analysis also shows that using shorter codes reduces overfitting in the context of few-shot classification, and our various experiments show this implicit task-level regularization effect of DIMCO. Furthermore, we show that the codes learned by DIMCO are efficient in terms of both memory and retrieval time compared to prior methods. |
format | Online Article Text |
id | pubmed-9030705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-9030705 2022-04-23 Discrete Infomax Codes for Supervised Representation Learning Lee, Yoonho Kim, Wonjae Park, Wonpyo Choi, Seungjin Entropy (Basel) Article For high-dimensional data such as images, learning an encoder that can output a compact yet informative representation is a key task on its own, in addition to facilitating subsequent processing of data. We present a model that produces discrete infomax codes (DIMCO); we train a probabilistic encoder that yields k-way d-dimensional codes associated with input data. Our model maximizes the mutual information between codes and ground-truth class labels, with a regularization which encourages entries of a codeword to be statistically independent. In this context, we show that the infomax principle also justifies existing loss functions, such as cross-entropy as its special cases. Our analysis also shows that using shorter codes reduces overfitting in the context of few-shot classification, and our various experiments show this implicit task-level regularization effect of DIMCO. Furthermore, we show that the codes learned by DIMCO are efficient in terms of both memory and retrieval time compared to prior methods. MDPI 2022-04-02 /pmc/articles/PMC9030705/ /pubmed/35455164 http://dx.doi.org/10.3390/e24040501 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article; Lee, Yoonho; Kim, Wonjae; Park, Wonpyo; Choi, Seungjin; Discrete Infomax Codes for Supervised Representation Learning |
title | Discrete Infomax Codes for Supervised Representation Learning |
title_full | Discrete Infomax Codes for Supervised Representation Learning |
title_fullStr | Discrete Infomax Codes for Supervised Representation Learning |
title_full_unstemmed | Discrete Infomax Codes for Supervised Representation Learning |
title_short | Discrete Infomax Codes for Supervised Representation Learning |
title_sort | discrete infomax codes for supervised representation learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9030705/ https://www.ncbi.nlm.nih.gov/pubmed/35455164 http://dx.doi.org/10.3390/e24040501 |
work_keys_str_mv | AT leeyoonho discreteinfomaxcodesforsupervisedrepresentationlearning AT kimwonjae discreteinfomaxcodesforsupervisedrepresentationlearning AT parkwonpyo discreteinfomaxcodesforsupervisedrepresentationlearning AT choiseungjin discreteinfomaxcodesforsupervisedrepresentationlearning |