Cargando…

Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach

Canonical Correlation Analysis (CCA) is a linear representation learning method that seeks maximally correlated variables in multi-view data. Nonlinear CCA extends this notion to a broader family of transformations, which are more powerful in many real-world applications. Given the joint probability...

Descripción completa

Detalles Bibliográficos
Autores principales: Painsky, Amichai, Feder, Meir, Tishby, Naftali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516638/
https://www.ncbi.nlm.nih.gov/pubmed/33285982
http://dx.doi.org/10.3390/e22020208
_version_ 1783587047192133632
author Painsky, Amichai
Feder, Meir
Tishby, Naftali
author_facet Painsky, Amichai
Feder, Meir
Tishby, Naftali
author_sort Painsky, Amichai
collection PubMed
description Canonical Correlation Analysis (CCA) is a linear representation learning method that seeks maximally correlated variables in multi-view data. Nonlinear CCA extends this notion to a broader family of transformations, which are more powerful in many real-world applications. Given the joint probability, the Alternating Conditional Expectation (ACE) algorithm provides an optimal solution to the nonlinear CCA problem. However, it suffers from limited performance and an increasing computational burden when only a finite number of samples is available. In this work, we introduce an information-theoretic compressed representation framework for the nonlinear CCA problem (CRCCA), which extends the classical ACE approach. Our suggested framework seeks compact representations of the data that allow a maximal level of correlation. This way, we control the trade-off between the flexibility and the complexity of the model. CRCCA provides theoretical bounds and optimality conditions, as we establish fundamental connections to rate-distortion theory, the information bottleneck and remote source coding. In addition, it allows a soft dimensionality reduction, as the compression level is determined by the mutual information between the original noisy data and the extracted signals. Finally, we introduce a simple implementation of the CRCCA framework, based on lattice quantization.
format Online
Article
Text
id pubmed-7516638
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75166382020-11-09 Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach Painsky, Amichai Feder, Meir Tishby, Naftali Entropy (Basel) Article Canonical Correlation Analysis (CCA) is a linear representation learning method that seeks maximally correlated variables in multi-view data. Nonlinear CCA extends this notion to a broader family of transformations, which are more powerful in many real-world applications. Given the joint probability, the Alternating Conditional Expectation (ACE) algorithm provides an optimal solution to the nonlinear CCA problem. However, it suffers from limited performance and an increasing computational burden when only a finite number of samples is available. In this work, we introduce an information-theoretic compressed representation framework for the nonlinear CCA problem (CRCCA), which extends the classical ACE approach. Our suggested framework seeks compact representations of the data that allow a maximal level of correlation. This way, we control the trade-off between the flexibility and the complexity of the model. CRCCA provides theoretical bounds and optimality conditions, as we establish fundamental connections to rate-distortion theory, the information bottleneck and remote source coding. In addition, it allows a soft dimensionality reduction, as the compression level is determined by the mutual information between the original noisy data and the extracted signals. Finally, we introduce a simple implementation of the CRCCA framework, based on lattice quantization. MDPI 2020-02-12 /pmc/articles/PMC7516638/ /pubmed/33285982 http://dx.doi.org/10.3390/e22020208 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Painsky, Amichai
Feder, Meir
Tishby, Naftali
Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title_full Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title_fullStr Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title_full_unstemmed Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title_short Nonlinear Canonical Correlation Analysis:A Compressed Representation Approach
title_sort nonlinear canonical correlation analysis:a compressed representation approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516638/
https://www.ncbi.nlm.nih.gov/pubmed/33285982
http://dx.doi.org/10.3390/e22020208
work_keys_str_mv AT painskyamichai nonlinearcanonicalcorrelationanalysisacompressedrepresentationapproach
AT federmeir nonlinearcanonicalcorrelationanalysisacompressedrepresentationapproach
AT tishbynaftali nonlinearcanonicalcorrelationanalysisacompressedrepresentationapproach