Cargando…

Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization

Learning invariant (causal) features for out-of-distribution (OOD) generalization have attracted extensive attention recently, and among the proposals, invariant risk minimization (IRM) is a notable solution. In spite of its theoretical promise for linear regression, the challenges of using IRM in l...

Descripción completa

Detalles Bibliográficos
Autores principales: Deng, Bin, Jia, Kui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955031/
https://www.ncbi.nlm.nih.gov/pubmed/36832560
http://dx.doi.org/10.3390/e25020193
_version_ 1784894256556539904
author Deng, Bin
Jia, Kui
author_facet Deng, Bin
Jia, Kui
author_sort Deng, Bin
collection PubMed
description Learning invariant (causal) features for out-of-distribution (OOD) generalization have attracted extensive attention recently, and among the proposals, invariant risk minimization (IRM) is a notable solution. In spite of its theoretical promise for linear regression, the challenges of using IRM in linear classification problems remain. By introducing the information bottleneck (IB) principle into the learning of IRM, the IB-IRM approach has demonstrated its power to solve these challenges. In this paper, we further improve IB-IRM from two aspects. First, we show that the key assumption of support overlap of invariant features used in IB-IRM guarantees OOD generalization, and it is still possible to achieve the optimal solution without this assumption. Second, we illustrate two failure modes where IB-IRM (and IRM) could fail in learning the invariant features, and to address such failures, we propose a Counterfactual Supervision-based Information Bottleneck (CSIB) learning algorithm that recovers the invariant features. By requiring counterfactual inference, CSIB works even when accessing data from a single environment. Empirical experiments on several datasets verify our theoretical results.
format Online
Article
Text
id pubmed-9955031
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99550312023-02-25 Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization Deng, Bin Jia, Kui Entropy (Basel) Article Learning invariant (causal) features for out-of-distribution (OOD) generalization have attracted extensive attention recently, and among the proposals, invariant risk minimization (IRM) is a notable solution. In spite of its theoretical promise for linear regression, the challenges of using IRM in linear classification problems remain. By introducing the information bottleneck (IB) principle into the learning of IRM, the IB-IRM approach has demonstrated its power to solve these challenges. In this paper, we further improve IB-IRM from two aspects. First, we show that the key assumption of support overlap of invariant features used in IB-IRM guarantees OOD generalization, and it is still possible to achieve the optimal solution without this assumption. Second, we illustrate two failure modes where IB-IRM (and IRM) could fail in learning the invariant features, and to address such failures, we propose a Counterfactual Supervision-based Information Bottleneck (CSIB) learning algorithm that recovers the invariant features. By requiring counterfactual inference, CSIB works even when accessing data from a single environment. Empirical experiments on several datasets verify our theoretical results. MDPI 2023-01-18 /pmc/articles/PMC9955031/ /pubmed/36832560 http://dx.doi.org/10.3390/e25020193 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Deng, Bin
Jia, Kui
Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title_full Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title_fullStr Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title_full_unstemmed Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title_short Counterfactual Supervision-Based Information Bottleneck for Out-of-Distribution Generalization
title_sort counterfactual supervision-based information bottleneck for out-of-distribution generalization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955031/
https://www.ncbi.nlm.nih.gov/pubmed/36832560
http://dx.doi.org/10.3390/e25020193
work_keys_str_mv AT dengbin counterfactualsupervisionbasedinformationbottleneckforoutofdistributiongeneralization
AT jiakui counterfactualsupervisionbasedinformationbottleneckforoutofdistributiongeneralization