Cargando…
Active Inference and Epistemic Value in Graphical Models
The Free Energy Principle (FEP) postulates that biological agents perceive and interact with their environment in order to minimize a Variational Free Energy (VFE) with respect to a generative model of their environment. The inference of a policy (future control sequence) according to the FEP is kno...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9019474/ https://www.ncbi.nlm.nih.gov/pubmed/35462780 http://dx.doi.org/10.3389/frobt.2022.794464 |
_version_ | 1784689290361438208 |
---|---|
author | van de Laar, Thijs Koudahl, Magnus van Erp, Bart de Vries, Bert |
author_facet | van de Laar, Thijs Koudahl, Magnus van Erp, Bart de Vries, Bert |
author_sort | van de Laar, Thijs |
collection | PubMed |
description | The Free Energy Principle (FEP) postulates that biological agents perceive and interact with their environment in order to minimize a Variational Free Energy (VFE) with respect to a generative model of their environment. The inference of a policy (future control sequence) according to the FEP is known as Active Inference (AIF). The AIF literature describes multiple VFE objectives for policy planning that lead to epistemic (information-seeking) behavior. However, most objectives have limited modeling flexibility. This paper approaches epistemic behavior from a constrained Bethe Free Energy (CBFE) perspective. Crucially, variational optimization of the CBFE can be expressed in terms of message passing on free-form generative models. The key intuition behind the CBFE is that we impose a point-mass constraint on predicted outcomes, which explicitly encodes the assumption that the agent will make observations in the future. We interpret the CBFE objective in terms of its constituent behavioral drives. We then illustrate resulting behavior of the CBFE by planning and interacting with a simulated T-maze environment. Simulations for the T-maze task illustrate how the CBFE agent exhibits an epistemic drive, and actively plans ahead to account for the impact of predicted outcomes. Compared to an EFE agent, the CBFE agent incurs expected reward in significantly more environmental scenarios. We conclude that CBFE optimization by message passing suggests a general mechanism for epistemic-aware AIF in free-form generative models. |
format | Online Article Text |
id | pubmed-9019474 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90194742022-04-21 Active Inference and Epistemic Value in Graphical Models van de Laar, Thijs Koudahl, Magnus van Erp, Bart de Vries, Bert Front Robot AI Robotics and AI The Free Energy Principle (FEP) postulates that biological agents perceive and interact with their environment in order to minimize a Variational Free Energy (VFE) with respect to a generative model of their environment. The inference of a policy (future control sequence) according to the FEP is known as Active Inference (AIF). The AIF literature describes multiple VFE objectives for policy planning that lead to epistemic (information-seeking) behavior. However, most objectives have limited modeling flexibility. This paper approaches epistemic behavior from a constrained Bethe Free Energy (CBFE) perspective. Crucially, variational optimization of the CBFE can be expressed in terms of message passing on free-form generative models. The key intuition behind the CBFE is that we impose a point-mass constraint on predicted outcomes, which explicitly encodes the assumption that the agent will make observations in the future. We interpret the CBFE objective in terms of its constituent behavioral drives. We then illustrate resulting behavior of the CBFE by planning and interacting with a simulated T-maze environment. Simulations for the T-maze task illustrate how the CBFE agent exhibits an epistemic drive, and actively plans ahead to account for the impact of predicted outcomes. Compared to an EFE agent, the CBFE agent incurs expected reward in significantly more environmental scenarios. We conclude that CBFE optimization by message passing suggests a general mechanism for epistemic-aware AIF in free-form generative models. Frontiers Media S.A. 2022-04-06 /pmc/articles/PMC9019474/ /pubmed/35462780 http://dx.doi.org/10.3389/frobt.2022.794464 Text en Copyright © 2022 van de Laar, Koudahl, van Erp and de Vries. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI van de Laar, Thijs Koudahl, Magnus van Erp, Bart de Vries, Bert Active Inference and Epistemic Value in Graphical Models |
title | Active Inference and Epistemic Value in Graphical Models |
title_full | Active Inference and Epistemic Value in Graphical Models |
title_fullStr | Active Inference and Epistemic Value in Graphical Models |
title_full_unstemmed | Active Inference and Epistemic Value in Graphical Models |
title_short | Active Inference and Epistemic Value in Graphical Models |
title_sort | active inference and epistemic value in graphical models |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9019474/ https://www.ncbi.nlm.nih.gov/pubmed/35462780 http://dx.doi.org/10.3389/frobt.2022.794464 |
work_keys_str_mv | AT vandelaarthijs activeinferenceandepistemicvalueingraphicalmodels AT koudahlmagnus activeinferenceandepistemicvalueingraphicalmodels AT vanerpbart activeinferenceandepistemicvalueingraphicalmodels AT devriesbert activeinferenceandepistemicvalueingraphicalmodels |