Cargando…
Active inference and learning
This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agent...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Pergamon Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167251/ https://www.ncbi.nlm.nih.gov/pubmed/27375276 http://dx.doi.org/10.1016/j.neubiorev.2016.06.022 |
_version_ | 1782483145215442944 |
---|---|
author | Friston, Karl FitzGerald, Thomas Rigoli, Francesco Schwartenbeck, Philipp O’Doherty, John Pezzulo, Giovanni |
author_facet | Friston, Karl FitzGerald, Thomas Rigoli, Francesco Schwartenbeck, Philipp O’Doherty, John Pezzulo, Giovanni |
author_sort | Friston, Karl |
collection | PubMed |
description | This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity. |
format | Online Article Text |
id | pubmed-5167251 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Pergamon Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-51672512016-12-22 Active inference and learning Friston, Karl FitzGerald, Thomas Rigoli, Francesco Schwartenbeck, Philipp O’Doherty, John Pezzulo, Giovanni Neurosci Biobehav Rev Article This paper offers an active inference account of choice behaviour and learning. It focuses on the distinction between goal-directed and habitual behaviour and how they contextualise each other. We show that habits emerge naturally (and autodidactically) from sequential policy optimisation when agents are equipped with state-action policies. In active inference, behaviour has explorative (epistemic) and exploitative (pragmatic) aspects that are sensitive to ambiguity and risk respectively, where epistemic (ambiguity-resolving) behaviour enables pragmatic (reward-seeking) behaviour and the subsequent emergence of habits. Although goal-directed and habitual policies are usually associated with model-based and model-free schemes, we find the more important distinction is between belief-free and belief-based schemes. The underlying (variational) belief updating provides a comprehensive (if metaphorical) process theory for several phenomena, including the transfer of dopamine responses, reversal learning, habit formation and devaluation. Finally, we show that active inference reduces to a classical (Bellman) scheme, in the absence of ambiguity. Pergamon Press 2016-09 /pmc/articles/PMC5167251/ /pubmed/27375276 http://dx.doi.org/10.1016/j.neubiorev.2016.06.022 Text en © 2016 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Friston, Karl FitzGerald, Thomas Rigoli, Francesco Schwartenbeck, Philipp O’Doherty, John Pezzulo, Giovanni Active inference and learning |
title | Active inference and learning |
title_full | Active inference and learning |
title_fullStr | Active inference and learning |
title_full_unstemmed | Active inference and learning |
title_short | Active inference and learning |
title_sort | active inference and learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167251/ https://www.ncbi.nlm.nih.gov/pubmed/27375276 http://dx.doi.org/10.1016/j.neubiorev.2016.06.022 |
work_keys_str_mv | AT fristonkarl activeinferenceandlearning AT fitzgeraldthomas activeinferenceandlearning AT rigolifrancesco activeinferenceandlearning AT schwartenbeckphilipp activeinferenceandlearning AT odohertyjohn activeinferenceandlearning AT pezzulogiovanni activeinferenceandlearning |