Cargando…
Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales
Cognitive control is typically understood as a set of mechanisms that enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine behaviour or making choices beneficial only at short time scales would prevent one from attainin...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8208938/ https://www.ncbi.nlm.nih.gov/pubmed/33372237 http://dx.doi.org/10.3758/s13415-020-00837-x |
_version_ | 1783709024437403648 |
---|---|
author | Marković, Dimitrije Goschke, Thomas Kiebel, Stefan J. |
author_facet | Marković, Dimitrije Goschke, Thomas Kiebel, Stefan J. |
author_sort | Marković, Dimitrije |
collection | PubMed |
description | Cognitive control is typically understood as a set of mechanisms that enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine behaviour or making choices beneficial only at short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive. Importantly, it remains poorly understood how uncertain consequences at different time scales are integrated into adaptive decisions. Here, we pursue the idea that cognitive control can be cast as active inference over a hierarchy of time scales, where inference, i.e., planning, at higher levels of the hierarchy controls inference at lower levels. We introduce the novel concept of meta-control states, which link higher-level beliefs with lower-level policy inference. Specifically, we conceptualize cognitive control as inference over these meta-control states, where solutions to cognitive control dilemmas emerge through surprisal minimisation at different hierarchy levels. We illustrate this concept using the exploration-exploitation dilemma based on a variant of a restless multi-armed bandit task. We demonstrate that beliefs about contexts and meta-control states at a higher level dynamically modulate the balance of exploration and exploitation at the lower level of a single action. Finally, we discuss the generalisation of this meta-control concept to other control dilemmas. |
format | Online Article Text |
id | pubmed-8208938 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-82089382021-07-01 Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales Marković, Dimitrije Goschke, Thomas Kiebel, Stefan J. Cogn Affect Behav Neurosci Article Cognitive control is typically understood as a set of mechanisms that enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine behaviour or making choices beneficial only at short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive. Importantly, it remains poorly understood how uncertain consequences at different time scales are integrated into adaptive decisions. Here, we pursue the idea that cognitive control can be cast as active inference over a hierarchy of time scales, where inference, i.e., planning, at higher levels of the hierarchy controls inference at lower levels. We introduce the novel concept of meta-control states, which link higher-level beliefs with lower-level policy inference. Specifically, we conceptualize cognitive control as inference over these meta-control states, where solutions to cognitive control dilemmas emerge through surprisal minimisation at different hierarchy levels. We illustrate this concept using the exploration-exploitation dilemma based on a variant of a restless multi-armed bandit task. We demonstrate that beliefs about contexts and meta-control states at a higher level dynamically modulate the balance of exploration and exploitation at the lower level of a single action. Finally, we discuss the generalisation of this meta-control concept to other control dilemmas. Springer US 2020-12-28 2021 /pmc/articles/PMC8208938/ /pubmed/33372237 http://dx.doi.org/10.3758/s13415-020-00837-x Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Marković, Dimitrije Goschke, Thomas Kiebel, Stefan J. Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title | Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title_full | Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title_fullStr | Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title_full_unstemmed | Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title_short | Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
title_sort | meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8208938/ https://www.ncbi.nlm.nih.gov/pubmed/33372237 http://dx.doi.org/10.3758/s13415-020-00837-x |
work_keys_str_mv | AT markovicdimitrije metacontroloftheexplorationexploitationdilemmaemergesfromprobabilisticinferenceoverahierarchyoftimescales AT goschkethomas metacontroloftheexplorationexploitationdilemmaemergesfromprobabilisticinferenceoverahierarchyoftimescales AT kiebelstefanj metacontroloftheexplorationexploitationdilemmaemergesfromprobabilisticinferenceoverahierarchyoftimescales |