
Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task

Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and o...


Bibliographic Details
Main Authors:	Horvath, Lilla; Colcombe, Stanley; Milham, Michael; Ray, Shruti; Schwartenbeck, Philipp; Ostwald, Dirk
Format:	Online Article Text
Language:	English
Published:	Springer International Publishing 2021
Subjects:	Original Paper
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8327602/ https://www.ncbi.nlm.nih.gov/pubmed/34368622 http://dx.doi.org/10.1007/s42113-021-00112-3

author	Horvath, Lilla; Colcombe, Stanley; Milham, Michael; Ray, Shruti; Schwartenbeck, Philipp; Ostwald, Dirk
collection	PubMed
description	Humans often face sequential decision-making problems, in which information about the environmental reward structure is detached from rewards for a subset of actions. In the current exploratory study, we introduce an information-selective symmetric reversal bandit task to model such situations and obtained choice data on this task from 24 participants. To arbitrate between different decision-making strategies that participants may use on this task, we developed a set of probabilistic agent-based behavioral models, including exploitative and explorative Bayesian agents, as well as heuristic control agents. Upon validating the model and parameter recovery properties of our model set and summarizing the participants’ choice data in a descriptive way, we used a maximum likelihood approach to evaluate the participants’ choice data from the perspective of our model set. In brief, we provide quantitative evidence that participants employ a belief state-based hybrid explorative-exploitative strategy on the information-selective symmetric reversal bandit task, lending further support to the finding that humans are guided by their subjective uncertainty when solving exploration-exploitation dilemmas. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s42113-021-00112-3.
format	Online Article Text
id	pubmed-8327602
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-8327602 2021-08-02 Comput Brain Behav Original Paper
Springer International Publishing 2021-08-02 2021 /pmc/articles/PMC8327602/ /pubmed/34368622 http://dx.doi.org/10.1007/s42113-021-00112-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title	Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8327602/ https://www.ncbi.nlm.nih.gov/pubmed/34368622 http://dx.doi.org/10.1007/s42113-021-00112-3
