Cargando…

Theory of Choice in Bandit, Information Sampling and Foraging Tasks

Decision making has been studied with a wide array of tasks. Here we examine the theoretical structure of bandit, information sampling and foraging tasks. These tasks move beyond tasks where the choice in the current trial does not affect future expected rewards. We have modeled these tasks using Ma...

Descripción completa

Detalles Bibliográficos
Autor principal:	Averbeck, Bruno B.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376795/ https://www.ncbi.nlm.nih.gov/pubmed/25815510 http://dx.doi.org/10.1371/journal.pcbi.1004164

_version_	1782363788584943616
author	Averbeck, Bruno B.
author_facet	Averbeck, Bruno B.
author_sort	Averbeck, Bruno B.
collection	PubMed
description	Decision making has been studied with a wide array of tasks. Here we examine the theoretical structure of bandit, information sampling and foraging tasks. These tasks move beyond tasks where the choice in the current trial does not affect future expected rewards. We have modeled these tasks using Markov decision processes (MDPs). MDPs provide a general framework for modeling tasks in which decisions affect the information on which future choices will be made. Under the assumption that agents are maximizing expected rewards, MDPs provide normative solutions. We find that all three classes of tasks pose choices among actions which trade-off immediate and future expected rewards. The tasks drive these trade-offs in unique ways, however. For bandit and information sampling tasks, increasing uncertainty or the time horizon shifts value to actions that pay-off in the future. Correspondingly, decreasing uncertainty increases the relative value of actions that pay-off immediately. For foraging tasks the time-horizon plays the dominant role, as choices do not affect future uncertainty in these tasks.
format	Online Article Text
id	pubmed-4376795
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-43767952015-04-04 Theory of Choice in Bandit, Information Sampling and Foraging Tasks Averbeck, Bruno B. PLoS Comput Biol Research Article Decision making has been studied with a wide array of tasks. Here we examine the theoretical structure of bandit, information sampling and foraging tasks. These tasks move beyond tasks where the choice in the current trial does not affect future expected rewards. We have modeled these tasks using Markov decision processes (MDPs). MDPs provide a general framework for modeling tasks in which decisions affect the information on which future choices will be made. Under the assumption that agents are maximizing expected rewards, MDPs provide normative solutions. We find that all three classes of tasks pose choices among actions which trade-off immediate and future expected rewards. The tasks drive these trade-offs in unique ways, however. For bandit and information sampling tasks, increasing uncertainty or the time horizon shifts value to actions that pay-off in the future. Correspondingly, decreasing uncertainty increases the relative value of actions that pay-off immediately. For foraging tasks the time-horizon plays the dominant role, as choices do not affect future uncertainty in these tasks. Public Library of Science 2015-03-27 /pmc/articles/PMC4376795/ /pubmed/25815510 http://dx.doi.org/10.1371/journal.pcbi.1004164 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle	Research Article Averbeck, Bruno B. Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title	Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title_full	Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title_fullStr	Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title_full_unstemmed	Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title_short	Theory of Choice in Bandit, Information Sampling and Foraging Tasks
title_sort	theory of choice in bandit, information sampling and foraging tasks
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376795/ https://www.ncbi.nlm.nih.gov/pubmed/25815510 http://dx.doi.org/10.1371/journal.pcbi.1004164
work_keys_str_mv	AT averbeckbrunob theoryofchoiceinbanditinformationsamplingandforagingtasks

Theory of Choice in Bandit, Information Sampling and Foraging Tasks

Ejemplares similares