Cargando…

Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty

A key problem in neuroscience is understanding how the brain makes decisions under uncertainty. Important insights have been gained using tasks such as the random dots motion discrimination task in which the subject makes decisions based on noisy stimuli. A descriptive model known as the drift diffu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huang, Yanping, Rao, Rajesh P. N.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2013
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551910/ https://www.ncbi.nlm.nih.gov/pubmed/23349707 http://dx.doi.org/10.1371/journal.pone.0053344

_version_	1782256639820169216
author	Huang, Yanping Rao, Rajesh P. N.
author_facet	Huang, Yanping Rao, Rajesh P. N.
author_sort	Huang, Yanping
collection	PubMed
description	A key problem in neuroscience is understanding how the brain makes decisions under uncertainty. Important insights have been gained using tasks such as the random dots motion discrimination task in which the subject makes decisions based on noisy stimuli. A descriptive model known as the drift diffusion model has previously been used to explain psychometric and reaction time data from such tasks but to fully explain the data, one is forced to make ad-hoc assumptions such as a time-dependent collapsing decision boundary. We show that such assumptions are unnecessary when decision making is viewed within the framework of partially observable Markov decision processes (POMDPs). We propose an alternative model for decision making based on POMDPs. We show that the motion discrimination task reduces to the problems of (1) computing beliefs (posterior distributions) over the unknown direction and motion strength from noisy observations in a Bayesian manner, and (2) selecting actions based on these beliefs to maximize the expected sum of future rewards. The resulting optimal policy (belief-to-action mapping) is shown to be equivalent to a collapsing decision threshold that governs the switch from evidence accumulation to a discrimination decision. We show that the model accounts for both accuracy and reaction time as a function of stimulus strength as well as different speed-accuracy conditions in the random dots task.
format	Online Article Text
id	pubmed-3551910
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-35519102013-01-24 Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty Huang, Yanping Rao, Rajesh P. N. PLoS One Research Article A key problem in neuroscience is understanding how the brain makes decisions under uncertainty. Important insights have been gained using tasks such as the random dots motion discrimination task in which the subject makes decisions based on noisy stimuli. A descriptive model known as the drift diffusion model has previously been used to explain psychometric and reaction time data from such tasks but to fully explain the data, one is forced to make ad-hoc assumptions such as a time-dependent collapsing decision boundary. We show that such assumptions are unnecessary when decision making is viewed within the framework of partially observable Markov decision processes (POMDPs). We propose an alternative model for decision making based on POMDPs. We show that the motion discrimination task reduces to the problems of (1) computing beliefs (posterior distributions) over the unknown direction and motion strength from noisy observations in a Bayesian manner, and (2) selecting actions based on these beliefs to maximize the expected sum of future rewards. The resulting optimal policy (belief-to-action mapping) is shown to be equivalent to a collapsing decision threshold that governs the switch from evidence accumulation to a discrimination decision. We show that the model accounts for both accuracy and reaction time as a function of stimulus strength as well as different speed-accuracy conditions in the random dots task. Public Library of Science 2013-01-22 /pmc/articles/PMC3551910/ /pubmed/23349707 http://dx.doi.org/10.1371/journal.pone.0053344 Text en © 2013 Rao, Huang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Huang, Yanping Rao, Rajesh P. N. Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title	Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title_full	Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title_fullStr	Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title_full_unstemmed	Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title_short	Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty
title_sort	reward optimization in the primate brain: a probabilistic model of decision making under uncertainty
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3551910/ https://www.ncbi.nlm.nih.gov/pubmed/23349707 http://dx.doi.org/10.1371/journal.pone.0053344
work_keys_str_mv	AT huangyanping rewardoptimizationintheprimatebrainaprobabilisticmodelofdecisionmakingunderuncertainty AT raorajeshpn rewardoptimizationintheprimatebrainaprobabilisticmodelofdecisionmakingunderuncertainty

Reward Optimization in the Primate Brain: A Probabilistic Model of Decision Making under Uncertainty

Ejemplares similares