Cargando…

Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes

A fundamental problem faced by animals is learning to select actions based on noisy sensory information and incomplete knowledge of the world. It has been suggested that the brain engages in Bayesian inference during perception but how such probabilistic representations are used to select actions ha...

Descripción completa

Detalles Bibliográficos
Autor principal: Rao, Rajesh P. N.
Formato: Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2998859/
https://www.ncbi.nlm.nih.gov/pubmed/21152255
http://dx.doi.org/10.3389/fncom.2010.00146
_version_ 1782193406694391808
author Rao, Rajesh P. N.
author_facet Rao, Rajesh P. N.
author_sort Rao, Rajesh P. N.
collection PubMed
description A fundamental problem faced by animals is learning to select actions based on noisy sensory information and incomplete knowledge of the world. It has been suggested that the brain engages in Bayesian inference during perception but how such probabilistic representations are used to select actions has remained unclear. Here we propose a neural model of action selection and decision making based on the theory of partially observable Markov decision processes (POMDPs). Actions are selected based not on a single “optimal” estimate of state but on the posterior distribution over states (the “belief” state). We show how such a model provides a unified framework for explaining experimental results in decision making that involve both information gathering and overt actions. The model utilizes temporal difference (TD) learning for maximizing expected reward. The resulting neural architecture posits an active role for the neocortex in belief computation while ascribing a role to the basal ganglia in belief representation, value computation, and action selection. When applied to the random dots motion discrimination task, model neurons representing belief exhibit responses similar to those of LIP neurons in primate neocortex. The appropriate threshold for switching from information gathering to overt actions emerges naturally during reward maximization. Additionally, the time course of reward prediction error in the model shares similarities with dopaminergic responses in the basal ganglia during the random dots task. For tasks with a deadline, the model learns a decision making strategy that changes with elapsed time, predicting a collapsing decision threshold consistent with some experimental studies. The model provides a new framework for understanding neural decision making and suggests an important role for interactions between the neocortex and the basal ganglia in learning the mapping between probabilistic sensory representations and actions that maximize rewards.
format Text
id pubmed-2998859
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-29988592010-12-09 Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes Rao, Rajesh P. N. Front Comput Neurosci Neuroscience A fundamental problem faced by animals is learning to select actions based on noisy sensory information and incomplete knowledge of the world. It has been suggested that the brain engages in Bayesian inference during perception but how such probabilistic representations are used to select actions has remained unclear. Here we propose a neural model of action selection and decision making based on the theory of partially observable Markov decision processes (POMDPs). Actions are selected based not on a single “optimal” estimate of state but on the posterior distribution over states (the “belief” state). We show how such a model provides a unified framework for explaining experimental results in decision making that involve both information gathering and overt actions. The model utilizes temporal difference (TD) learning for maximizing expected reward. The resulting neural architecture posits an active role for the neocortex in belief computation while ascribing a role to the basal ganglia in belief representation, value computation, and action selection. When applied to the random dots motion discrimination task, model neurons representing belief exhibit responses similar to those of LIP neurons in primate neocortex. The appropriate threshold for switching from information gathering to overt actions emerges naturally during reward maximization. Additionally, the time course of reward prediction error in the model shares similarities with dopaminergic responses in the basal ganglia during the random dots task. For tasks with a deadline, the model learns a decision making strategy that changes with elapsed time, predicting a collapsing decision threshold consistent with some experimental studies. The model provides a new framework for understanding neural decision making and suggests an important role for interactions between the neocortex and the basal ganglia in learning the mapping between probabilistic sensory representations and actions that maximize rewards. Frontiers Research Foundation 2010-11-24 /pmc/articles/PMC2998859/ /pubmed/21152255 http://dx.doi.org/10.3389/fncom.2010.00146 Text en Copyright © 2010 Rao. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
spellingShingle Neuroscience
Rao, Rajesh P. N.
Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title_full Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title_fullStr Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title_full_unstemmed Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title_short Decision Making Under Uncertainty: A Neural Model Based on Partially Observable Markov Decision Processes
title_sort decision making under uncertainty: a neural model based on partially observable markov decision processes
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2998859/
https://www.ncbi.nlm.nih.gov/pubmed/21152255
http://dx.doi.org/10.3389/fncom.2010.00146
work_keys_str_mv AT raorajeshpn decisionmakingunderuncertaintyaneuralmodelbasedonpartiallyobservablemarkovdecisionprocesses