Cargando…

Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost

Finding the right amount of deliberation, between insufficient and excessive, is a hard decision making problem that depends on the value we place on our time. Average-reward, putatively encoded by tonic dopamine, serves in existing reinforcement learning theory as the opportunity cost of time, incl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Puelma Touzel, Maximilian, Cisek, Paul, Lajoie, Guillaume
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176815/ https://www.ncbi.nlm.nih.gov/pubmed/35617370 http://dx.doi.org/10.1371/journal.pcbi.1010080

_version_	1784722752509313024
author	Puelma Touzel, Maximilian Cisek, Paul Lajoie, Guillaume
author_facet	Puelma Touzel, Maximilian Cisek, Paul Lajoie, Guillaume
author_sort	Puelma Touzel, Maximilian
collection	PubMed
description	Finding the right amount of deliberation, between insufficient and excessive, is a hard decision making problem that depends on the value we place on our time. Average-reward, putatively encoded by tonic dopamine, serves in existing reinforcement learning theory as the opportunity cost of time, including deliberation time. Importantly, this cost can itself vary with the environmental context and is not trivial to estimate. Here, we propose how the opportunity cost of deliberation can be estimated adaptively on multiple timescales to account for non-stationary contextual factors. We use it in a simple decision-making heuristic based on average-reward reinforcement learning (AR-RL) that we call Performance-Gated Deliberation (PGD). We propose PGD as a strategy used by animals wherein deliberation cost is implemented directly as urgency, a previously characterized neural signal effectively controlling the speed of the decision-making process. We show PGD outperforms AR-RL solutions in explaining behaviour and urgency of non-human primates in a context-varying random walk prediction task and is consistent with relative performance and urgency in a context-varying random dot motion task. We make readily testable predictions for both neural activity and behaviour.
format	Online Article Text
id	pubmed-9176815
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-91768152022-06-09 Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost Puelma Touzel, Maximilian Cisek, Paul Lajoie, Guillaume PLoS Comput Biol Research Article Finding the right amount of deliberation, between insufficient and excessive, is a hard decision making problem that depends on the value we place on our time. Average-reward, putatively encoded by tonic dopamine, serves in existing reinforcement learning theory as the opportunity cost of time, including deliberation time. Importantly, this cost can itself vary with the environmental context and is not trivial to estimate. Here, we propose how the opportunity cost of deliberation can be estimated adaptively on multiple timescales to account for non-stationary contextual factors. We use it in a simple decision-making heuristic based on average-reward reinforcement learning (AR-RL) that we call Performance-Gated Deliberation (PGD). We propose PGD as a strategy used by animals wherein deliberation cost is implemented directly as urgency, a previously characterized neural signal effectively controlling the speed of the decision-making process. We show PGD outperforms AR-RL solutions in explaining behaviour and urgency of non-human primates in a context-varying random walk prediction task and is consistent with relative performance and urgency in a context-varying random dot motion task. We make readily testable predictions for both neural activity and behaviour. Public Library of Science 2022-05-26 /pmc/articles/PMC9176815/ /pubmed/35617370 http://dx.doi.org/10.1371/journal.pcbi.1010080 Text en © 2022 Puelma Touzel et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Puelma Touzel, Maximilian Cisek, Paul Lajoie, Guillaume Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title	Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title_full	Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title_fullStr	Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title_full_unstemmed	Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title_short	Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost
title_sort	performance-gated deliberation: a context-adapted strategy in which urgency is opportunity cost
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176815/ https://www.ncbi.nlm.nih.gov/pubmed/35617370 http://dx.doi.org/10.1371/journal.pcbi.1010080
work_keys_str_mv	AT puelmatouzelmaximilian performancegateddeliberationacontextadaptedstrategyinwhichurgencyisopportunitycost AT cisekpaul performancegateddeliberationacontextadaptedstrategyinwhichurgencyisopportunitycost AT lajoieguillaume performancegateddeliberationacontextadaptedstrategyinwhichurgencyisopportunitycost

Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost

Ejemplares similares