Cargando…

A Reinforcement Learning Model of Precommitment in Decision Making

Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kurth-Nelson, Zeb, Redish, A. David
Formato: Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004435/
https://www.ncbi.nlm.nih.gov/pubmed/21179584
http://dx.doi.org/10.3389/fnbeh.2010.00184
_version_ 1782193980475179008
author Kurth-Nelson, Zeb
Redish, A. David
author_facet Kurth-Nelson, Zeb
Redish, A. David
author_sort Kurth-Nelson, Zeb
collection PubMed
description Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experimentally by offering a choice between a smaller reward delivered sooner and a larger reward delivered later. Impulsive subjects are more likely to select the smaller-sooner choice; however, when offered an option to precommit, even impulsive subjects can precommit to the larger-later choice. To precommit or not is a decision between two conditions: (A) the original choice (smaller-sooner vs. larger-later), and (B) a new condition with only larger-later available. It has been observed that precommitment appears as a consequence of the preference reversal inherent in non-exponential delay-discounting. Here we show that most models of hyperbolic discounting cannot precommit, but a distributed model of hyperbolic discounting does precommit. Using this model, we find (1) faster discounters may be more or less likely than slow discounters to precommit, depending on the precommitment delay, (2) for a constant smaller-sooner vs. larger-later preference, a higher ratio of larger reward to smaller reward increases the probability of precommitment, and (3) precommitment is highly sensitive to the shape of the discount curve. These predictions imply that manipulations that alter the discount curve, such as diet or context, may qualitatively affect precommitment.
format Text
id pubmed-3004435
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-30044352010-12-21 A Reinforcement Learning Model of Precommitment in Decision Making Kurth-Nelson, Zeb Redish, A. David Front Behav Neurosci Neuroscience Addiction and many other disorders are linked to impulsivity, where a suboptimal choice is preferred when it is immediately available. One solution to impulsivity is precommitment: constraining one's future to avoid being offered a suboptimal choice. A form of impulsivity can be measured experimentally by offering a choice between a smaller reward delivered sooner and a larger reward delivered later. Impulsive subjects are more likely to select the smaller-sooner choice; however, when offered an option to precommit, even impulsive subjects can precommit to the larger-later choice. To precommit or not is a decision between two conditions: (A) the original choice (smaller-sooner vs. larger-later), and (B) a new condition with only larger-later available. It has been observed that precommitment appears as a consequence of the preference reversal inherent in non-exponential delay-discounting. Here we show that most models of hyperbolic discounting cannot precommit, but a distributed model of hyperbolic discounting does precommit. Using this model, we find (1) faster discounters may be more or less likely than slow discounters to precommit, depending on the precommitment delay, (2) for a constant smaller-sooner vs. larger-later preference, a higher ratio of larger reward to smaller reward increases the probability of precommitment, and (3) precommitment is highly sensitive to the shape of the discount curve. These predictions imply that manipulations that alter the discount curve, such as diet or context, may qualitatively affect precommitment. Frontiers Research Foundation 2010-12-14 /pmc/articles/PMC3004435/ /pubmed/21179584 http://dx.doi.org/10.3389/fnbeh.2010.00184 Text en Copyright © 2010 Kurth-Nelson and Redish. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
spellingShingle Neuroscience
Kurth-Nelson, Zeb
Redish, A. David
A Reinforcement Learning Model of Precommitment in Decision Making
title A Reinforcement Learning Model of Precommitment in Decision Making
title_full A Reinforcement Learning Model of Precommitment in Decision Making
title_fullStr A Reinforcement Learning Model of Precommitment in Decision Making
title_full_unstemmed A Reinforcement Learning Model of Precommitment in Decision Making
title_short A Reinforcement Learning Model of Precommitment in Decision Making
title_sort reinforcement learning model of precommitment in decision making
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3004435/
https://www.ncbi.nlm.nih.gov/pubmed/21179584
http://dx.doi.org/10.3389/fnbeh.2010.00184
work_keys_str_mv AT kurthnelsonzeb areinforcementlearningmodelofprecommitmentindecisionmaking
AT redishadavid areinforcementlearningmodelofprecommitmentindecisionmaking
AT kurthnelsonzeb reinforcementlearningmodelofprecommitmentindecisionmaking
AT redishadavid reinforcementlearningmodelofprecommitmentindecisionmaking