Cargando…

The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons

In evaluating our choices, we often suffer from two tragic relativities. First, when our lives change for the better, we rapidly habituate to the higher standard of living. Second, we cannot escape comparing ourselves to various relative standards. Habituation and comparisons can be very disruptive...

Descripción completa

Detalles Bibliográficos
Autores principales: Dubey, Rachit, Griffiths, Thomas L., Dayan, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9352009/
https://www.ncbi.nlm.nih.gov/pubmed/35925875
http://dx.doi.org/10.1371/journal.pcbi.1010316
_version_ 1784762557113827328
author Dubey, Rachit
Griffiths, Thomas L.
Dayan, Peter
author_facet Dubey, Rachit
Griffiths, Thomas L.
Dayan, Peter
author_sort Dubey, Rachit
collection PubMed
description In evaluating our choices, we often suffer from two tragic relativities. First, when our lives change for the better, we rapidly habituate to the higher standard of living. Second, we cannot escape comparing ourselves to various relative standards. Habituation and comparisons can be very disruptive to decision-making and happiness, and till date, it remains a puzzle why they have come to be a part of cognition in the first place. Here, we present computational evidence that suggests that these features might play an important role in promoting adaptive behavior. Using the framework of reinforcement learning, we explore the benefit of employing a reward function that, in addition to the reward provided by the underlying task, also depends on prior expectations and relative comparisons. We find that while agents equipped with this reward function are less happy, they learn faster and significantly outperform standard reward-based agents in a wide range of environments. Specifically, we find that relative comparisons speed up learning by providing an exploration incentive to the agents, and prior expectations serve as a useful aid to comparisons, especially in sparsely-rewarded and non-stationary environments. Our simulations also reveal potential drawbacks of this reward function and show that agents perform sub-optimally when comparisons are left unchecked and when there are too many similar options. Together, our results help explain why we are prone to becoming trapped in a cycle of never-ending wants and desires, and may shed light on psychopathologies such as depression, materialism, and overconsumption.
format Online
Article
Text
id pubmed-9352009
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-93520092022-08-05 The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons Dubey, Rachit Griffiths, Thomas L. Dayan, Peter PLoS Comput Biol Research Article In evaluating our choices, we often suffer from two tragic relativities. First, when our lives change for the better, we rapidly habituate to the higher standard of living. Second, we cannot escape comparing ourselves to various relative standards. Habituation and comparisons can be very disruptive to decision-making and happiness, and till date, it remains a puzzle why they have come to be a part of cognition in the first place. Here, we present computational evidence that suggests that these features might play an important role in promoting adaptive behavior. Using the framework of reinforcement learning, we explore the benefit of employing a reward function that, in addition to the reward provided by the underlying task, also depends on prior expectations and relative comparisons. We find that while agents equipped with this reward function are less happy, they learn faster and significantly outperform standard reward-based agents in a wide range of environments. Specifically, we find that relative comparisons speed up learning by providing an exploration incentive to the agents, and prior expectations serve as a useful aid to comparisons, especially in sparsely-rewarded and non-stationary environments. Our simulations also reveal potential drawbacks of this reward function and show that agents perform sub-optimally when comparisons are left unchecked and when there are too many similar options. Together, our results help explain why we are prone to becoming trapped in a cycle of never-ending wants and desires, and may shed light on psychopathologies such as depression, materialism, and overconsumption. Public Library of Science 2022-08-04 /pmc/articles/PMC9352009/ /pubmed/35925875 http://dx.doi.org/10.1371/journal.pcbi.1010316 Text en © 2022 Dubey et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dubey, Rachit
Griffiths, Thomas L.
Dayan, Peter
The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title_full The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title_fullStr The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title_full_unstemmed The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title_short The pursuit of happiness: A reinforcement learning perspective on habituation and comparisons
title_sort pursuit of happiness: a reinforcement learning perspective on habituation and comparisons
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9352009/
https://www.ncbi.nlm.nih.gov/pubmed/35925875
http://dx.doi.org/10.1371/journal.pcbi.1010316
work_keys_str_mv AT dubeyrachit thepursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons
AT griffithsthomasl thepursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons
AT dayanpeter thepursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons
AT dubeyrachit pursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons
AT griffithsthomasl pursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons
AT dayanpeter pursuitofhappinessareinforcementlearningperspectiveonhabituationandcomparisons