Cargando…

A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?

Procrastination is the voluntary but irrational postponing of a task despite being aware that the delay can lead to worse consequences. It has been extensively studied in psychological field, from contributing factors, to theoretical models. From value-based decision making and reinforcement learnin...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Zheyu, Nagase, Asako Mitsuto, Morita, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8481628/
https://www.ncbi.nlm.nih.gov/pubmed/34602962
http://dx.doi.org/10.3389/fnins.2021.660595
_version_ 1784576717426262016
author Feng, Zheyu
Nagase, Asako Mitsuto
Morita, Kenji
author_facet Feng, Zheyu
Nagase, Asako Mitsuto
Morita, Kenji
author_sort Feng, Zheyu
collection PubMed
description Procrastination is the voluntary but irrational postponing of a task despite being aware that the delay can lead to worse consequences. It has been extensively studied in psychological field, from contributing factors, to theoretical models. From value-based decision making and reinforcement learning (RL) perspective, procrastination has been suggested to be caused by non-optimal choice resulting from cognitive limitations. Exactly what sort of cognitive limitations are involved, however, remains elusive. In the current study, we examined if a particular type of cognitive limitation, namely, inaccurate valuation resulting from inadequate state representation, would cause procrastination. Recent work has suggested that humans may adopt a particular type of state representation called the successor representation (SR) and that humans can learn to represent states by relatively low-dimensional features. Combining these suggestions, we assumed a dimension-reduced version of SR. We modeled a series of behaviors of a “student” doing assignments during the school term, when putting off doing the assignments (i.e., procrastination) is not allowed, and during the vacation, when whether to procrastinate or not can be freely chosen. We assumed that the “student” had acquired a rigid reduced SR of each state, corresponding to each step in completing an assignment, under the policy without procrastination. The “student” learned the approximated value of each state which was computed as a linear function of features of the states in the rigid reduced SR, through temporal-difference (TD) learning. During the vacation, the “student” made decisions at each time-step whether to procrastinate based on these approximated values. Simulation results showed that the reduced SR-based RL model generated procrastination behavior, which worsened across episodes. According to the values approximated by the “student,” to procrastinate was the better choice, whereas not to procrastinate was mostly better according to the true values. Thus, the current model generated procrastination behavior caused by inaccurate value approximation, which resulted from the adoption of the reduced SR as state representation. These findings indicate that the reduced SR, or more generally, the dimension reduction in state representation, can be a potential form of cognitive limitation that leads to procrastination.
format Online
Article
Text
id pubmed-8481628
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84816282021-10-01 A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task? Feng, Zheyu Nagase, Asako Mitsuto Morita, Kenji Front Neurosci Neuroscience Procrastination is the voluntary but irrational postponing of a task despite being aware that the delay can lead to worse consequences. It has been extensively studied in psychological field, from contributing factors, to theoretical models. From value-based decision making and reinforcement learning (RL) perspective, procrastination has been suggested to be caused by non-optimal choice resulting from cognitive limitations. Exactly what sort of cognitive limitations are involved, however, remains elusive. In the current study, we examined if a particular type of cognitive limitation, namely, inaccurate valuation resulting from inadequate state representation, would cause procrastination. Recent work has suggested that humans may adopt a particular type of state representation called the successor representation (SR) and that humans can learn to represent states by relatively low-dimensional features. Combining these suggestions, we assumed a dimension-reduced version of SR. We modeled a series of behaviors of a “student” doing assignments during the school term, when putting off doing the assignments (i.e., procrastination) is not allowed, and during the vacation, when whether to procrastinate or not can be freely chosen. We assumed that the “student” had acquired a rigid reduced SR of each state, corresponding to each step in completing an assignment, under the policy without procrastination. The “student” learned the approximated value of each state which was computed as a linear function of features of the states in the rigid reduced SR, through temporal-difference (TD) learning. During the vacation, the “student” made decisions at each time-step whether to procrastinate based on these approximated values. Simulation results showed that the reduced SR-based RL model generated procrastination behavior, which worsened across episodes. According to the values approximated by the “student,” to procrastinate was the better choice, whereas not to procrastinate was mostly better according to the true values. Thus, the current model generated procrastination behavior caused by inaccurate value approximation, which resulted from the adoption of the reduced SR as state representation. These findings indicate that the reduced SR, or more generally, the dimension reduction in state representation, can be a potential form of cognitive limitation that leads to procrastination. Frontiers Media S.A. 2021-09-16 /pmc/articles/PMC8481628/ /pubmed/34602962 http://dx.doi.org/10.3389/fnins.2021.660595 Text en Copyright © 2021 Feng, Nagase and Morita. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Feng, Zheyu
Nagase, Asako Mitsuto
Morita, Kenji
A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title_full A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title_fullStr A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title_full_unstemmed A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title_short A Reinforcement Learning Approach to Understanding Procrastination: Does Inaccurate Value Approximation Cause Irrational Postponing of a Task?
title_sort reinforcement learning approach to understanding procrastination: does inaccurate value approximation cause irrational postponing of a task?
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8481628/
https://www.ncbi.nlm.nih.gov/pubmed/34602962
http://dx.doi.org/10.3389/fnins.2021.660595
work_keys_str_mv AT fengzheyu areinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask
AT nagaseasakomitsuto areinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask
AT moritakenji areinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask
AT fengzheyu reinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask
AT nagaseasakomitsuto reinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask
AT moritakenji reinforcementlearningapproachtounderstandingprocrastinationdoesinaccuratevalueapproximationcauseirrationalpostponingofatask