Cargando…
Predictive Movements and Human Reinforcement Learning of Sequential Action
Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6001690/ https://www.ncbi.nlm.nih.gov/pubmed/29498434 http://dx.doi.org/10.1111/cogs.12599 |
_version_ | 1783332063535955968 |
---|---|
author | de Kleijn, Roy Kachergis, George Hommel, Bernhard |
author_facet | de Kleijn, Roy Kachergis, George Hommel, Bernhard |
author_sort | de Kleijn, Roy |
collection | PubMed |
description | Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse movement trajectory SRT task in which the cursor must be moved to a cued location. We replicated keypress SRT results, but also found that predictive movement—before the next cue appears—increased during the experiment. Moreover, trajectory analyses revealed that people developed a centering strategy under uncertainty. In a second experiment, we made prediction explicit, no longer cueing targets. Thus, participants had to explore the response alternatives and learn via reinforcement, receiving rewards and penalties for correct and incorrect actions, respectively. Participants were not told whether the sequence of stimuli was deterministic, nor if it would repeat, nor how long it was. Given the difficulty of the task, it is unsurprising that some learners performed poorly. However, many learners performed remarkably well, and some acquired the full 10‐item sequence within 10 repetitions. Comparing the high‐ and low‐performers’ detailed results in this reinforcement learning (RL) task with the first experiment's cued trajectory SRT task, we found similarities between the two tasks, suggesting that the effects in Experiment 1 are due to predictive, rather than reactive processes. Finally, we found that two standard model‐free reinforcement learning models fit the high‐performing participants, while the four low‐performing participants provide better fit with a simple negative recency bias model. |
format | Online Article Text |
id | pubmed-6001690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-60016902018-06-21 Predictive Movements and Human Reinforcement Learning of Sequential Action de Kleijn, Roy Kachergis, George Hommel, Bernhard Cogn Sci Regular Articles Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse movement trajectory SRT task in which the cursor must be moved to a cued location. We replicated keypress SRT results, but also found that predictive movement—before the next cue appears—increased during the experiment. Moreover, trajectory analyses revealed that people developed a centering strategy under uncertainty. In a second experiment, we made prediction explicit, no longer cueing targets. Thus, participants had to explore the response alternatives and learn via reinforcement, receiving rewards and penalties for correct and incorrect actions, respectively. Participants were not told whether the sequence of stimuli was deterministic, nor if it would repeat, nor how long it was. Given the difficulty of the task, it is unsurprising that some learners performed poorly. However, many learners performed remarkably well, and some acquired the full 10‐item sequence within 10 repetitions. Comparing the high‐ and low‐performers’ detailed results in this reinforcement learning (RL) task with the first experiment's cued trajectory SRT task, we found similarities between the two tasks, suggesting that the effects in Experiment 1 are due to predictive, rather than reactive processes. Finally, we found that two standard model‐free reinforcement learning models fit the high‐performing participants, while the four low‐performing participants provide better fit with a simple negative recency bias model. John Wiley and Sons Inc. 2018-03-02 2018-06 /pmc/articles/PMC6001690/ /pubmed/29498434 http://dx.doi.org/10.1111/cogs.12599 Text en Copyright © 2018 The Authors. Cognitive Science ‐ A Multidisciplinary Journal published by Wiley Periodicals, Inc. on behalf of Cognitive Science Society. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made. |
spellingShingle | Regular Articles de Kleijn, Roy Kachergis, George Hommel, Bernhard Predictive Movements and Human Reinforcement Learning of Sequential Action |
title | Predictive Movements and Human Reinforcement Learning of Sequential Action |
title_full | Predictive Movements and Human Reinforcement Learning of Sequential Action |
title_fullStr | Predictive Movements and Human Reinforcement Learning of Sequential Action |
title_full_unstemmed | Predictive Movements and Human Reinforcement Learning of Sequential Action |
title_short | Predictive Movements and Human Reinforcement Learning of Sequential Action |
title_sort | predictive movements and human reinforcement learning of sequential action |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6001690/ https://www.ncbi.nlm.nih.gov/pubmed/29498434 http://dx.doi.org/10.1111/cogs.12599 |
work_keys_str_mv | AT dekleijnroy predictivemovementsandhumanreinforcementlearningofsequentialaction AT kachergisgeorge predictivemovementsandhumanreinforcementlearningofsequentialaction AT hommelbernhard predictivemovementsandhumanreinforcementlearningofsequentialaction |