Cargando…
Credit Assignment during Movement Reinforcement Learning
We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be im...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3568147/ https://www.ncbi.nlm.nih.gov/pubmed/23408972 http://dx.doi.org/10.1371/journal.pone.0055352 |
_version_ | 1782258778656210944 |
---|---|
author | Dam, Gregory Kording, Konrad Wei, Kunlin |
author_facet | Dam, Gregory Kording, Konrad Wei, Kunlin |
author_sort | Dam, Gregory |
collection | PubMed |
description | We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be improved; it needs to solve the credit assignment problem. Currently, little is known about how humans solve credit assignment problems in the context of reinforcement learning. Here we tested how human participants solve such problems during a trajectory-learning task. Without an explicitly-defined target movement, participants made hand reaches and received monetary rewards as feedback on a trial-by-trial basis. The curvature and direction of the attempted reach trajectories determined the monetary rewards received in a manner that can be manipulated experimentally. Based on the history of action-reward pairs, participants quickly solved the credit assignment problem and learned the implicit payoff function. A Bayesian credit-assignment model with built-in forgetting accurately predicts their trial-by-trial learning. |
format | Online Article Text |
id | pubmed-3568147 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-35681472013-02-13 Credit Assignment during Movement Reinforcement Learning Dam, Gregory Kording, Konrad Wei, Kunlin PLoS One Research Article We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be improved; it needs to solve the credit assignment problem. Currently, little is known about how humans solve credit assignment problems in the context of reinforcement learning. Here we tested how human participants solve such problems during a trajectory-learning task. Without an explicitly-defined target movement, participants made hand reaches and received monetary rewards as feedback on a trial-by-trial basis. The curvature and direction of the attempted reach trajectories determined the monetary rewards received in a manner that can be manipulated experimentally. Based on the history of action-reward pairs, participants quickly solved the credit assignment problem and learned the implicit payoff function. A Bayesian credit-assignment model with built-in forgetting accurately predicts their trial-by-trial learning. Public Library of Science 2013-02-08 /pmc/articles/PMC3568147/ /pubmed/23408972 http://dx.doi.org/10.1371/journal.pone.0055352 Text en © 2013 Dam et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Dam, Gregory Kording, Konrad Wei, Kunlin Credit Assignment during Movement Reinforcement Learning |
title | Credit Assignment during Movement Reinforcement Learning |
title_full | Credit Assignment during Movement Reinforcement Learning |
title_fullStr | Credit Assignment during Movement Reinforcement Learning |
title_full_unstemmed | Credit Assignment during Movement Reinforcement Learning |
title_short | Credit Assignment during Movement Reinforcement Learning |
title_sort | credit assignment during movement reinforcement learning |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3568147/ https://www.ncbi.nlm.nih.gov/pubmed/23408972 http://dx.doi.org/10.1371/journal.pone.0055352 |
work_keys_str_mv | AT damgregory creditassignmentduringmovementreinforcementlearning AT kordingkonrad creditassignmentduringmovementreinforcementlearning AT weikunlin creditassignmentduringmovementreinforcementlearning |