Cargando…

Credit Assignment during Movement Reinforcement Learning

We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be im...

Descripción completa

Detalles Bibliográficos
Autores principales: Dam, Gregory, Kording, Konrad, Wei, Kunlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3568147/
https://www.ncbi.nlm.nih.gov/pubmed/23408972
http://dx.doi.org/10.1371/journal.pone.0055352
_version_ 1782258778656210944
author Dam, Gregory
Kording, Konrad
Wei, Kunlin
author_facet Dam, Gregory
Kording, Konrad
Wei, Kunlin
author_sort Dam, Gregory
collection PubMed
description We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be improved; it needs to solve the credit assignment problem. Currently, little is known about how humans solve credit assignment problems in the context of reinforcement learning. Here we tested how human participants solve such problems during a trajectory-learning task. Without an explicitly-defined target movement, participants made hand reaches and received monetary rewards as feedback on a trial-by-trial basis. The curvature and direction of the attempted reach trajectories determined the monetary rewards received in a manner that can be manipulated experimentally. Based on the history of action-reward pairs, participants quickly solved the credit assignment problem and learned the implicit payoff function. A Bayesian credit-assignment model with built-in forgetting accurately predicts their trial-by-trial learning.
format Online
Article
Text
id pubmed-3568147
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35681472013-02-13 Credit Assignment during Movement Reinforcement Learning Dam, Gregory Kording, Konrad Wei, Kunlin PLoS One Research Article We often need to learn how to move based on a single performance measure that reflects the overall success of our movements. However, movements have many properties, such as their trajectories, speeds and timing of end-points, thus the brain needs to decide which properties of movements should be improved; it needs to solve the credit assignment problem. Currently, little is known about how humans solve credit assignment problems in the context of reinforcement learning. Here we tested how human participants solve such problems during a trajectory-learning task. Without an explicitly-defined target movement, participants made hand reaches and received monetary rewards as feedback on a trial-by-trial basis. The curvature and direction of the attempted reach trajectories determined the monetary rewards received in a manner that can be manipulated experimentally. Based on the history of action-reward pairs, participants quickly solved the credit assignment problem and learned the implicit payoff function. A Bayesian credit-assignment model with built-in forgetting accurately predicts their trial-by-trial learning. Public Library of Science 2013-02-08 /pmc/articles/PMC3568147/ /pubmed/23408972 http://dx.doi.org/10.1371/journal.pone.0055352 Text en © 2013 Dam et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dam, Gregory
Kording, Konrad
Wei, Kunlin
Credit Assignment during Movement Reinforcement Learning
title Credit Assignment during Movement Reinforcement Learning
title_full Credit Assignment during Movement Reinforcement Learning
title_fullStr Credit Assignment during Movement Reinforcement Learning
title_full_unstemmed Credit Assignment during Movement Reinforcement Learning
title_short Credit Assignment during Movement Reinforcement Learning
title_sort credit assignment during movement reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3568147/
https://www.ncbi.nlm.nih.gov/pubmed/23408972
http://dx.doi.org/10.1371/journal.pone.0055352
work_keys_str_mv AT damgregory creditassignmentduringmovementreinforcementlearning
AT kordingkonrad creditassignmentduringmovementreinforcementlearning
AT weikunlin creditassignmentduringmovementreinforcementlearning