Cargando…

Clustering analysis of movement kinematics in reinforcement learning

Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors...

Descripción completa

Detalles Bibliográficos
Autores principales: Sidarta, Ananda, Komar, John, Ostry, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Physiological Society 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816628/
https://www.ncbi.nlm.nih.gov/pubmed/34936514
http://dx.doi.org/10.1152/jn.00229.2021
_version_ 1784645477248008192
author Sidarta, Ananda
Komar, John
Ostry, David J.
author_facet Sidarta, Ananda
Komar, John
Ostry, David J.
author_sort Sidarta, Ananda
collection PubMed
description Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behavior in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behavior was evaluated over the course of learning. It was found that by reducing the target width, participants used a greater variety of different clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning. NEW & NOTEWORTHY The choice of exploration versus exploitation is a fundamental problem in learning new motor skills through reinforcement. In this study, we employed a data-driven approach to characterize movements on a trial-by-trial basis with an unsupervised clustering algorithm. Using this technique, we found that changes in task demands and, in particular, in the required accuracy of movements, influenced the ratio of exploration to exploitation. This analysis framework provides an attractive tool to investigate mechanisms of explorative and exploitative behavior while studying motor learning.
format Online
Article
Text
id pubmed-8816628
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Physiological Society
record_format MEDLINE/PubMed
spelling pubmed-88166282022-02-07 Clustering analysis of movement kinematics in reinforcement learning Sidarta, Ananda Komar, John Ostry, David J. J Neurophysiol Research Article Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behavior in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behavior was evaluated over the course of learning. It was found that by reducing the target width, participants used a greater variety of different clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning. NEW & NOTEWORTHY The choice of exploration versus exploitation is a fundamental problem in learning new motor skills through reinforcement. In this study, we employed a data-driven approach to characterize movements on a trial-by-trial basis with an unsupervised clustering algorithm. Using this technique, we found that changes in task demands and, in particular, in the required accuracy of movements, influenced the ratio of exploration to exploitation. This analysis framework provides an attractive tool to investigate mechanisms of explorative and exploitative behavior while studying motor learning. American Physiological Society 2022-02-01 2021-12-22 /pmc/articles/PMC8816628/ /pubmed/34936514 http://dx.doi.org/10.1152/jn.00229.2021 Text en Copyright © 2022 The Authors https://creativecommons.org/licenses/by/4.0/Licensed under Creative Commons Attribution CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/) . Published by the American Physiological Society.
spellingShingle Research Article
Sidarta, Ananda
Komar, John
Ostry, David J.
Clustering analysis of movement kinematics in reinforcement learning
title Clustering analysis of movement kinematics in reinforcement learning
title_full Clustering analysis of movement kinematics in reinforcement learning
title_fullStr Clustering analysis of movement kinematics in reinforcement learning
title_full_unstemmed Clustering analysis of movement kinematics in reinforcement learning
title_short Clustering analysis of movement kinematics in reinforcement learning
title_sort clustering analysis of movement kinematics in reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816628/
https://www.ncbi.nlm.nih.gov/pubmed/34936514
http://dx.doi.org/10.1152/jn.00229.2021
work_keys_str_mv AT sidartaananda clusteringanalysisofmovementkinematicsinreinforcementlearning
AT komarjohn clusteringanalysisofmovementkinematicsinreinforcementlearning
AT ostrydavidj clusteringanalysisofmovementkinematicsinreinforcementlearning