Cargando…

Clustering analysis of movement kinematics in reinforcement learning

Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sidarta, Ananda, Komar, John, Ostry, David J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Physiological Society 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816628/ https://www.ncbi.nlm.nih.gov/pubmed/34936514 http://dx.doi.org/10.1152/jn.00229.2021

_version_	1784645477248008192
author	Sidarta, Ananda Komar, John Ostry, David J.
author_facet	Sidarta, Ananda Komar, John Ostry, David J.
author_sort	Sidarta, Ananda
collection	PubMed
description	Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behavior in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behavior was evaluated over the course of learning. It was found that by reducing the target width, participants used a greater variety of different clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning. NEW & NOTEWORTHY The choice of exploration versus exploitation is a fundamental problem in learning new motor skills through reinforcement. In this study, we employed a data-driven approach to characterize movements on a trial-by-trial basis with an unsupervised clustering algorithm. Using this technique, we found that changes in task demands and, in particular, in the required accuracy of movements, influenced the ratio of exploration to exploitation. This analysis framework provides an attractive tool to investigate mechanisms of explorative and exploitative behavior while studying motor learning.
format	Online Article Text
id	pubmed-8816628
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	American Physiological Society
record_format	MEDLINE/PubMed
spelling	pubmed-88166282022-02-07 Clustering analysis of movement kinematics in reinforcement learning Sidarta, Ananda Komar, John Ostry, David J. J Neurophysiol Research Article Reinforcement learning has been used as an experimental model of motor skill acquisition, where at times movements are successful and thus reinforced. One fundamental problem is to understand how humans select exploration over exploitation during learning. The decision could be influenced by factors such as task demands and reward availability. In this study, we applied a clustering algorithm to examine how a change in the accuracy requirements of a task affected the choice of exploration over exploitation. Participants made reaching movements to an unseen target using a planar robot arm and received reward after each successful movement. For one group of participants, the width of the hidden target decreased after every other training block. For a second group, it remained constant. The clustering algorithm was applied to the kinematic data to characterize motor learning on a trial-to-trial basis as a sequence of movements, each belonging to one of the identified clusters. By the end of learning, movement trajectories across all participants converged primarily to a single cluster with the greatest number of successful trials. Within this analysis framework, we defined exploration and exploitation as types of behavior in which two successive trajectories belong to different or similar clusters, respectively. The frequency of each mode of behavior was evaluated over the course of learning. It was found that by reducing the target width, participants used a greater variety of different clusters and displayed more exploration than exploitation. Excessive exploration relative to exploitation was found to be detrimental to subsequent motor learning. NEW & NOTEWORTHY The choice of exploration versus exploitation is a fundamental problem in learning new motor skills through reinforcement. In this study, we employed a data-driven approach to characterize movements on a trial-by-trial basis with an unsupervised clustering algorithm. Using this technique, we found that changes in task demands and, in particular, in the required accuracy of movements, influenced the ratio of exploration to exploitation. This analysis framework provides an attractive tool to investigate mechanisms of explorative and exploitative behavior while studying motor learning. American Physiological Society 2022-02-01 2021-12-22 /pmc/articles/PMC8816628/ /pubmed/34936514 http://dx.doi.org/10.1152/jn.00229.2021 Text en Copyright © 2022 The Authors https://creativecommons.org/licenses/by/4.0/Licensed under Creative Commons Attribution CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/) . Published by the American Physiological Society.
spellingShingle	Research Article Sidarta, Ananda Komar, John Ostry, David J. Clustering analysis of movement kinematics in reinforcement learning
title	Clustering analysis of movement kinematics in reinforcement learning
title_full	Clustering analysis of movement kinematics in reinforcement learning
title_fullStr	Clustering analysis of movement kinematics in reinforcement learning
title_full_unstemmed	Clustering analysis of movement kinematics in reinforcement learning
title_short	Clustering analysis of movement kinematics in reinforcement learning
title_sort	clustering analysis of movement kinematics in reinforcement learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816628/ https://www.ncbi.nlm.nih.gov/pubmed/34936514 http://dx.doi.org/10.1152/jn.00229.2021
work_keys_str_mv	AT sidartaananda clusteringanalysisofmovementkinematicsinreinforcementlearning AT komarjohn clusteringanalysisofmovementkinematicsinreinforcementlearning AT ostrydavidj clusteringanalysisofmovementkinematicsinreinforcementlearning

Clustering analysis of movement kinematics in reinforcement learning

Ejemplares similares