Cargando…
Reinforcement learning control of a biomechanical model of the upper extremity
Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280157/ https://www.ncbi.nlm.nih.gov/pubmed/34262081 http://dx.doi.org/10.1038/s41598-021-93760-1 |
_version_ | 1783722593290813440 |
---|---|
author | Fischer, Florian Bachinski, Miroslav Klar, Markus Fleig, Arthur Müller, Jörg |
author_facet | Fischer, Florian Bachinski, Miroslav Klar, Markus Fleig, Arthur Müller, Jörg |
author_sort | Fischer, Florian |
collection | PubMed |
description | Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only been evaluated for simplified point-mass or planar models, we address the question of whether they can predict reaching movements in a full skeletal model of the human upper extremity. We learn a control policy using a motor babbling approach as implemented in reinforcement learning, using aimed movements of the tip of the right index finger towards randomly placed 3D targets of varying size. We use a state-of-the-art biomechanical model, which includes seven actuated degrees of freedom. To deal with the curse of dimensionality, we use a simplified second-order muscle model, acting at each degree of freedom instead of individual muscles. The results confirm that the assumptions of signal-dependent and constant motor noise, together with the objective of movement time minimization, are sufficient for a state-of-the-art skeletal model of the human upper extremity to reproduce complex phenomena of human movement, in particular Fitts’ Law and the [Formula: see text] Power Law. This result supports the notion that control of the complex human biomechanical system can plausibly be determined by a set of simple assumptions and can easily be learned. |
format | Online Article Text |
id | pubmed-8280157 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-82801572021-07-15 Reinforcement learning control of a biomechanical model of the upper extremity Fischer, Florian Bachinski, Miroslav Klar, Markus Fleig, Arthur Müller, Jörg Sci Rep Article Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only been evaluated for simplified point-mass or planar models, we address the question of whether they can predict reaching movements in a full skeletal model of the human upper extremity. We learn a control policy using a motor babbling approach as implemented in reinforcement learning, using aimed movements of the tip of the right index finger towards randomly placed 3D targets of varying size. We use a state-of-the-art biomechanical model, which includes seven actuated degrees of freedom. To deal with the curse of dimensionality, we use a simplified second-order muscle model, acting at each degree of freedom instead of individual muscles. The results confirm that the assumptions of signal-dependent and constant motor noise, together with the objective of movement time minimization, are sufficient for a state-of-the-art skeletal model of the human upper extremity to reproduce complex phenomena of human movement, in particular Fitts’ Law and the [Formula: see text] Power Law. This result supports the notion that control of the complex human biomechanical system can plausibly be determined by a set of simple assumptions and can easily be learned. Nature Publishing Group UK 2021-07-14 /pmc/articles/PMC8280157/ /pubmed/34262081 http://dx.doi.org/10.1038/s41598-021-93760-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Fischer, Florian Bachinski, Miroslav Klar, Markus Fleig, Arthur Müller, Jörg Reinforcement learning control of a biomechanical model of the upper extremity |
title | Reinforcement learning control of a biomechanical model of the upper extremity |
title_full | Reinforcement learning control of a biomechanical model of the upper extremity |
title_fullStr | Reinforcement learning control of a biomechanical model of the upper extremity |
title_full_unstemmed | Reinforcement learning control of a biomechanical model of the upper extremity |
title_short | Reinforcement learning control of a biomechanical model of the upper extremity |
title_sort | reinforcement learning control of a biomechanical model of the upper extremity |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8280157/ https://www.ncbi.nlm.nih.gov/pubmed/34262081 http://dx.doi.org/10.1038/s41598-021-93760-1 |
work_keys_str_mv | AT fischerflorian reinforcementlearningcontrolofabiomechanicalmodeloftheupperextremity AT bachinskimiroslav reinforcementlearningcontrolofabiomechanicalmodeloftheupperextremity AT klarmarkus reinforcementlearningcontrolofabiomechanicalmodeloftheupperextremity AT fleigarthur reinforcementlearningcontrolofabiomechanicalmodeloftheupperextremity AT mullerjorg reinforcementlearningcontrolofabiomechanicalmodeloftheupperextremity |