Cargando…

Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics

This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that lear...

Descripción completa

Detalles Bibliográficos
Autores principales: Meindl, Michael, Lehmann, Dustin, Seel, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9315427/
https://www.ncbi.nlm.nih.gov/pubmed/35903721
http://dx.doi.org/10.3389/frobt.2022.793512
_version_ 1784754559343656960
author Meindl, Michael
Lehmann, Dustin
Seel, Thomas
author_facet Meindl, Michael
Lehmann, Dustin
Seel, Thomas
author_sort Meindl, Michael
collection PubMed
description This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method’s plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics.
format Online
Article
Text
id pubmed-9315427
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93154272022-07-27 Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics Meindl, Michael Lehmann, Dustin Seel, Thomas Front Robot AI Robotics and AI This work addresses the problem of reference tracking in autonomously learning robots with unknown, nonlinear dynamics. Existing solutions require model information or extensive parameter tuning, and have rarely been validated in real-world experiments. We propose a learning control scheme that learns to approximate the unknown dynamics by a Gaussian Process (GP), which is used to optimize and apply a feedforward control input on each trial. Unlike existing approaches, the proposed method neither requires knowledge of the system states and their dynamics nor knowledge of an effective feedback control structure. All algorithm parameters are chosen automatically, i.e. the learning method works plug and play. The proposed method is validated in extensive simulations and real-world experiments. In contrast to most existing work, we study learning dynamics for more than one motion task as well as the robustness of performance across a large range of learning parameters. The method’s plug and play applicability is demonstrated by experiments with a balancing robot, in which the proposed method rapidly learns to track the desired output. Due to its model-agnostic and plug and play properties, the proposed method is expected to have high potential for application to a large class of reference tracking problems in systems with unknown, nonlinear dynamics. Frontiers Media S.A. 2022-07-12 /pmc/articles/PMC9315427/ /pubmed/35903721 http://dx.doi.org/10.3389/frobt.2022.793512 Text en Copyright © 2022 Meindl, Lehmann and Seel. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Meindl, Michael
Lehmann, Dustin
Seel, Thomas
Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title_full Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title_fullStr Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title_full_unstemmed Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title_short Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics
title_sort bridging reinforcement learning and iterative learning control: autonomous motion learning for unknown, nonlinear dynamics
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9315427/
https://www.ncbi.nlm.nih.gov/pubmed/35903721
http://dx.doi.org/10.3389/frobt.2022.793512
work_keys_str_mv AT meindlmichael bridgingreinforcementlearninganditerativelearningcontrolautonomousmotionlearningforunknownnonlineardynamics
AT lehmanndustin bridgingreinforcementlearninganditerativelearningcontrolautonomousmotionlearningforunknownnonlineardynamics
AT seelthomas bridgingreinforcementlearninganditerativelearningcontrolautonomousmotionlearningforunknownnonlineardynamics