Cargando…

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a...

Descripción completa

Detalles Bibliográficos
Autores principales: Israilov, Sardor, Fu, Li, Sánchez-Rodríguez, Jesús, Fusco, Franco, Allibert, Guillaume, Raufaste, Christophe, Argentina, Médéric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/
https://www.ncbi.nlm.nih.gov/pubmed/36780874
http://dx.doi.org/10.1371/journal.pone.0280071
_version_ 1784888019562528768
author Israilov, Sardor
Fu, Li
Sánchez-Rodríguez, Jesús
Fusco, Franco
Allibert, Guillaume
Raufaste, Christophe
Argentina, Médéric
author_facet Israilov, Sardor
Fu, Li
Sánchez-Rodríguez, Jesús
Fusco, Franco
Allibert, Guillaume
Raufaste, Christophe
Argentina, Médéric
author_sort Israilov, Sardor
collection PubMed
description Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach.
format Online
Article
Text
id pubmed-9925229
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99252292023-02-14 Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric PLoS One Research Article Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach. Public Library of Science 2023-02-13 /pmc/articles/PMC9925229/ /pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071 Text en © 2023 Israilov et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Israilov, Sardor
Fu, Li
Sánchez-Rodríguez, Jesús
Fusco, Franco
Allibert, Guillaume
Raufaste, Christophe
Argentina, Médéric
Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_full Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_fullStr Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_full_unstemmed Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_short Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_sort reinforcement learning approach to control an inverted pendulum: a general framework for educational purposes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/
https://www.ncbi.nlm.nih.gov/pubmed/36780874
http://dx.doi.org/10.1371/journal.pone.0280071
work_keys_str_mv AT israilovsardor reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT fuli reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT sanchezrodriguezjesus reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT fuscofranco reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT allibertguillaume reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT raufastechristophe reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes
AT argentinamederic reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes