Cargando…
Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/ https://www.ncbi.nlm.nih.gov/pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071 |
_version_ | 1784888019562528768 |
---|---|
author | Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric |
author_facet | Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric |
author_sort | Israilov, Sardor |
collection | PubMed |
description | Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach. |
format | Online Article Text |
id | pubmed-9925229 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-99252292023-02-14 Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric PLoS One Research Article Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach. Public Library of Science 2023-02-13 /pmc/articles/PMC9925229/ /pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071 Text en © 2023 Israilov et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title | Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title_full | Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title_fullStr | Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title_full_unstemmed | Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title_short | Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes |
title_sort | reinforcement learning approach to control an inverted pendulum: a general framework for educational purposes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/ https://www.ncbi.nlm.nih.gov/pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071 |
work_keys_str_mv | AT israilovsardor reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT fuli reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT sanchezrodriguezjesus reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT fuscofranco reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT allibertguillaume reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT raufastechristophe reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT argentinamederic reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes |