Cargando…

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Israilov, Sardor, Fu, Li, Sánchez-Rodríguez, Jesús, Fusco, Franco, Allibert, Guillaume, Raufaste, Christophe, Argentina, Médéric
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/ https://www.ncbi.nlm.nih.gov/pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071

_version_	1784888019562528768
author	Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric
author_facet	Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric
author_sort	Israilov, Sardor
collection	PubMed
description	Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach.
format	Online Article Text
id	pubmed-9925229
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-99252292023-02-14 Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric PLoS One Research Article Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach. Public Library of Science 2023-02-13 /pmc/articles/PMC9925229/ /pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071 Text en © 2023 Israilov et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Israilov, Sardor Fu, Li Sánchez-Rodríguez, Jesús Fusco, Franco Allibert, Guillaume Raufaste, Christophe Argentina, Médéric Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title	Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_full	Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_fullStr	Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_full_unstemmed	Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_short	Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes
title_sort	reinforcement learning approach to control an inverted pendulum: a general framework for educational purposes
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9925229/ https://www.ncbi.nlm.nih.gov/pubmed/36780874 http://dx.doi.org/10.1371/journal.pone.0280071
work_keys_str_mv	AT israilovsardor reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT fuli reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT sanchezrodriguezjesus reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT fuscofranco reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT allibertguillaume reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT raufastechristophe reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes AT argentinamederic reinforcementlearningapproachtocontrolaninvertedpendulumageneralframeworkforeducationalpurposes

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Ejemplares similares