Cargando…

General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †

This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-spe...

Descripción completa

Detalles Bibliográficos
Autores principales: Pi, Chen-Huan, Dai, Yi-Wei, Hu, Kai-Chun, Cheng, Stone
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/
https://www.ncbi.nlm.nih.gov/pubmed/34283119
http://dx.doi.org/10.3390/s21134560
_version_ 1783721086338203648
author Pi, Chen-Huan
Dai, Yi-Wei
Hu, Kai-Chun
Cheng, Stone
author_facet Pi, Chen-Huan
Dai, Yi-Wei
Hu, Kai-Chun
Cheng, Stone
author_sort Pi, Chen-Huan
collection PubMed
description This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states.
format Online
Article
Text
id pubmed-8271845
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82718452021-07-11 General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone Sensors (Basel) Article This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states. MDPI 2021-07-02 /pmc/articles/PMC8271845/ /pubmed/34283119 http://dx.doi.org/10.3390/s21134560 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pi, Chen-Huan
Dai, Yi-Wei
Hu, Kai-Chun
Cheng, Stone
General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_full General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_fullStr General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_full_unstemmed General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_short General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_sort general purpose low-level reinforcement learning control for multi-axis rotor aerial vehicles †
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/
https://www.ncbi.nlm.nih.gov/pubmed/34283119
http://dx.doi.org/10.3390/s21134560
work_keys_str_mv AT pichenhuan generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles
AT daiyiwei generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles
AT hukaichun generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles
AT chengstone generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles