Cargando…
General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-spe...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/ https://www.ncbi.nlm.nih.gov/pubmed/34283119 http://dx.doi.org/10.3390/s21134560 |
_version_ | 1783721086338203648 |
---|---|
author | Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone |
author_facet | Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone |
author_sort | Pi, Chen-Huan |
collection | PubMed |
description | This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states. |
format | Online Article Text |
id | pubmed-8271845 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-82718452021-07-11 General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone Sensors (Basel) Article This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states. MDPI 2021-07-02 /pmc/articles/PMC8271845/ /pubmed/34283119 http://dx.doi.org/10.3390/s21134560 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title | General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title_full | General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title_fullStr | General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title_full_unstemmed | General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title_short | General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † |
title_sort | general purpose low-level reinforcement learning control for multi-axis rotor aerial vehicles † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/ https://www.ncbi.nlm.nih.gov/pubmed/34283119 http://dx.doi.org/10.3390/s21134560 |
work_keys_str_mv | AT pichenhuan generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT daiyiwei generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT hukaichun generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT chengstone generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles |