Cargando…

General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †

This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-spe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Pi, Chen-Huan, Dai, Yi-Wei, Hu, Kai-Chun, Cheng, Stone
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/ https://www.ncbi.nlm.nih.gov/pubmed/34283119 http://dx.doi.org/10.3390/s21134560

_version_	1783721086338203648
author	Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone
author_facet	Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone
author_sort	Pi, Chen-Huan
collection	PubMed
description	This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states.
format	Online Article Text
id	pubmed-8271845
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-82718452021-07-11 General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles † Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone Sensors (Basel) Article This paper proposes a multipurpose reinforcement learning based low-level multirotor unmanned aerial vehicles control structure constructed using neural networks with model-free training. Other low-level reinforcement learning controllers developed in studies have only been applicable to a model-specific and physical-parameter-specific multirotor, and time-consuming training is required when switching to a different vehicle. We use a 6-degree-of-freedom dynamic model combining acceleration-based control from the policy neural network to overcome these problems. The UAV automatically learns the maneuver by an end-to-end neural network from fusion states to acceleration command. The state estimation is performed using the data from on-board sensors and motion capture. The motion capture system provides spatial position information and a multisensory fusion framework fuses the measurement from the onboard inertia measurement units for compensating the time delay and low update frequency of the capture system. Without requiring expert demonstration, the trained control policy implemented using an improved algorithm can be applied to various multirotors with the output directly mapped to actuators. The algorithm’s ability to control multirotors in the hovering and the tracking task is evaluated. Through simulation and actual experiments, we demonstrate the flight control with a quadrotor and hexrotor by using the trained policy. With the same policy, we verify that we can stabilize the quadrotor and hexrotor in the air under random initial states. MDPI 2021-07-02 /pmc/articles/PMC8271845/ /pubmed/34283119 http://dx.doi.org/10.3390/s21134560 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Pi, Chen-Huan Dai, Yi-Wei Hu, Kai-Chun Cheng, Stone General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title	General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_full	General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_fullStr	General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_full_unstemmed	General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_short	General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †
title_sort	general purpose low-level reinforcement learning control for multi-axis rotor aerial vehicles †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8271845/ https://www.ncbi.nlm.nih.gov/pubmed/34283119 http://dx.doi.org/10.3390/s21134560
work_keys_str_mv	AT pichenhuan generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT daiyiwei generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT hukaichun generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles AT chengstone generalpurposelowlevelreinforcementlearningcontrolformultiaxisrotoraerialvehicles

General Purpose Low-Level Reinforcement Learning Control for Multi-Axis Rotor Aerial Vehicles †

Ejemplares similares