Cargando…

Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing

When a satellite performs complex tasks such as discarding a payload or capturing a non-cooperative target, it will encounter sudden changes in the attitude and mass parameters, causing unstable flying and rolling of the satellite. In such circumstances, the change of the movement and mass character...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Zhong, Wang, Yuejiao, Yang, Yidai, Wang, Zhuping, Tang, Lei, Ackland, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6308631/
https://www.ncbi.nlm.nih.gov/pubmed/30544602
http://dx.doi.org/10.3390/s18124331
_version_ 1783383234547023872
author Ma, Zhong
Wang, Yuejiao
Yang, Yidai
Wang, Zhuping
Tang, Lei
Ackland, Stephen
author_facet Ma, Zhong
Wang, Yuejiao
Yang, Yidai
Wang, Zhuping
Tang, Lei
Ackland, Stephen
author_sort Ma, Zhong
collection PubMed
description When a satellite performs complex tasks such as discarding a payload or capturing a non-cooperative target, it will encounter sudden changes in the attitude and mass parameters, causing unstable flying and rolling of the satellite. In such circumstances, the change of the movement and mass characteristics are unpredictable. Thus, the traditional attitude control methods are unable to stabilize the satellite since they are dependent on the mass parameters of the controlled object. In this paper, we proposed a reinforcement learning method to re-stabilize the attitude of a satellite under such circumstances. Specifically, we discretize the continuous control torque, and build a neural network model that can output the discretized control torque to control the satellite. A dynamics simulation environment of the satellite is built, and the deep Q Network algorithm is then performed to train the neural network in this simulation environment. The reward of the training is the stabilization of the satellite. Simulation experiments illustrate that, with the iteration of training progresses, the neural network model gradually learned to re-stabilize the attitude of a satellite after unknown disturbance. As a contrast, the traditional PD (Proportion Differential) controller was unable to re-stabilize the satellite due to its dependence on the mass parameters. The proposed method adopts self-learning to control satellite attitudes, shows considerable intelligence and certain universality, and has a strong application potential for future intelligent control of satellites performing complex space tasks.
format Online
Article
Text
id pubmed-6308631
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63086312019-01-04 Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing Ma, Zhong Wang, Yuejiao Yang, Yidai Wang, Zhuping Tang, Lei Ackland, Stephen Sensors (Basel) Article When a satellite performs complex tasks such as discarding a payload or capturing a non-cooperative target, it will encounter sudden changes in the attitude and mass parameters, causing unstable flying and rolling of the satellite. In such circumstances, the change of the movement and mass characteristics are unpredictable. Thus, the traditional attitude control methods are unable to stabilize the satellite since they are dependent on the mass parameters of the controlled object. In this paper, we proposed a reinforcement learning method to re-stabilize the attitude of a satellite under such circumstances. Specifically, we discretize the continuous control torque, and build a neural network model that can output the discretized control torque to control the satellite. A dynamics simulation environment of the satellite is built, and the deep Q Network algorithm is then performed to train the neural network in this simulation environment. The reward of the training is the stabilization of the satellite. Simulation experiments illustrate that, with the iteration of training progresses, the neural network model gradually learned to re-stabilize the attitude of a satellite after unknown disturbance. As a contrast, the traditional PD (Proportion Differential) controller was unable to re-stabilize the satellite due to its dependence on the mass parameters. The proposed method adopts self-learning to control satellite attitudes, shows considerable intelligence and certain universality, and has a strong application potential for future intelligent control of satellites performing complex space tasks. MDPI 2018-12-07 /pmc/articles/PMC6308631/ /pubmed/30544602 http://dx.doi.org/10.3390/s18124331 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ma, Zhong
Wang, Yuejiao
Yang, Yidai
Wang, Zhuping
Tang, Lei
Ackland, Stephen
Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title_full Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title_fullStr Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title_full_unstemmed Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title_short Reinforcement Learning-Based Satellite Attitude Stabilization Method for Non-Cooperative Target Capturing
title_sort reinforcement learning-based satellite attitude stabilization method for non-cooperative target capturing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6308631/
https://www.ncbi.nlm.nih.gov/pubmed/30544602
http://dx.doi.org/10.3390/s18124331
work_keys_str_mv AT mazhong reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing
AT wangyuejiao reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing
AT yangyidai reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing
AT wangzhuping reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing
AT tanglei reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing
AT acklandstephen reinforcementlearningbasedsatelliteattitudestabilizationmethodfornoncooperativetargetcapturing