Cargando…

Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints

Path planning is an essential technology for lunar rover to achieve safe and efficient autonomous exploration mission, this paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. Firstly, a training environment integrating real lunar surface terr...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Xiaoqiang, Wang, Ping, Zhang, Zexu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866010/ https://www.ncbi.nlm.nih.gov/pubmed/33504073 http://dx.doi.org/10.3390/s21030796

_version_	1783647981110558720
author	Yu, Xiaoqiang Wang, Ping Zhang, Zexu
author_facet	Yu, Xiaoqiang Wang, Ping Zhang, Zexu
author_sort	Yu, Xiaoqiang
collection	PubMed
description	Path planning is an essential technology for lunar rover to achieve safe and efficient autonomous exploration mission, this paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. Firstly, a training environment integrating real lunar surface terrain data was built using the Gazebo simulation environment and a lunar rover simulator was created in it to simulate the real lunar surface environment and the lunar rover system. Then an end-to-end path planning algorithm based on deep reinforcement learning method is designed, including state space, action space, network structure, reward function considering slip behavior, and training method based on proximal policy optimization. In addition, to improve the generalization ability to different lunar surface topography and different scale environments, a variety of training scenarios were set up to train the network model using the idea of curriculum learning. The simulation results show that the proposed planning algorithm can successfully achieve the end-to-end path planning of the lunar rover, and the path generated by the proposed algorithm has a higher safety guarantee compared with the classical path planning algorithm.
format	Online Article Text
id	pubmed-7866010
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-78660102021-02-07 Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints Yu, Xiaoqiang Wang, Ping Zhang, Zexu Sensors (Basel) Article Path planning is an essential technology for lunar rover to achieve safe and efficient autonomous exploration mission, this paper proposes a learning-based end-to-end path planning algorithm for lunar rovers with safety constraints. Firstly, a training environment integrating real lunar surface terrain data was built using the Gazebo simulation environment and a lunar rover simulator was created in it to simulate the real lunar surface environment and the lunar rover system. Then an end-to-end path planning algorithm based on deep reinforcement learning method is designed, including state space, action space, network structure, reward function considering slip behavior, and training method based on proximal policy optimization. In addition, to improve the generalization ability to different lunar surface topography and different scale environments, a variety of training scenarios were set up to train the network model using the idea of curriculum learning. The simulation results show that the proposed planning algorithm can successfully achieve the end-to-end path planning of the lunar rover, and the path generated by the proposed algorithm has a higher safety guarantee compared with the classical path planning algorithm. MDPI 2021-01-25 /pmc/articles/PMC7866010/ /pubmed/33504073 http://dx.doi.org/10.3390/s21030796 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Yu, Xiaoqiang Wang, Ping Zhang, Zexu Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title	Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title_full	Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title_fullStr	Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title_full_unstemmed	Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title_short	Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints
title_sort	learning-based end-to-end path planning for lunar rovers with safety constraints
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866010/ https://www.ncbi.nlm.nih.gov/pubmed/33504073 http://dx.doi.org/10.3390/s21030796
work_keys_str_mv	AT yuxiaoqiang learningbasedendtoendpathplanningforlunarroverswithsafetyconstraints AT wangping learningbasedendtoendpathplanningforlunarroverswithsafetyconstraints AT zhangzexu learningbasedendtoendpathplanningforlunarroverswithsafetyconstraints

Learning-Based End-to-End Path Planning for Lunar Rovers with Safety Constraints

Ejemplares similares