Cargando…

Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning

It is challenging to perform path planning tasks in complex marine environments as the unmanned surface vessel approaches the goal while avoiding obstacles. However, the conflict between the two subtarget tasks of obstacle avoidance and goal approaching makes the path planning difficult. Thus, a pat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Caipei, Zhao, Yingqi, Cai, Xuan, Wei, Wei, Feng, Xingxing, Zhou, Kaibo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946747/
https://www.ncbi.nlm.nih.gov/pubmed/36844696
http://dx.doi.org/10.1155/2023/2146314
_version_ 1784892399558852608
author Yang, Caipei
Zhao, Yingqi
Cai, Xuan
Wei, Wei
Feng, Xingxing
Zhou, Kaibo
author_facet Yang, Caipei
Zhao, Yingqi
Cai, Xuan
Wei, Wei
Feng, Xingxing
Zhou, Kaibo
author_sort Yang, Caipei
collection PubMed
description It is challenging to perform path planning tasks in complex marine environments as the unmanned surface vessel approaches the goal while avoiding obstacles. However, the conflict between the two subtarget tasks of obstacle avoidance and goal approaching makes the path planning difficult. Thus, a path planning method for unmanned surface vessel based on multiobjective reinforcement learning is proposed under the complex environment with high randomness and multiple dynamic obstacles. Firstly, the path planning scene is set as the main scene, and the two subtarget scenes including obstacle avoidance and goal approaching are divided from it. The action selection strategy in each subtarget scene is trained through the double deep Q-network with prioritized experience replay. A multiobjective reinforcement learning framework based on ensemble learning is further designed for policy integration in the main scene. Finally, by selecting the strategy from subtarget scenes in the designed framework, an optimized action selection strategy is trained and used for the action decision of the agent in the main scene. Compared with traditional value-based reinforcement learning methods, the proposed method achieves a 93% success rate in path planning in simulation scenes. Furthermore, the average length of the paths planned by the proposed method is 3.28% and 1.97% shorter than that of PER-DDQN and dueling DQN, respectively.
format Online
Article
Text
id pubmed-9946747
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-99467472023-02-23 Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning Yang, Caipei Zhao, Yingqi Cai, Xuan Wei, Wei Feng, Xingxing Zhou, Kaibo Comput Intell Neurosci Research Article It is challenging to perform path planning tasks in complex marine environments as the unmanned surface vessel approaches the goal while avoiding obstacles. However, the conflict between the two subtarget tasks of obstacle avoidance and goal approaching makes the path planning difficult. Thus, a path planning method for unmanned surface vessel based on multiobjective reinforcement learning is proposed under the complex environment with high randomness and multiple dynamic obstacles. Firstly, the path planning scene is set as the main scene, and the two subtarget scenes including obstacle avoidance and goal approaching are divided from it. The action selection strategy in each subtarget scene is trained through the double deep Q-network with prioritized experience replay. A multiobjective reinforcement learning framework based on ensemble learning is further designed for policy integration in the main scene. Finally, by selecting the strategy from subtarget scenes in the designed framework, an optimized action selection strategy is trained and used for the action decision of the agent in the main scene. Compared with traditional value-based reinforcement learning methods, the proposed method achieves a 93% success rate in path planning in simulation scenes. Furthermore, the average length of the paths planned by the proposed method is 3.28% and 1.97% shorter than that of PER-DDQN and dueling DQN, respectively. Hindawi 2023-02-15 /pmc/articles/PMC9946747/ /pubmed/36844696 http://dx.doi.org/10.1155/2023/2146314 Text en Copyright © 2023 Caipei Yang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yang, Caipei
Zhao, Yingqi
Cai, Xuan
Wei, Wei
Feng, Xingxing
Zhou, Kaibo
Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title_full Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title_fullStr Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title_full_unstemmed Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title_short Path Planning Algorithm for Unmanned Surface Vessel Based on Multiobjective Reinforcement Learning
title_sort path planning algorithm for unmanned surface vessel based on multiobjective reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9946747/
https://www.ncbi.nlm.nih.gov/pubmed/36844696
http://dx.doi.org/10.1155/2023/2146314
work_keys_str_mv AT yangcaipei pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning
AT zhaoyingqi pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning
AT caixuan pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning
AT weiwei pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning
AT fengxingxing pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning
AT zhoukaibo pathplanningalgorithmforunmannedsurfacevesselbasedonmultiobjectivereinforcementlearning