Cargando…

Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm

With the development of artificial intelligence technology, the behavior decision-making of an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposed local path planning and a behavior decision-making approach based on improved Proximal Pol...

Descripción completa

Detalles Bibliográficos
Autores principales:	Guan, Wei, Cui, Zhewen, Zhang, Xianku
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371246/ https://www.ncbi.nlm.nih.gov/pubmed/35957288 http://dx.doi.org/10.3390/s22155732

_version_	1784767080531230720
author	Guan, Wei Cui, Zhewen Zhang, Xianku
author_facet	Guan, Wei Cui, Zhewen Zhang, Xianku
author_sort	Guan, Wei
collection	PubMed
description	With the development of artificial intelligence technology, the behavior decision-making of an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposed local path planning and a behavior decision-making approach based on improved Proximal Policy Optimization (PPO), which could drive an unmanned SMASS to the target without requiring any human experiences. In addition, a generalized advantage estimation was added to the loss function of the PPO algorithm, which allowed baselines in PPO algorithms to be self-adjusted. At first, the SMASS was modeled with the Nomoto model in a simulation waterway. Then, distances, obstacles, and prohibited areas were regularized as rewards or punishments, which were used to judge the performance and manipulation decisions of the vessel Subsequently, improved PPO was introduced to learn the action–reward model, and the neural network model after training was used to manipulate the SMASS’s movement. To achieve higher reward values, the SMASS could find an appropriate path or navigation strategy by itself. After a sufficient number of rounds of training, a convincing path and manipulation strategies would likely be produced. Compared with the proposed approach of the existing methods, this approach is more effective in self-learning and continuous optimization and thus closer to human manipulation.
format	Online Article Text
id	pubmed-9371246
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-93712462022-08-12 Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm Guan, Wei Cui, Zhewen Zhang, Xianku Sensors (Basel) Article With the development of artificial intelligence technology, the behavior decision-making of an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposed local path planning and a behavior decision-making approach based on improved Proximal Policy Optimization (PPO), which could drive an unmanned SMASS to the target without requiring any human experiences. In addition, a generalized advantage estimation was added to the loss function of the PPO algorithm, which allowed baselines in PPO algorithms to be self-adjusted. At first, the SMASS was modeled with the Nomoto model in a simulation waterway. Then, distances, obstacles, and prohibited areas were regularized as rewards or punishments, which were used to judge the performance and manipulation decisions of the vessel Subsequently, improved PPO was introduced to learn the action–reward model, and the neural network model after training was used to manipulate the SMASS’s movement. To achieve higher reward values, the SMASS could find an appropriate path or navigation strategy by itself. After a sufficient number of rounds of training, a convincing path and manipulation strategies would likely be produced. Compared with the proposed approach of the existing methods, this approach is more effective in self-learning and continuous optimization and thus closer to human manipulation. MDPI 2022-07-31 /pmc/articles/PMC9371246/ /pubmed/35957288 http://dx.doi.org/10.3390/s22155732 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Guan, Wei Cui, Zhewen Zhang, Xianku Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title	Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title_full	Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title_fullStr	Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title_full_unstemmed	Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title_short	Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm
title_sort	intelligent smart marine autonomous surface ship decision system based on improved ppo algorithm
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371246/ https://www.ncbi.nlm.nih.gov/pubmed/35957288 http://dx.doi.org/10.3390/s22155732
work_keys_str_mv	AT guanwei intelligentsmartmarineautonomoussurfaceshipdecisionsystembasedonimprovedppoalgorithm AT cuizhewen intelligentsmartmarineautonomoussurfaceshipdecisionsystembasedonimprovedppoalgorithm AT zhangxianku intelligentsmartmarineautonomoussurfaceshipdecisionsystembasedonimprovedppoalgorithm

Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm

Ejemplares similares