Cargando…

A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots

The path planning problem has gained more attention due to the gradual popularization of mobile robots. The utilization of reinforcement learning techniques facilitates the ability of mobile robots to successfully navigate through an environment containing obstacles and effectively plan their path....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Tinglong, Wang, Ming, Zhao, Qianchuan, Zheng, Xuehan, Gao, He
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10604071/
https://www.ncbi.nlm.nih.gov/pubmed/37887612
http://dx.doi.org/10.3390/biomimetics8060481
_version_ 1785126748496592896
author Zhao, Tinglong
Wang, Ming
Zhao, Qianchuan
Zheng, Xuehan
Gao, He
author_facet Zhao, Tinglong
Wang, Ming
Zhao, Qianchuan
Zheng, Xuehan
Gao, He
author_sort Zhao, Tinglong
collection PubMed
description The path planning problem has gained more attention due to the gradual popularization of mobile robots. The utilization of reinforcement learning techniques facilitates the ability of mobile robots to successfully navigate through an environment containing obstacles and effectively plan their path. This is achieved by the robots’ interaction with the environment, even in situations when the environment is unfamiliar. Consequently, we provide a refined deep reinforcement learning algorithm that builds upon the soft actor-critic (SAC) algorithm, incorporating the concept of maximum entropy for the purpose of path planning. The objective of this strategy is to mitigate the constraints inherent in conventional reinforcement learning, enhance the efficacy of the learning process, and accommodate intricate situations. In the context of reinforcement learning, two significant issues arise: inadequate incentives and inefficient sample use during the training phase. To address these challenges, the hindsight experience replay (HER) mechanism has been presented as a potential solution. The HER mechanism aims to enhance algorithm performance by effectively reusing past experiences. Through the utilization of simulation studies, it can be demonstrated that the enhanced algorithm exhibits superior performance in comparison with the pre-existing method.
format Online
Article
Text
id pubmed-10604071
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106040712023-10-28 A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots Zhao, Tinglong Wang, Ming Zhao, Qianchuan Zheng, Xuehan Gao, He Biomimetics (Basel) Article The path planning problem has gained more attention due to the gradual popularization of mobile robots. The utilization of reinforcement learning techniques facilitates the ability of mobile robots to successfully navigate through an environment containing obstacles and effectively plan their path. This is achieved by the robots’ interaction with the environment, even in situations when the environment is unfamiliar. Consequently, we provide a refined deep reinforcement learning algorithm that builds upon the soft actor-critic (SAC) algorithm, incorporating the concept of maximum entropy for the purpose of path planning. The objective of this strategy is to mitigate the constraints inherent in conventional reinforcement learning, enhance the efficacy of the learning process, and accommodate intricate situations. In the context of reinforcement learning, two significant issues arise: inadequate incentives and inefficient sample use during the training phase. To address these challenges, the hindsight experience replay (HER) mechanism has been presented as a potential solution. The HER mechanism aims to enhance algorithm performance by effectively reusing past experiences. Through the utilization of simulation studies, it can be demonstrated that the enhanced algorithm exhibits superior performance in comparison with the pre-existing method. MDPI 2023-10-10 /pmc/articles/PMC10604071/ /pubmed/37887612 http://dx.doi.org/10.3390/biomimetics8060481 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Tinglong
Wang, Ming
Zhao, Qianchuan
Zheng, Xuehan
Gao, He
A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title_full A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title_fullStr A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title_full_unstemmed A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title_short A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots
title_sort path-planning method based on improved soft actor-critic algorithm for mobile robots
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10604071/
https://www.ncbi.nlm.nih.gov/pubmed/37887612
http://dx.doi.org/10.3390/biomimetics8060481
work_keys_str_mv AT zhaotinglong apathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT wangming apathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT zhaoqianchuan apathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT zhengxuehan apathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT gaohe apathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT zhaotinglong pathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT wangming pathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT zhaoqianchuan pathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT zhengxuehan pathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots
AT gaohe pathplanningmethodbasedonimprovedsoftactorcriticalgorithmformobilerobots