Cargando…
A self-learning Monte Carlo tree search algorithm for robot path planning
This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/ https://www.ncbi.nlm.nih.gov/pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644 |
_version_ | 1785075640349753344 |
---|---|
author | Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue |
author_facet | Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue |
author_sort | Li, Wei |
collection | PubMed |
description | This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks. |
format | Online Article Text |
id | pubmed-10358331 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103583312023-07-21 A self-learning Monte Carlo tree search algorithm for robot path planning Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue Front Neurorobot Neuroscience This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks. Frontiers Media S.A. 2023-07-06 /pmc/articles/PMC10358331/ /pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644 Text en Copyright © 2023 Li, Liu, Ma, Xu, Qiu and Gan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue A self-learning Monte Carlo tree search algorithm for robot path planning |
title | A self-learning Monte Carlo tree search algorithm for robot path planning |
title_full | A self-learning Monte Carlo tree search algorithm for robot path planning |
title_fullStr | A self-learning Monte Carlo tree search algorithm for robot path planning |
title_full_unstemmed | A self-learning Monte Carlo tree search algorithm for robot path planning |
title_short | A self-learning Monte Carlo tree search algorithm for robot path planning |
title_sort | self-learning monte carlo tree search algorithm for robot path planning |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/ https://www.ncbi.nlm.nih.gov/pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644 |
work_keys_str_mv | AT liwei aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liuyi aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT mayan aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT xukang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT qiujiang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT ganzhongxue aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liwei selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liuyi selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT mayan selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT xukang selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT qiujiang selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT ganzhongxue selflearningmontecarlotreesearchalgorithmforrobotpathplanning |