Cargando…

A self-learning Monte Carlo tree search algorithm for robot path planning

This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wei, Liu, Yi, Ma, Yan, Xu, Kang, Qiu, Jiang, Gan, Zhongxue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/
https://www.ncbi.nlm.nih.gov/pubmed/37483541
http://dx.doi.org/10.3389/fnbot.2023.1039644
_version_ 1785075640349753344
author Li, Wei
Liu, Yi
Ma, Yan
Xu, Kang
Qiu, Jiang
Gan, Zhongxue
author_facet Li, Wei
Liu, Yi
Ma, Yan
Xu, Kang
Qiu, Jiang
Gan, Zhongxue
author_sort Li, Wei
collection PubMed
description This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks.
format Online
Article
Text
id pubmed-10358331
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103583312023-07-21 A self-learning Monte Carlo tree search algorithm for robot path planning Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue Front Neurorobot Neuroscience This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks. Frontiers Media S.A. 2023-07-06 /pmc/articles/PMC10358331/ /pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644 Text en Copyright © 2023 Li, Liu, Ma, Xu, Qiu and Gan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Li, Wei
Liu, Yi
Ma, Yan
Xu, Kang
Qiu, Jiang
Gan, Zhongxue
A self-learning Monte Carlo tree search algorithm for robot path planning
title A self-learning Monte Carlo tree search algorithm for robot path planning
title_full A self-learning Monte Carlo tree search algorithm for robot path planning
title_fullStr A self-learning Monte Carlo tree search algorithm for robot path planning
title_full_unstemmed A self-learning Monte Carlo tree search algorithm for robot path planning
title_short A self-learning Monte Carlo tree search algorithm for robot path planning
title_sort self-learning monte carlo tree search algorithm for robot path planning
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/
https://www.ncbi.nlm.nih.gov/pubmed/37483541
http://dx.doi.org/10.3389/fnbot.2023.1039644
work_keys_str_mv AT liwei aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT liuyi aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT mayan aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT xukang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT qiujiang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT ganzhongxue aselflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT liwei selflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT liuyi selflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT mayan selflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT xukang selflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT qiujiang selflearningmontecarlotreesearchalgorithmforrobotpathplanning
AT ganzhongxue selflearningmontecarlotreesearchalgorithmforrobotpathplanning