Cargando…

A self-learning Monte Carlo tree search algorithm for robot path planning

This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Wei, Liu, Yi, Ma, Yan, Xu, Kang, Qiu, Jiang, Gan, Zhongxue
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/ https://www.ncbi.nlm.nih.gov/pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644

_version_	1785075640349753344
author	Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue
author_facet	Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue
author_sort	Li, Wei
collection	PubMed
description	This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks.
format	Online Article Text
id	pubmed-10358331
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-103583312023-07-21 A self-learning Monte Carlo tree search algorithm for robot path planning Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue Front Neurorobot Neuroscience This paper proposes a self-learning Monte Carlo tree search algorithm (SL-MCTS), which has the ability to continuously improve its problem-solving ability in single-player scenarios. SL-MCTS combines the MCTS algorithm with a two-branch neural network (PV-Network). The MCTS architecture can balance the search for exploration and exploitation. PV-Network replaces the rollout process of MCTS and predicts the promising search direction and the value of nodes, which increases the MCTS convergence speed and search efficiency. The paper proposes an effective method to assess the trajectory of the current model during the self-learning process by comparing the performance of the current model with that of its best-performing historical model. Additionally, this method can encourage SL-MCTS to generate optimal solutions during the self-learning process. We evaluate the performance of SL-MCTS on the robot path planning scenario. The experimental results show that the performance of SL-MCTS is far superior to the traditional MCTS and single-player MCTS algorithms in terms of path quality and time consumption, especially its time consumption is half less than that of the traditional MCTS algorithms. SL-MCTS also performs comparably to other iterative-based search algorithms designed specifically for path planning tasks. Frontiers Media S.A. 2023-07-06 /pmc/articles/PMC10358331/ /pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644 Text en Copyright © 2023 Li, Liu, Ma, Xu, Qiu and Gan. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Li, Wei Liu, Yi Ma, Yan Xu, Kang Qiu, Jiang Gan, Zhongxue A self-learning Monte Carlo tree search algorithm for robot path planning
title	A self-learning Monte Carlo tree search algorithm for robot path planning
title_full	A self-learning Monte Carlo tree search algorithm for robot path planning
title_fullStr	A self-learning Monte Carlo tree search algorithm for robot path planning
title_full_unstemmed	A self-learning Monte Carlo tree search algorithm for robot path planning
title_short	A self-learning Monte Carlo tree search algorithm for robot path planning
title_sort	self-learning monte carlo tree search algorithm for robot path planning
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10358331/ https://www.ncbi.nlm.nih.gov/pubmed/37483541 http://dx.doi.org/10.3389/fnbot.2023.1039644
work_keys_str_mv	AT liwei aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liuyi aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT mayan aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT xukang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT qiujiang aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT ganzhongxue aselflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liwei selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT liuyi selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT mayan selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT xukang selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT qiujiang selflearningmontecarlotreesearchalgorithmforrobotpathplanning AT ganzhongxue selflearningmontecarlotreesearchalgorithmforrobotpathplanning

A self-learning Monte Carlo tree search algorithm for robot path planning

Ejemplares similares