Cargando…

Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments

In this paper, we propose a deep deterministic policy gradient (DDPG)-based path-planning method for mobile robots by applying the hindsight experience replay (HER) technique to overcome the performance degradation resulting from sparse reward problems occurring in autonomous driving mobile robots....

Descripción completa

Detalles Bibliográficos
Autores principales:	Park, Minjae, Lee, Seok Young, Hong, Jin Seok, Kwon, Nam Kyu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9787388/ https://www.ncbi.nlm.nih.gov/pubmed/36559941 http://dx.doi.org/10.3390/s22249574

_version_	1784858499407151104
author	Park, Minjae Lee, Seok Young Hong, Jin Seok Kwon, Nam Kyu
author_facet	Park, Minjae Lee, Seok Young Hong, Jin Seok Kwon, Nam Kyu
author_sort	Park, Minjae
collection	PubMed
description	In this paper, we propose a deep deterministic policy gradient (DDPG)-based path-planning method for mobile robots by applying the hindsight experience replay (HER) technique to overcome the performance degradation resulting from sparse reward problems occurring in autonomous driving mobile robots. The mobile robot in our analysis was a robot operating system-based TurtleBot3, and the experimental environment was a virtual simulation based on Gazebo. A fully connected neural network was used as the DDPG network based on the actor–critic architecture. Noise was added to the actor network. The robot recognized an unknown environment by measuring distances using a laser sensor and determined the optimized policy to reach its destination. The HER technique improved the learning performance by generating three new episodes with normal experience from a failed episode. The proposed method demonstrated that the HER technique could help mitigate the sparse reward problem; this was further corroborated by the successful autonomous driving results obtained after applying the proposed method to two reward systems, as well as actual experimental results.
format	Online Article Text
id	pubmed-9787388
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-97873882022-12-24 Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments Park, Minjae Lee, Seok Young Hong, Jin Seok Kwon, Nam Kyu Sensors (Basel) Article In this paper, we propose a deep deterministic policy gradient (DDPG)-based path-planning method for mobile robots by applying the hindsight experience replay (HER) technique to overcome the performance degradation resulting from sparse reward problems occurring in autonomous driving mobile robots. The mobile robot in our analysis was a robot operating system-based TurtleBot3, and the experimental environment was a virtual simulation based on Gazebo. A fully connected neural network was used as the DDPG network based on the actor–critic architecture. Noise was added to the actor network. The robot recognized an unknown environment by measuring distances using a laser sensor and determined the optimized policy to reach its destination. The HER technique improved the learning performance by generating three new episodes with normal experience from a failed episode. The proposed method demonstrated that the HER technique could help mitigate the sparse reward problem; this was further corroborated by the successful autonomous driving results obtained after applying the proposed method to two reward systems, as well as actual experimental results. MDPI 2022-12-07 /pmc/articles/PMC9787388/ /pubmed/36559941 http://dx.doi.org/10.3390/s22249574 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Park, Minjae Lee, Seok Young Hong, Jin Seok Kwon, Nam Kyu Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title	Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title_full	Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title_fullStr	Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title_full_unstemmed	Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title_short	Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments
title_sort	deep deterministic policy gradient-based autonomous driving for mobile robots in sparse reward environments
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9787388/ https://www.ncbi.nlm.nih.gov/pubmed/36559941 http://dx.doi.org/10.3390/s22249574
work_keys_str_mv	AT parkminjae deepdeterministicpolicygradientbasedautonomousdrivingformobilerobotsinsparserewardenvironments AT leeseokyoung deepdeterministicpolicygradientbasedautonomousdrivingformobilerobotsinsparserewardenvironments AT hongjinseok deepdeterministicpolicygradientbasedautonomousdrivingformobilerobotsinsparserewardenvironments AT kwonnamkyu deepdeterministicpolicygradientbasedautonomousdrivingformobilerobotsinsparserewardenvironments

Deep Deterministic Policy Gradient-Based Autonomous Driving for Mobile Robots in Sparse Reward Environments

Ejemplares similares