Cargando…

Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm

Deep Reinforcement Learning (DRL) enables agents to make decisions based on a well-designed reward function that suites a particular environment without any prior knowledge related to a given environment. The adaptation of hyperparameters has a great impact on the overall learning process and the le...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ashraf, Nesma M., Mostafa, Reham R., Sakr, Rasha H., Rashad, M. Z.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191943/ https://www.ncbi.nlm.nih.gov/pubmed/34111168 http://dx.doi.org/10.1371/journal.pone.0252754

_version_	1783705952334118912
author	Ashraf, Nesma M. Mostafa, Reham R. Sakr, Rasha H. Rashad, M. Z.
author_facet	Ashraf, Nesma M. Mostafa, Reham R. Sakr, Rasha H. Rashad, M. Z.
author_sort	Ashraf, Nesma M.
collection	PubMed
description	Deep Reinforcement Learning (DRL) enables agents to make decisions based on a well-designed reward function that suites a particular environment without any prior knowledge related to a given environment. The adaptation of hyperparameters has a great impact on the overall learning process and the learning processing times. Hyperparameters should be accurately estimated while training DRL algorithms, which is one of the key challenges that we attempt to address. This paper employs a swarm-based optimization algorithm, namely the Whale Optimization Algorithm (WOA), for optimizing the hyperparameters of the Deep Deterministic Policy Gradient (DDPG) algorithm to achieve the optimum control strategy in an autonomous driving control problem. DDPG is capable of handling complex environments, which contain continuous spaces for actions. To evaluate the proposed algorithm, the Open Racing Car Simulator (TORCS), a realistic autonomous driving simulation environment, was chosen to its ease of design and implementation. Using TORCS, the DDPG agent with optimized hyperparameters was compared with a DDPG agent with reference hyperparameters. The experimental results showed that the DDPG’s hyperparameters optimization leads to maximizing the total rewards, along with testing episodes and maintaining a stable driving policy.
format	Online Article Text
id	pubmed-8191943
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-81919432021-06-10 Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm Ashraf, Nesma M. Mostafa, Reham R. Sakr, Rasha H. Rashad, M. Z. PLoS One Research Article Deep Reinforcement Learning (DRL) enables agents to make decisions based on a well-designed reward function that suites a particular environment without any prior knowledge related to a given environment. The adaptation of hyperparameters has a great impact on the overall learning process and the learning processing times. Hyperparameters should be accurately estimated while training DRL algorithms, which is one of the key challenges that we attempt to address. This paper employs a swarm-based optimization algorithm, namely the Whale Optimization Algorithm (WOA), for optimizing the hyperparameters of the Deep Deterministic Policy Gradient (DDPG) algorithm to achieve the optimum control strategy in an autonomous driving control problem. DDPG is capable of handling complex environments, which contain continuous spaces for actions. To evaluate the proposed algorithm, the Open Racing Car Simulator (TORCS), a realistic autonomous driving simulation environment, was chosen to its ease of design and implementation. Using TORCS, the DDPG agent with optimized hyperparameters was compared with a DDPG agent with reference hyperparameters. The experimental results showed that the DDPG’s hyperparameters optimization leads to maximizing the total rewards, along with testing episodes and maintaining a stable driving policy. Public Library of Science 2021-06-10 /pmc/articles/PMC8191943/ /pubmed/34111168 http://dx.doi.org/10.1371/journal.pone.0252754 Text en © 2021 Ashraf et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Ashraf, Nesma M. Mostafa, Reham R. Sakr, Rasha H. Rashad, M. Z. Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title	Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title_full	Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title_fullStr	Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title_full_unstemmed	Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title_short	Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
title_sort	optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8191943/ https://www.ncbi.nlm.nih.gov/pubmed/34111168 http://dx.doi.org/10.1371/journal.pone.0252754
work_keys_str_mv	AT ashrafnesmam optimizinghyperparametersofdeepreinforcementlearningforautonomousdrivingbasedonwhaleoptimizationalgorithm AT mostafarehamr optimizinghyperparametersofdeepreinforcementlearningforautonomousdrivingbasedonwhaleoptimizationalgorithm AT sakrrashah optimizinghyperparametersofdeepreinforcementlearningforautonomousdrivingbasedonwhaleoptimizationalgorithm AT rashadmz optimizinghyperparametersofdeepreinforcementlearningforautonomousdrivingbasedonwhaleoptimizationalgorithm

Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm

Ejemplares similares