Cargando…

Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes

Modern active distribution networks (ADNs) witness increasing complexities that require efforts in control practices, including optimal reactive power dispatch (ORPD). Deep reinforcement learning (DRL) is proposed to manage the network’s reactive power by coordinating different resources, including...

Descripción completa

Detalles Bibliográficos
Autores principales: Zamzam, Tassneem, Shaban, Khaled, Massoud, Ahmed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10459375/
https://www.ncbi.nlm.nih.gov/pubmed/37631753
http://dx.doi.org/10.3390/s23167216
_version_ 1785097396330430464
author Zamzam, Tassneem
Shaban, Khaled
Massoud, Ahmed
author_facet Zamzam, Tassneem
Shaban, Khaled
Massoud, Ahmed
author_sort Zamzam, Tassneem
collection PubMed
description Modern active distribution networks (ADNs) witness increasing complexities that require efforts in control practices, including optimal reactive power dispatch (ORPD). Deep reinforcement learning (DRL) is proposed to manage the network’s reactive power by coordinating different resources, including distributed energy resources, to enhance performance. However, there is a lack of studies examining DRL elements’ performance sensitivity. To this end, in this paper we examine the impact of various DRL reward representations and hyperparameters on the agent’s learning performance when solving the ORPD problem for ADNs. We assess the agent’s performance regarding accuracy and training time metrics, as well as critic estimate measures. Furthermore, different environmental changes are examined to study the DRL model’s scalability by including other resources. Results show that compared to other representations, the complementary reward function exhibits improved performance in terms of power loss minimization and convergence time by 10–15% and 14–18%, respectively. Also, adequate agent performance is observed to be neighboring the best-suited value of each hyperparameter for the studied problem. In addition, scalability analysis depicts that increasing the number of possible action combinations in the action space by approximately nine times results in 1.7 times increase in the training time.
format Online
Article
Text
id pubmed-10459375
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-104593752023-08-27 Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes Zamzam, Tassneem Shaban, Khaled Massoud, Ahmed Sensors (Basel) Article Modern active distribution networks (ADNs) witness increasing complexities that require efforts in control practices, including optimal reactive power dispatch (ORPD). Deep reinforcement learning (DRL) is proposed to manage the network’s reactive power by coordinating different resources, including distributed energy resources, to enhance performance. However, there is a lack of studies examining DRL elements’ performance sensitivity. To this end, in this paper we examine the impact of various DRL reward representations and hyperparameters on the agent’s learning performance when solving the ORPD problem for ADNs. We assess the agent’s performance regarding accuracy and training time metrics, as well as critic estimate measures. Furthermore, different environmental changes are examined to study the DRL model’s scalability by including other resources. Results show that compared to other representations, the complementary reward function exhibits improved performance in terms of power loss minimization and convergence time by 10–15% and 14–18%, respectively. Also, adequate agent performance is observed to be neighboring the best-suited value of each hyperparameter for the studied problem. In addition, scalability analysis depicts that increasing the number of possible action combinations in the action space by approximately nine times results in 1.7 times increase in the training time. MDPI 2023-08-17 /pmc/articles/PMC10459375/ /pubmed/37631753 http://dx.doi.org/10.3390/s23167216 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zamzam, Tassneem
Shaban, Khaled
Massoud, Ahmed
Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title_full Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title_fullStr Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title_full_unstemmed Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title_short Optimal Reactive Power Dispatch in ADNs using DRL and the Impact of Its Various Settings and Environmental Changes
title_sort optimal reactive power dispatch in adns using drl and the impact of its various settings and environmental changes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10459375/
https://www.ncbi.nlm.nih.gov/pubmed/37631753
http://dx.doi.org/10.3390/s23167216
work_keys_str_mv AT zamzamtassneem optimalreactivepowerdispatchinadnsusingdrlandtheimpactofitsvarioussettingsandenvironmentalchanges
AT shabankhaled optimalreactivepowerdispatchinadnsusingdrlandtheimpactofitsvarioussettingsandenvironmentalchanges
AT massoudahmed optimalreactivepowerdispatchinadnsusingdrlandtheimpactofitsvarioussettingsandenvironmentalchanges