Cargando…

Model predictive control of DC/DC boost converter with reinforcement learning

Power electronics is seeing an increase in the use of sophisticated self-learning controllers as single board computers and microcontrollers progress faster. Traditional controllers, such as PI controllers, suffer from transient instability difficulties. The duty cycle and output voltage of a DC/DC...

Descripción completa

Detalles Bibliográficos
Autores principales: Marahatta, Anup, Rajbhandari, Yaju, Shrestha, Ashish, Phuyal, Sudip, Thapa, Anup, Korba, Petr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650005/
https://www.ncbi.nlm.nih.gov/pubmed/36387550
http://dx.doi.org/10.1016/j.heliyon.2022.e11416
_version_ 1784827911892631552
author Marahatta, Anup
Rajbhandari, Yaju
Shrestha, Ashish
Phuyal, Sudip
Thapa, Anup
Korba, Petr
author_facet Marahatta, Anup
Rajbhandari, Yaju
Shrestha, Ashish
Phuyal, Sudip
Thapa, Anup
Korba, Petr
author_sort Marahatta, Anup
collection PubMed
description Power electronics is seeing an increase in the use of sophisticated self-learning controllers as single board computers and microcontrollers progress faster. Traditional controllers, such as PI controllers, suffer from transient instability difficulties. The duty cycle and output voltage of a DC/DC converter are not linear. Due to this non-linearity, the PI controller generates variable levels of voltage fluctuations depending on the operating region of the converter. In some cases, non-linear controllers outperform PI controllers. The non-linear model of a non-linear controller is determined by data availability. So, a self-calibrating controller that collects data and optimizes itself as the operation goes on is necessary. Iteration and oscillation can be minimized with a well-trained reinforcement learning model utilizing a non-linear policy. A boost converter's output power supply capacity changes with a change in load, due to which the maximum duty cycle limit of a converter also changes. A support vector calibrated by reinforcement learning can dynamically change the duty cycle limit of a converter under variable load. This research highlights how reinforcement learning-based non-linear controllers can improve control and efficiency over standard controllers. The proposed concept is based on a microgrid system. Simulation and experimental analysis have been conducted on how reinforcement learning-based controller works for DC-DC boost converter.
format Online
Article
Text
id pubmed-9650005
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-96500052022-11-15 Model predictive control of DC/DC boost converter with reinforcement learning Marahatta, Anup Rajbhandari, Yaju Shrestha, Ashish Phuyal, Sudip Thapa, Anup Korba, Petr Heliyon Research Article Power electronics is seeing an increase in the use of sophisticated self-learning controllers as single board computers and microcontrollers progress faster. Traditional controllers, such as PI controllers, suffer from transient instability difficulties. The duty cycle and output voltage of a DC/DC converter are not linear. Due to this non-linearity, the PI controller generates variable levels of voltage fluctuations depending on the operating region of the converter. In some cases, non-linear controllers outperform PI controllers. The non-linear model of a non-linear controller is determined by data availability. So, a self-calibrating controller that collects data and optimizes itself as the operation goes on is necessary. Iteration and oscillation can be minimized with a well-trained reinforcement learning model utilizing a non-linear policy. A boost converter's output power supply capacity changes with a change in load, due to which the maximum duty cycle limit of a converter also changes. A support vector calibrated by reinforcement learning can dynamically change the duty cycle limit of a converter under variable load. This research highlights how reinforcement learning-based non-linear controllers can improve control and efficiency over standard controllers. The proposed concept is based on a microgrid system. Simulation and experimental analysis have been conducted on how reinforcement learning-based controller works for DC-DC boost converter. Elsevier 2022-11-05 /pmc/articles/PMC9650005/ /pubmed/36387550 http://dx.doi.org/10.1016/j.heliyon.2022.e11416 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Marahatta, Anup
Rajbhandari, Yaju
Shrestha, Ashish
Phuyal, Sudip
Thapa, Anup
Korba, Petr
Model predictive control of DC/DC boost converter with reinforcement learning
title Model predictive control of DC/DC boost converter with reinforcement learning
title_full Model predictive control of DC/DC boost converter with reinforcement learning
title_fullStr Model predictive control of DC/DC boost converter with reinforcement learning
title_full_unstemmed Model predictive control of DC/DC boost converter with reinforcement learning
title_short Model predictive control of DC/DC boost converter with reinforcement learning
title_sort model predictive control of dc/dc boost converter with reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650005/
https://www.ncbi.nlm.nih.gov/pubmed/36387550
http://dx.doi.org/10.1016/j.heliyon.2022.e11416
work_keys_str_mv AT marahattaanup modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning
AT rajbhandariyaju modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning
AT shresthaashish modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning
AT phuyalsudip modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning
AT thapaanup modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning
AT korbapetr modelpredictivecontrolofdcdcboostconverterwithreinforcementlearning