Cargando…

Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances

In this paper, the safe optimal control method for continuous-time (CT) nonlinear safety-critical systems with asymmetric input constraints and unmatched disturbances based on the adaptive dynamic programming (ADP) is investigated. Initially, a new non-quadratic form function is implemented to effec...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Chunbin, Jiang, Kaijun, Zhang, Jishi, Zhu, Tianzeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378920/
https://www.ncbi.nlm.nih.gov/pubmed/37510048
http://dx.doi.org/10.3390/e25071101
_version_ 1785079885664878592
author Qin, Chunbin
Jiang, Kaijun
Zhang, Jishi
Zhu, Tianzeng
author_facet Qin, Chunbin
Jiang, Kaijun
Zhang, Jishi
Zhu, Tianzeng
author_sort Qin, Chunbin
collection PubMed
description In this paper, the safe optimal control method for continuous-time (CT) nonlinear safety-critical systems with asymmetric input constraints and unmatched disturbances based on the adaptive dynamic programming (ADP) is investigated. Initially, a new non-quadratic form function is implemented to effectively handle the asymmetric input constraints. Subsequently, the safe optimal control problem is transformed into a two-player zero-sum game (ZSG) problem to suppress the influence of unmatched disturbances, and a new Hamilton–Jacobi–Isaacs (HJI) equation is introduced by integrating the control barrier function (CBF) with the cost function to penalize unsafe behavior. Moreover, a damping factor is embedded in the CBF to balance safety and optimality. To obtain a safe optimal controller, only one critic neural network (CNN) is utilized to tackle the complex HJI equation, leading to a decreased computational load in contrast to the utilization of the conventional actor–critic network. Then, the system state and the parameters of the CNN are uniformly ultimately bounded (UUB) through the application of the Lyapunov stability method. Lastly, two examples are presented to confirm the efficacy of the presented approach.
format Online
Article
Text
id pubmed-10378920
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103789202023-07-29 Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances Qin, Chunbin Jiang, Kaijun Zhang, Jishi Zhu, Tianzeng Entropy (Basel) Article In this paper, the safe optimal control method for continuous-time (CT) nonlinear safety-critical systems with asymmetric input constraints and unmatched disturbances based on the adaptive dynamic programming (ADP) is investigated. Initially, a new non-quadratic form function is implemented to effectively handle the asymmetric input constraints. Subsequently, the safe optimal control problem is transformed into a two-player zero-sum game (ZSG) problem to suppress the influence of unmatched disturbances, and a new Hamilton–Jacobi–Isaacs (HJI) equation is introduced by integrating the control barrier function (CBF) with the cost function to penalize unsafe behavior. Moreover, a damping factor is embedded in the CBF to balance safety and optimality. To obtain a safe optimal controller, only one critic neural network (CNN) is utilized to tackle the complex HJI equation, leading to a decreased computational load in contrast to the utilization of the conventional actor–critic network. Then, the system state and the parameters of the CNN are uniformly ultimately bounded (UUB) through the application of the Lyapunov stability method. Lastly, two examples are presented to confirm the efficacy of the presented approach. MDPI 2023-07-24 /pmc/articles/PMC10378920/ /pubmed/37510048 http://dx.doi.org/10.3390/e25071101 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Qin, Chunbin
Jiang, Kaijun
Zhang, Jishi
Zhu, Tianzeng
Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title_full Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title_fullStr Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title_full_unstemmed Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title_short Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances
title_sort critic learning-based safe optimal control for nonlinear systems with asymmetric input constraints and unmatched disturbances
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378920/
https://www.ncbi.nlm.nih.gov/pubmed/37510048
http://dx.doi.org/10.3390/e25071101
work_keys_str_mv AT qinchunbin criticlearningbasedsafeoptimalcontrolfornonlinearsystemswithasymmetricinputconstraintsandunmatcheddisturbances
AT jiangkaijun criticlearningbasedsafeoptimalcontrolfornonlinearsystemswithasymmetricinputconstraintsandunmatcheddisturbances
AT zhangjishi criticlearningbasedsafeoptimalcontrolfornonlinearsystemswithasymmetricinputconstraintsandunmatcheddisturbances
AT zhutianzeng criticlearningbasedsafeoptimalcontrolfornonlinearsystemswithasymmetricinputconstraintsandunmatcheddisturbances