Cargando…

Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces

This paper proposes a vehicle-parking trajectory planning method that addresses the issues of a long trajectory planning time and difficult training convergence during automatic parking. The process involves two stages: finding a parking space and parking planning. The first stage uses model predict...

Descripción completa

Detalles Bibliográficos
Autores principales:	Shi, Junren, Li, Kexin, Piao, Changhao, Gao, Jun, Chen, Lizhi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10458430/ https://www.ncbi.nlm.nih.gov/pubmed/37631658 http://dx.doi.org/10.3390/s23167124

_version_	1785097163733204992
author	Shi, Junren Li, Kexin Piao, Changhao Gao, Jun Chen, Lizhi
author_facet	Shi, Junren Li, Kexin Piao, Changhao Gao, Jun Chen, Lizhi
author_sort	Shi, Junren
collection	PubMed
description	This paper proposes a vehicle-parking trajectory planning method that addresses the issues of a long trajectory planning time and difficult training convergence during automatic parking. The process involves two stages: finding a parking space and parking planning. The first stage uses model predictive control (MPC) for trajectory tracking from the initial position of the vehicle to the starting point of the parking operation. The second stage employs the proximal policy optimization (PPO) algorithm to transform the parking behavior into a reinforcement learning process. A four-dimensional reward function is set to evaluate the strategy based on a formal reward, guiding the adjustment of neural network parameters and reducing the exploration of invalid actions. Finally, a simulation environment is built for the parking scene, and a network framework is designed. The proposed method is compared with the deep deterministic policy gradient and double-delay deep deterministic policy gradient algorithms in the same scene. Results confirm that the MPC controller accurately performs trajectory-tracking control with minimal steering wheel angle changes and smooth, continuous movement. The PPO-based reinforcement learning method achieves shorter learning times, totaling only 30% and 37.5% of the deep deterministic policy gradient (DDPG) and twin-delayed deep deterministic policy gradient (TD3), and the number of iterations to reach convergence for the PPO algorithm with the introduction of the four-dimensional evaluation metrics is 75% and 68% shorter compared to the DDPG and TD3 algorithms, respectively. This study demonstrates the effectiveness of the proposed method in addressing a slow convergence and long training times in parking trajectory planning, improving parking timeliness.
format	Online Article Text
id	pubmed-10458430
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-104584302023-08-27 Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces Shi, Junren Li, Kexin Piao, Changhao Gao, Jun Chen, Lizhi Sensors (Basel) Article This paper proposes a vehicle-parking trajectory planning method that addresses the issues of a long trajectory planning time and difficult training convergence during automatic parking. The process involves two stages: finding a parking space and parking planning. The first stage uses model predictive control (MPC) for trajectory tracking from the initial position of the vehicle to the starting point of the parking operation. The second stage employs the proximal policy optimization (PPO) algorithm to transform the parking behavior into a reinforcement learning process. A four-dimensional reward function is set to evaluate the strategy based on a formal reward, guiding the adjustment of neural network parameters and reducing the exploration of invalid actions. Finally, a simulation environment is built for the parking scene, and a network framework is designed. The proposed method is compared with the deep deterministic policy gradient and double-delay deep deterministic policy gradient algorithms in the same scene. Results confirm that the MPC controller accurately performs trajectory-tracking control with minimal steering wheel angle changes and smooth, continuous movement. The PPO-based reinforcement learning method achieves shorter learning times, totaling only 30% and 37.5% of the deep deterministic policy gradient (DDPG) and twin-delayed deep deterministic policy gradient (TD3), and the number of iterations to reach convergence for the PPO algorithm with the introduction of the four-dimensional evaluation metrics is 75% and 68% shorter compared to the DDPG and TD3 algorithms, respectively. This study demonstrates the effectiveness of the proposed method in addressing a slow convergence and long training times in parking trajectory planning, improving parking timeliness. MDPI 2023-08-11 /pmc/articles/PMC10458430/ /pubmed/37631658 http://dx.doi.org/10.3390/s23167124 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Shi, Junren Li, Kexin Piao, Changhao Gao, Jun Chen, Lizhi Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title	Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title_full	Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title_fullStr	Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title_full_unstemmed	Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title_short	Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces
title_sort	model-based predictive control and reinforcement learning for planning vehicle-parking trajectories for vertical parking spaces
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10458430/ https://www.ncbi.nlm.nih.gov/pubmed/37631658 http://dx.doi.org/10.3390/s23167124
work_keys_str_mv	AT shijunren modelbasedpredictivecontrolandreinforcementlearningforplanningvehicleparkingtrajectoriesforverticalparkingspaces AT likexin modelbasedpredictivecontrolandreinforcementlearningforplanningvehicleparkingtrajectoriesforverticalparkingspaces AT piaochanghao modelbasedpredictivecontrolandreinforcementlearningforplanningvehicleparkingtrajectoriesforverticalparkingspaces AT gaojun modelbasedpredictivecontrolandreinforcementlearningforplanningvehicleparkingtrajectoriesforverticalparkingspaces AT chenlizhi modelbasedpredictivecontrolandreinforcementlearningforplanningvehicleparkingtrajectoriesforverticalparkingspaces

Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces

Ejemplares similares