Cargando…

Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System

Reinforcement learning (RL) is a promising direction in automated parking systems (APSs), as integrating planning and tracking control using RL can potentially maximize the overall performance. However, commonly used model-free RL requires many interactions to achieve acceptable performance, and mod...

Descripción completa

Detalles Bibliográficos
Autores principales:	Song, Shaoyu, Chen, Hui, Sun, Hongwei, Liu, Meicen
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7766926/ https://www.ncbi.nlm.nih.gov/pubmed/33353153 http://dx.doi.org/10.3390/s20247297

_version_	1783628835560882176
author	Song, Shaoyu Chen, Hui Sun, Hongwei Liu, Meicen
author_facet	Song, Shaoyu Chen, Hui Sun, Hongwei Liu, Meicen
author_sort	Song, Shaoyu
collection	PubMed
description	Reinforcement learning (RL) is a promising direction in automated parking systems (APSs), as integrating planning and tracking control using RL can potentially maximize the overall performance. However, commonly used model-free RL requires many interactions to achieve acceptable performance, and model-based RL in APS cannot continuously learn. In this paper, a data-efficient RL method is constructed to learn from data by use of a model-based method. The proposed method uses a truncated Monte Carlo tree search to evaluate parking states and select moves. Two artificial neural networks are trained to provide the search probability of each tree branch and the final reward for each state using self-trained data. The data efficiency is enhanced by weighting exploration with parking trajectory returns, an adaptive exploration scheme, and experience augmentation with imaginary rollouts. Without human demonstrations, a novel training pipeline is also used to train the initial action guidance network and the state value network. Compared with path planning and path-following methods, the proposed integrated method can flexibly co-ordinate the longitudinal and lateral motion to park a smaller parking space in one maneuver. Its adaptability to changes in the vehicle model is verified by joint Carsim and MATLAB simulation, demonstrating that the algorithm converges within a few iterations. Finally, experiments using a real vehicle platform are used to further verify the effectiveness of the proposed method. Compared with obtaining rewards using simulation, the proposed method achieves a better final parking attitude and success rate.
format	Online Article Text
id	pubmed-7766926
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77669262020-12-28 Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System Song, Shaoyu Chen, Hui Sun, Hongwei Liu, Meicen Sensors (Basel) Article Reinforcement learning (RL) is a promising direction in automated parking systems (APSs), as integrating planning and tracking control using RL can potentially maximize the overall performance. However, commonly used model-free RL requires many interactions to achieve acceptable performance, and model-based RL in APS cannot continuously learn. In this paper, a data-efficient RL method is constructed to learn from data by use of a model-based method. The proposed method uses a truncated Monte Carlo tree search to evaluate parking states and select moves. Two artificial neural networks are trained to provide the search probability of each tree branch and the final reward for each state using self-trained data. The data efficiency is enhanced by weighting exploration with parking trajectory returns, an adaptive exploration scheme, and experience augmentation with imaginary rollouts. Without human demonstrations, a novel training pipeline is also used to train the initial action guidance network and the state value network. Compared with path planning and path-following methods, the proposed integrated method can flexibly co-ordinate the longitudinal and lateral motion to park a smaller parking space in one maneuver. Its adaptability to changes in the vehicle model is verified by joint Carsim and MATLAB simulation, demonstrating that the algorithm converges within a few iterations. Finally, experiments using a real vehicle platform are used to further verify the effectiveness of the proposed method. Compared with obtaining rewards using simulation, the proposed method achieves a better final parking attitude and success rate. MDPI 2020-12-18 /pmc/articles/PMC7766926/ /pubmed/33353153 http://dx.doi.org/10.3390/s20247297 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Song, Shaoyu Chen, Hui Sun, Hongwei Liu, Meicen Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title	Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title_full	Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title_fullStr	Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title_full_unstemmed	Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title_short	Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System
title_sort	data efficient reinforcement learning for integrated lateral planning and control in automated parking system
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7766926/ https://www.ncbi.nlm.nih.gov/pubmed/33353153 http://dx.doi.org/10.3390/s20247297
work_keys_str_mv	AT songshaoyu dataefficientreinforcementlearningforintegratedlateralplanningandcontrolinautomatedparkingsystem AT chenhui dataefficientreinforcementlearningforintegratedlateralplanningandcontrolinautomatedparkingsystem AT sunhongwei dataefficientreinforcementlearningforintegratedlateralplanningandcontrolinautomatedparkingsystem AT liumeicen dataefficientreinforcementlearningforintegratedlateralplanningandcontrolinautomatedparkingsystem

Data Efficient Reinforcement Learning for Integrated Lateral Planning and Control in Automated Parking System

Ejemplares similares