Cargando…
Lane Following Method Based on Improved DDPG Algorithm
In an autonomous vehicle, the lane following algorithm is an important component, which is a basic function of autonomous driving. However, the existing lane following system has a few shortcomings: first, the control method it adopts requires an accurate system model, and different vehicles have di...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309792/ https://www.ncbi.nlm.nih.gov/pubmed/34300567 http://dx.doi.org/10.3390/s21144827 |
_version_ | 1783728606593155072 |
---|---|
author | He, Rui Lv, Haipeng Zhang, Sumin Zhang, Dong Zhang, Hang |
author_facet | He, Rui Lv, Haipeng Zhang, Sumin Zhang, Dong Zhang, Hang |
author_sort | He, Rui |
collection | PubMed |
description | In an autonomous vehicle, the lane following algorithm is an important component, which is a basic function of autonomous driving. However, the existing lane following system has a few shortcomings: first, the control method it adopts requires an accurate system model, and different vehicles have different parameters, which needs a lot of parameter calibration work. The second is that it may fail on road sections where the lateral acceleration requirements of vehicles are large, such as large curves. Third, its decision-making system is defined based on rules, which has disadvantages: it is difficult to formulate; human subjective factors cannot guarantee objectivity; coverage is difficult to guarantee. In recent years, the deep deterministic policy gradient (DDPG) algorithm has been widely used in the field of autonomous driving due to its strong nonlinear fitting ability and generalization performance. However, the DDPG algorithm has overestimated state action values and large cumulative errors, low training efficiency and other issues. Therefore, this paper improves the DDPG algorithm based on the double critic networks and priority experience replay mechanism. Then this paper proposes a lane following method based on this algorithm. Experiment shows that the algorithm can achieve excellent following results under various road conditions. |
format | Online Article Text |
id | pubmed-8309792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83097922021-07-25 Lane Following Method Based on Improved DDPG Algorithm He, Rui Lv, Haipeng Zhang, Sumin Zhang, Dong Zhang, Hang Sensors (Basel) Article In an autonomous vehicle, the lane following algorithm is an important component, which is a basic function of autonomous driving. However, the existing lane following system has a few shortcomings: first, the control method it adopts requires an accurate system model, and different vehicles have different parameters, which needs a lot of parameter calibration work. The second is that it may fail on road sections where the lateral acceleration requirements of vehicles are large, such as large curves. Third, its decision-making system is defined based on rules, which has disadvantages: it is difficult to formulate; human subjective factors cannot guarantee objectivity; coverage is difficult to guarantee. In recent years, the deep deterministic policy gradient (DDPG) algorithm has been widely used in the field of autonomous driving due to its strong nonlinear fitting ability and generalization performance. However, the DDPG algorithm has overestimated state action values and large cumulative errors, low training efficiency and other issues. Therefore, this paper improves the DDPG algorithm based on the double critic networks and priority experience replay mechanism. Then this paper proposes a lane following method based on this algorithm. Experiment shows that the algorithm can achieve excellent following results under various road conditions. MDPI 2021-07-15 /pmc/articles/PMC8309792/ /pubmed/34300567 http://dx.doi.org/10.3390/s21144827 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article He, Rui Lv, Haipeng Zhang, Sumin Zhang, Dong Zhang, Hang Lane Following Method Based on Improved DDPG Algorithm |
title | Lane Following Method Based on Improved DDPG Algorithm |
title_full | Lane Following Method Based on Improved DDPG Algorithm |
title_fullStr | Lane Following Method Based on Improved DDPG Algorithm |
title_full_unstemmed | Lane Following Method Based on Improved DDPG Algorithm |
title_short | Lane Following Method Based on Improved DDPG Algorithm |
title_sort | lane following method based on improved ddpg algorithm |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309792/ https://www.ncbi.nlm.nih.gov/pubmed/34300567 http://dx.doi.org/10.3390/s21144827 |
work_keys_str_mv | AT herui lanefollowingmethodbasedonimprovedddpgalgorithm AT lvhaipeng lanefollowingmethodbasedonimprovedddpgalgorithm AT zhangsumin lanefollowingmethodbasedonimprovedddpgalgorithm AT zhangdong lanefollowingmethodbasedonimprovedddpgalgorithm AT zhanghang lanefollowingmethodbasedonimprovedddpgalgorithm |