Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions †
Reinforcement learning (RL) methods for energy saving and greening have recently appeared in the field of autonomous driving. In inter-vehicle communication (IVC), a feasible and increasingly popular research direction of RL is to obtain the optimal action decision of agents in a special environment...
Main authors: | Gu, Zhiru; Liu, Zhongwei; Wang, Qi; Mao, Qiyun; Shuai, Zhikang; Ma, Ziji |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144777/ https://www.ncbi.nlm.nih.gov/pubmed/37112514 http://dx.doi.org/10.3390/s23084176 |
_version_ | 1785034175035736064 |
---|---|
author | Gu, Zhiru Liu, Zhongwei Wang, Qi Mao, Qiyun Shuai, Zhikang Ma, Ziji |
author_sort | Gu, Zhiru |
collection | PubMed |
description | Reinforcement learning (RL) methods for energy saving and greening have recently appeared in the field of autonomous driving. In inter-vehicle communication (IVC), a feasible and increasingly popular research direction of RL is to obtain the optimal action decision of agents in a special environment. This paper presents the application of reinforcement learning in the vehicle communication simulation framework (Veins). In this research, we explore the application of reinforcement learning algorithms in a green cooperative adaptive cruise control (CACC) platoon. Our aim is to train member vehicles to react appropriately in the event of a severe collision involving the leading vehicle. We seek to reduce collision damage and optimize energy consumption by encouraging behavior that conforms to the platoon’s environmentally friendly aim. Our study provides insight into the potential benefits of using reinforcement learning algorithms to improve the safety and efficiency of CACC platoons while promoting sustainable transportation. The policy gradient algorithm used in this paper has good convergence in the calculation of the minimum energy consumption problem and the optimal solution of vehicle behavior. In terms of energy consumption metrics, the policy gradient algorithm is used first in the IVC field for training the proposed platoon problem. It is a feasible training decision-planning algorithm for solving the minimization of energy consumption caused by decision making in platoon avoidance behavior. |
format | Online Article Text |
id | pubmed-10144777 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-10144777 2023-04-29 Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions † Gu, Zhiru Liu, Zhongwei Wang, Qi Mao, Qiyun Shuai, Zhikang Ma, Ziji Sensors (Basel) Article MDPI 2023-04-21 /pmc/articles/PMC10144777/ /pubmed/37112514 http://dx.doi.org/10.3390/s23084176 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
title | Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions † |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144777/ https://www.ncbi.nlm.nih.gov/pubmed/37112514 http://dx.doi.org/10.3390/s23084176 |