Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions †

Reinforcement learning (RL) methods for energy saving and greening have recently appeared in the field of autonomous driving. In inter-vehicle communication (IVC), a feasible and increasingly popular research direction of RL is to obtain the optimal action decision of agents in a special environment...

Bibliographic Details
Main authors: Gu, Zhiru, Liu, Zhongwei, Wang, Qi, Mao, Qiyun, Shuai, Zhikang, Ma, Ziji
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144777/
https://www.ncbi.nlm.nih.gov/pubmed/37112514
http://dx.doi.org/10.3390/s23084176
author Gu, Zhiru
Liu, Zhongwei
Wang, Qi
Mao, Qiyun
Shuai, Zhikang
Ma, Ziji
collection PubMed
description Reinforcement learning (RL) methods for energy saving and greening have recently appeared in the field of autonomous driving. In inter-vehicle communication (IVC), a feasible and increasingly popular research direction of RL is to obtain the optimal action decision of agents in a special environment. This paper presents the application of reinforcement learning in the vehicle communication simulation framework (Veins). In this research, we explore the application of reinforcement learning algorithms in a green cooperative adaptive cruise control (CACC) platoon. Our aim is to train member vehicles to react appropriately in the event of a severe collision involving the leading vehicle. We seek to reduce collision damage and optimize energy consumption by encouraging behavior that conforms to the platoon’s environmentally friendly aim. Our study provides insight into the potential benefits of using reinforcement learning algorithms to improve the safety and efficiency of CACC platoons while promoting sustainable transportation. The policy gradient algorithm used in this paper has good convergence in the calculation of the minimum energy consumption problem and the optimal solution of vehicle behavior. In terms of energy consumption metrics, the policy gradient algorithm is used first in the IVC field for training the proposed platoon problem. It is a feasible training decision-planning algorithm for solving the minimization of energy consumption caused by decision making in platoon avoidance behavior.
format Online
Article
Text
id pubmed-10144777
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10144777 2023-04-29 Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions † Gu, Zhiru Liu, Zhongwei Wang, Qi Mao, Qiyun Shuai, Zhikang Ma, Ziji Sensors (Basel) Article MDPI 2023-04-21 /pmc/articles/PMC10144777/ /pubmed/37112514 http://dx.doi.org/10.3390/s23084176 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Reinforcement Learning-Based Approach for Minimizing Energy Loss of Driving Platoon Decisions †
topic Article