Cargando…

Effects analysis of reward functions on reinforcement learning for traffic signal control

The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performanc...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Hyosun, Han, Yohee, Kim, Youngchan, Kim, Yong Hoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678263/
https://www.ncbi.nlm.nih.gov/pubmed/36409713
http://dx.doi.org/10.1371/journal.pone.0277813
_version_ 1784833953058783232
author Lee, Hyosun
Han, Yohee
Kim, Youngchan
Kim, Yong Hoon
author_facet Lee, Hyosun
Han, Yohee
Kim, Youngchan
Kim, Yong Hoon
author_sort Lee, Hyosun
collection PubMed
description The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. The performance indexes from intersection perspectives are averaged by the weighted traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on the movements of heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to the real-world intersections, it is necessary to consider the real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and confirm the appropriate design of the reward function. The limitations of the detector in the real world and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design the reward for balancing traffic movements, we define the average delay weighted by traffic volume per lane and entropy of delay in the reward function. Model training is performed at the prototype intersection for ensuring scalability to multiple intersections. The model after prototype pre-training is evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% compared to the existing real-time adaptive signal control at two intersections, respectively.
format Online
Article
Text
id pubmed-9678263
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-96782632022-11-22 Effects analysis of reward functions on reinforcement learning for traffic signal control Lee, Hyosun Han, Yohee Kim, Youngchan Kim, Yong Hoon PLoS One Research Article The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. The performance indexes from intersection perspectives are averaged by the weighted traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on the movements of heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to the real-world intersections, it is necessary to consider the real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and confirm the appropriate design of the reward function. The limitations of the detector in the real world and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design the reward for balancing traffic movements, we define the average delay weighted by traffic volume per lane and entropy of delay in the reward function. Model training is performed at the prototype intersection for ensuring scalability to multiple intersections. The model after prototype pre-training is evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% compared to the existing real-time adaptive signal control at two intersections, respectively. Public Library of Science 2022-11-21 /pmc/articles/PMC9678263/ /pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813 Text en © 2022 Lee et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Hyosun
Han, Yohee
Kim, Youngchan
Kim, Yong Hoon
Effects analysis of reward functions on reinforcement learning for traffic signal control
title Effects analysis of reward functions on reinforcement learning for traffic signal control
title_full Effects analysis of reward functions on reinforcement learning for traffic signal control
title_fullStr Effects analysis of reward functions on reinforcement learning for traffic signal control
title_full_unstemmed Effects analysis of reward functions on reinforcement learning for traffic signal control
title_short Effects analysis of reward functions on reinforcement learning for traffic signal control
title_sort effects analysis of reward functions on reinforcement learning for traffic signal control
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678263/
https://www.ncbi.nlm.nih.gov/pubmed/36409713
http://dx.doi.org/10.1371/journal.pone.0277813
work_keys_str_mv AT leehyosun effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol
AT hanyohee effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol
AT kimyoungchan effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol
AT kimyonghoon effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol