Effects analysis of reward functions on reinforcement learning for traffic signal control

The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performanc...

Bibliographic Details
Main authors:	Lee, Hyosun; Han, Yohee; Kim, Youngchan; Kim, Yong Hoon
Format:	Online Article Text
Language:	English
Published:	Public Library of Science 2022
Subjects:	Research Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678263/ https://www.ncbi.nlm.nih.gov/pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813

author	Lee, Hyosun; Han, Yohee; Kim, Youngchan; Kim, Yong Hoon
collection	PubMed
description	The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. The performance indexes from intersection perspectives are averaged by the weighted traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on the movements of heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to the real-world intersections, it is necessary to consider the real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and confirm the appropriate design of the reward function. The limitations of the detector in the real world and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design the reward for balancing traffic movements, we define the average delay weighted by traffic volume per lane and entropy of delay in the reward function. Model training is performed at the prototype intersection for ensuring scalability to multiple intersections. The model after prototype pre-training is evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% compared to the existing real-time adaptive signal control at two intersections, respectively.
format	Online Article Text
id	pubmed-9678263
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-9678263 2022-11-22 Effects analysis of reward functions on reinforcement learning for traffic signal control Lee, Hyosun; Han, Yohee; Kim, Youngchan; Kim, Yong Hoon PLoS One Research Article
Public Library of Science 2022-11-21 /pmc/articles/PMC9678263/ /pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813 Text en © 2022 Lee et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title	Effects analysis of reward functions on reinforcement learning for traffic signal control
topic	Research Article

Similar Items