Cargando…
Effects analysis of reward functions on reinforcement learning for traffic signal control
The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performanc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678263/ https://www.ncbi.nlm.nih.gov/pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813 |
_version_ | 1784833953058783232 |
---|---|
author | Lee, Hyosun Han, Yohee Kim, Youngchan Kim, Yong Hoon |
author_facet | Lee, Hyosun Han, Yohee Kim, Youngchan Kim, Yong Hoon |
author_sort | Lee, Hyosun |
collection | PubMed |
description | The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. The performance indexes from intersection perspectives are averaged by the weighted traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on the movements of heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to the real-world intersections, it is necessary to consider the real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and confirm the appropriate design of the reward function. The limitations of the detector in the real world and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design the reward for balancing traffic movements, we define the average delay weighted by traffic volume per lane and entropy of delay in the reward function. Model training is performed at the prototype intersection for ensuring scalability to multiple intersections. The model after prototype pre-training is evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% compared to the existing real-time adaptive signal control at two intersections, respectively. |
format | Online Article Text |
id | pubmed-9678263 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96782632022-11-22 Effects analysis of reward functions on reinforcement learning for traffic signal control Lee, Hyosun Han, Yohee Kim, Youngchan Kim, Yong Hoon PLoS One Research Article The increasing traffic demand in urban areas frequently causes traffic congestion, which can be managed only through intelligent traffic signal controls. Although many recent studies have focused on reinforcement learning for traffic signal control (RL-TSC), most have focused on improving performance from an intersection perspective, targeting virtual simulation. The performance indexes from intersection perspectives are averaged by the weighted traffic flow; therefore, if the balance of each movement is not considered, the green time may be overly concentrated on the movements of heavy flow rates. Furthermore, as the ultimate purpose of traffic signal control research is to apply these controls to the real-world intersections, it is necessary to consider the real-world constraints. Hence, this study aims to design RL-TSC considering real-world applicability and confirm the appropriate design of the reward function. The limitations of the detector in the real world and the dual-ring traffic signal system are taken into account in the model design to facilitate real-world application. To design the reward for balancing traffic movements, we define the average delay weighted by traffic volume per lane and entropy of delay in the reward function. Model training is performed at the prototype intersection for ensuring scalability to multiple intersections. The model after prototype pre-training is evaluated by applying it to a network with two intersections without additional training. As a result, the reward function considering the equality of traffic movements shows the best performance. The proposed model reduces the average delay by more than 7.4% and 15.0% compared to the existing real-time adaptive signal control at two intersections, respectively. Public Library of Science 2022-11-21 /pmc/articles/PMC9678263/ /pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813 Text en © 2022 Lee et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Lee, Hyosun Han, Yohee Kim, Youngchan Kim, Yong Hoon Effects analysis of reward functions on reinforcement learning for traffic signal control |
title | Effects analysis of reward functions on reinforcement learning for traffic signal control |
title_full | Effects analysis of reward functions on reinforcement learning for traffic signal control |
title_fullStr | Effects analysis of reward functions on reinforcement learning for traffic signal control |
title_full_unstemmed | Effects analysis of reward functions on reinforcement learning for traffic signal control |
title_short | Effects analysis of reward functions on reinforcement learning for traffic signal control |
title_sort | effects analysis of reward functions on reinforcement learning for traffic signal control |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9678263/ https://www.ncbi.nlm.nih.gov/pubmed/36409713 http://dx.doi.org/10.1371/journal.pone.0277813 |
work_keys_str_mv | AT leehyosun effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol AT hanyohee effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol AT kimyoungchan effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol AT kimyonghoon effectsanalysisofrewardfunctionsonreinforcementlearningfortrafficsignalcontrol |