Cargando…

Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction

Road infrastructure has significant effects on road traffic safety and needs further examination. In terms of traffic crash prediction, recent studies have started to develop deep learning classification algorithms. However, given the uncertainty of traffic crashes, predicting the traffic risk poten...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Bo, Zhang, Chi, Wong, Yiik Diew, Hou, Lei, Zhang, Min, Xiang, Yujie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603763/
https://www.ncbi.nlm.nih.gov/pubmed/36294267
http://dx.doi.org/10.3390/ijerph192013693
_version_ 1784817637538136064
author Wang, Bo
Zhang, Chi
Wong, Yiik Diew
Hou, Lei
Zhang, Min
Xiang, Yujie
author_facet Wang, Bo
Zhang, Chi
Wong, Yiik Diew
Hou, Lei
Zhang, Min
Xiang, Yujie
author_sort Wang, Bo
collection PubMed
description Road infrastructure has significant effects on road traffic safety and needs further examination. In terms of traffic crash prediction, recent studies have started to develop deep learning classification algorithms. However, given the uncertainty of traffic crashes, predicting the traffic risk potential of different road sections remains a challenge. To bridge this knowledge gap, this study investigated a real-world expressway and collected its traffic crash data between 2013 and 2020. Then, according to the time-spatial density ratio (Pts), road sections were assigned into three classes corresponding to low, medium, and high risk levels of traffic. Next, different classifiers were compared that were trained using the transformed and resampled feature data to construct a traffic crash risk prediction model. Last, but not least, partial dependence plots (PDPs) were employed to interpret the results and analyze the importance of individual features describing the geometry, pavement, structure, and weather conditions. The results showed that a variety of data balancing algorithms improved the performance of the classifiers, the ensemble classifier superseded the others in terms of the performance metrics, and the combined SMOTEENN and random forest algorithms improved the classification accuracy the most. In the future, the proposed traffic crash risk prediction method will be tested in more road maintenance and design safety assessment scenarios.
format Online
Article
Text
id pubmed-9603763
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96037632022-10-27 Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction Wang, Bo Zhang, Chi Wong, Yiik Diew Hou, Lei Zhang, Min Xiang, Yujie Int J Environ Res Public Health Article Road infrastructure has significant effects on road traffic safety and needs further examination. In terms of traffic crash prediction, recent studies have started to develop deep learning classification algorithms. However, given the uncertainty of traffic crashes, predicting the traffic risk potential of different road sections remains a challenge. To bridge this knowledge gap, this study investigated a real-world expressway and collected its traffic crash data between 2013 and 2020. Then, according to the time-spatial density ratio (Pts), road sections were assigned into three classes corresponding to low, medium, and high risk levels of traffic. Next, different classifiers were compared that were trained using the transformed and resampled feature data to construct a traffic crash risk prediction model. Last, but not least, partial dependence plots (PDPs) were employed to interpret the results and analyze the importance of individual features describing the geometry, pavement, structure, and weather conditions. The results showed that a variety of data balancing algorithms improved the performance of the classifiers, the ensemble classifier superseded the others in terms of the performance metrics, and the combined SMOTEENN and random forest algorithms improved the classification accuracy the most. In the future, the proposed traffic crash risk prediction method will be tested in more road maintenance and design safety assessment scenarios. MDPI 2022-10-21 /pmc/articles/PMC9603763/ /pubmed/36294267 http://dx.doi.org/10.3390/ijerph192013693 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Bo
Zhang, Chi
Wong, Yiik Diew
Hou, Lei
Zhang, Min
Xiang, Yujie
Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title_full Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title_fullStr Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title_full_unstemmed Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title_short Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction
title_sort comparing resampling algorithms and classifiers for modeling traffic risk prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603763/
https://www.ncbi.nlm.nih.gov/pubmed/36294267
http://dx.doi.org/10.3390/ijerph192013693
work_keys_str_mv AT wangbo comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction
AT zhangchi comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction
AT wongyiikdiew comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction
AT houlei comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction
AT zhangmin comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction
AT xiangyujie comparingresamplingalgorithmsandclassifiersformodelingtrafficriskprediction