Cargando…

Identifying high crash risk segments in rural roads using ensemble decision tree-based models

Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse...

Descripción completa

Detalles Bibliográficos
Autores principales: Iranmanesh, Maryam, Seyedabrishami, Seyedehsan, Moridpour, Sara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681741/
https://www.ncbi.nlm.nih.gov/pubmed/36414672
http://dx.doi.org/10.1038/s41598-022-24476-z
_version_ 1784834688975634432
author Iranmanesh, Maryam
Seyedabrishami, Seyedehsan
Moridpour, Sara
author_facet Iranmanesh, Maryam
Seyedabrishami, Seyedehsan
Moridpour, Sara
author_sort Iranmanesh, Maryam
collection PubMed
description Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse traffic crashes on rural roads to identify the influencing factors on the crash frequency and present a framework to develop a spatial–temporal crash risk map to prioritise high-risk segments on different days. The crash data of Khorasan Razavi province is used in this study. Crash frequency data with the temporal resolution of one day and spatial resolution of 1500 m from loop detectors are analysed. Four groups of influential factors, including traffic parameters (e.g. traffic flow, speed, time headway), road characteristics (e.g. road type, number of lanes), weather data (e.g. daily rainfall, snow depth, temperature), and calendar variables (e.g. day of the week, public holidays, month, year) are used for model calibration. Three different decision tree algorithms, including, Decision Tree (DT), Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) have been employed to predict crash frequency. Results show that based on the traditional evaluation measures, the XGBosst is better for the explanation and interpretation of the factors affecting crash frequency, while the RF model is better for detecting trends and forecasting crash frequency. According to the results, the traffic flow rate, road type, year of the crash, and wind speed are the most influencing variables in predicting crash frequency on rural roads. Forecasting the high and medium risk segment-day in the rural network can be essential to the safety management plan. This risk will be sensitive to real traffic data, weather forecasts and road geometric characteristics. Seventy percent of high and medium risk segment-day are predicted for the case study.
format Online
Article
Text
id pubmed-9681741
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96817412022-11-24 Identifying high crash risk segments in rural roads using ensemble decision tree-based models Iranmanesh, Maryam Seyedabrishami, Seyedehsan Moridpour, Sara Sci Rep Article Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse traffic crashes on rural roads to identify the influencing factors on the crash frequency and present a framework to develop a spatial–temporal crash risk map to prioritise high-risk segments on different days. The crash data of Khorasan Razavi province is used in this study. Crash frequency data with the temporal resolution of one day and spatial resolution of 1500 m from loop detectors are analysed. Four groups of influential factors, including traffic parameters (e.g. traffic flow, speed, time headway), road characteristics (e.g. road type, number of lanes), weather data (e.g. daily rainfall, snow depth, temperature), and calendar variables (e.g. day of the week, public holidays, month, year) are used for model calibration. Three different decision tree algorithms, including, Decision Tree (DT), Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) have been employed to predict crash frequency. Results show that based on the traditional evaluation measures, the XGBosst is better for the explanation and interpretation of the factors affecting crash frequency, while the RF model is better for detecting trends and forecasting crash frequency. According to the results, the traffic flow rate, road type, year of the crash, and wind speed are the most influencing variables in predicting crash frequency on rural roads. Forecasting the high and medium risk segment-day in the rural network can be essential to the safety management plan. This risk will be sensitive to real traffic data, weather forecasts and road geometric characteristics. Seventy percent of high and medium risk segment-day are predicted for the case study. Nature Publishing Group UK 2022-11-21 /pmc/articles/PMC9681741/ /pubmed/36414672 http://dx.doi.org/10.1038/s41598-022-24476-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Iranmanesh, Maryam
Seyedabrishami, Seyedehsan
Moridpour, Sara
Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title_full Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title_fullStr Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title_full_unstemmed Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title_short Identifying high crash risk segments in rural roads using ensemble decision tree-based models
title_sort identifying high crash risk segments in rural roads using ensemble decision tree-based models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681741/
https://www.ncbi.nlm.nih.gov/pubmed/36414672
http://dx.doi.org/10.1038/s41598-022-24476-z
work_keys_str_mv AT iranmaneshmaryam identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels
AT seyedabrishamiseyedehsan identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels
AT moridpoursara identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels