Cargando…
Identifying high crash risk segments in rural roads using ensemble decision tree-based models
Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681741/ https://www.ncbi.nlm.nih.gov/pubmed/36414672 http://dx.doi.org/10.1038/s41598-022-24476-z |
_version_ | 1784834688975634432 |
---|---|
author | Iranmanesh, Maryam Seyedabrishami, Seyedehsan Moridpour, Sara |
author_facet | Iranmanesh, Maryam Seyedabrishami, Seyedehsan Moridpour, Sara |
author_sort | Iranmanesh, Maryam |
collection | PubMed |
description | Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse traffic crashes on rural roads to identify the influencing factors on the crash frequency and present a framework to develop a spatial–temporal crash risk map to prioritise high-risk segments on different days. The crash data of Khorasan Razavi province is used in this study. Crash frequency data with the temporal resolution of one day and spatial resolution of 1500 m from loop detectors are analysed. Four groups of influential factors, including traffic parameters (e.g. traffic flow, speed, time headway), road characteristics (e.g. road type, number of lanes), weather data (e.g. daily rainfall, snow depth, temperature), and calendar variables (e.g. day of the week, public holidays, month, year) are used for model calibration. Three different decision tree algorithms, including, Decision Tree (DT), Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) have been employed to predict crash frequency. Results show that based on the traditional evaluation measures, the XGBosst is better for the explanation and interpretation of the factors affecting crash frequency, while the RF model is better for detecting trends and forecasting crash frequency. According to the results, the traffic flow rate, road type, year of the crash, and wind speed are the most influencing variables in predicting crash frequency on rural roads. Forecasting the high and medium risk segment-day in the rural network can be essential to the safety management plan. This risk will be sensitive to real traffic data, weather forecasts and road geometric characteristics. Seventy percent of high and medium risk segment-day are predicted for the case study. |
format | Online Article Text |
id | pubmed-9681741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96817412022-11-24 Identifying high crash risk segments in rural roads using ensemble decision tree-based models Iranmanesh, Maryam Seyedabrishami, Seyedehsan Moridpour, Sara Sci Rep Article Traffic safety forecast models are mainly used to rank road segments. While existing studies have primarily focused on identifying segments in urban networks, rural networks have received less attention. However, rural networks seem to have a higher risk of severe crashes. This paper aims to analyse traffic crashes on rural roads to identify the influencing factors on the crash frequency and present a framework to develop a spatial–temporal crash risk map to prioritise high-risk segments on different days. The crash data of Khorasan Razavi province is used in this study. Crash frequency data with the temporal resolution of one day and spatial resolution of 1500 m from loop detectors are analysed. Four groups of influential factors, including traffic parameters (e.g. traffic flow, speed, time headway), road characteristics (e.g. road type, number of lanes), weather data (e.g. daily rainfall, snow depth, temperature), and calendar variables (e.g. day of the week, public holidays, month, year) are used for model calibration. Three different decision tree algorithms, including, Decision Tree (DT), Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) have been employed to predict crash frequency. Results show that based on the traditional evaluation measures, the XGBosst is better for the explanation and interpretation of the factors affecting crash frequency, while the RF model is better for detecting trends and forecasting crash frequency. According to the results, the traffic flow rate, road type, year of the crash, and wind speed are the most influencing variables in predicting crash frequency on rural roads. Forecasting the high and medium risk segment-day in the rural network can be essential to the safety management plan. This risk will be sensitive to real traffic data, weather forecasts and road geometric characteristics. Seventy percent of high and medium risk segment-day are predicted for the case study. Nature Publishing Group UK 2022-11-21 /pmc/articles/PMC9681741/ /pubmed/36414672 http://dx.doi.org/10.1038/s41598-022-24476-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Iranmanesh, Maryam Seyedabrishami, Seyedehsan Moridpour, Sara Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title | Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title_full | Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title_fullStr | Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title_full_unstemmed | Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title_short | Identifying high crash risk segments in rural roads using ensemble decision tree-based models |
title_sort | identifying high crash risk segments in rural roads using ensemble decision tree-based models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9681741/ https://www.ncbi.nlm.nih.gov/pubmed/36414672 http://dx.doi.org/10.1038/s41598-022-24476-z |
work_keys_str_mv | AT iranmaneshmaryam identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels AT seyedabrishamiseyedehsan identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels AT moridpoursara identifyinghighcrashrisksegmentsinruralroadsusingensembledecisiontreebasedmodels |