Cargando…
Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes i...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9732348/ https://www.ncbi.nlm.nih.gov/pubmed/36481807 http://dx.doi.org/10.1038/s41598-022-25361-5 |
_version_ | 1784846111250317312 |
---|---|
author | Bhuiyan, Hanif Ara, Jinat Hasib, Khan Md. Sourav, Md Imran Hossain Karim, Faria Benta Sik-Lanyi, Cecilia Governatori, Guido Rakotonirainy, Andry Yasmin, Shamsunnahar |
author_facet | Bhuiyan, Hanif Ara, Jinat Hasib, Khan Md. Sourav, Md Imran Hossain Karim, Faria Benta Sik-Lanyi, Cecilia Governatori, Guido Rakotonirainy, Andry Yasmin, Shamsunnahar |
author_sort | Bhuiyan, Hanif |
collection | PubMed |
description | Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes is necessary for reducing road crashes and ensuring transportation safety. This paper presents an analytical approach to identifying significant contributing factors of Bangladesh road crashes by evaluating the road crash data, considering three different severity levels (non-fetal, severe, and extremely severe). Generally, official crash databases are compiled from police-reported crash records. Though the official datasets are focusing on compiling a wide array of attributes, an assorted number of unreported issues can be observed that demands an alternative source of crash data. Therefore, this proposed approach considers compiling crash data from newspapers in Bangladesh which could be complimentary to the official crash database. To conduct the analysis, first, we filtered the useful features from compiled crash data using three popular feature selection techniques: chi-square, Two-way ANOVA, and Regression analysis. Then, we employed three machine learning classifiers: Decision Tree, Random Forest, and Naïve Bayes over the extracted features. A confusion matrix was considered to evaluate the proposed model, including classification accuracy, sensitivity, and specificity. The predictive machine learning model, namely, Random Forest using Label Encoder with chi-square and Two-way ANOVA feature selection process, seems the best option for crash severity prediction that provides high prediction accuracy. The resulting model highlights nine out of fourteen independent features as responsible factors. Significant features associated with crash severities include driver characteristics (gender, license type, seat belts), vehicle characteristics (vehicle type), road characteristics (road surface type, road classification), environmental conditions (day of crash occurred, time of crash), and injury localization. This outcome may contribute to improving traffic safety of Bangladesh. |
format | Online Article Text |
id | pubmed-9732348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97323482022-12-10 Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country Bhuiyan, Hanif Ara, Jinat Hasib, Khan Md. Sourav, Md Imran Hossain Karim, Faria Benta Sik-Lanyi, Cecilia Governatori, Guido Rakotonirainy, Andry Yasmin, Shamsunnahar Sci Rep Article Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes is necessary for reducing road crashes and ensuring transportation safety. This paper presents an analytical approach to identifying significant contributing factors of Bangladesh road crashes by evaluating the road crash data, considering three different severity levels (non-fetal, severe, and extremely severe). Generally, official crash databases are compiled from police-reported crash records. Though the official datasets are focusing on compiling a wide array of attributes, an assorted number of unreported issues can be observed that demands an alternative source of crash data. Therefore, this proposed approach considers compiling crash data from newspapers in Bangladesh which could be complimentary to the official crash database. To conduct the analysis, first, we filtered the useful features from compiled crash data using three popular feature selection techniques: chi-square, Two-way ANOVA, and Regression analysis. Then, we employed three machine learning classifiers: Decision Tree, Random Forest, and Naïve Bayes over the extracted features. A confusion matrix was considered to evaluate the proposed model, including classification accuracy, sensitivity, and specificity. The predictive machine learning model, namely, Random Forest using Label Encoder with chi-square and Two-way ANOVA feature selection process, seems the best option for crash severity prediction that provides high prediction accuracy. The resulting model highlights nine out of fourteen independent features as responsible factors. Significant features associated with crash severities include driver characteristics (gender, license type, seat belts), vehicle characteristics (vehicle type), road characteristics (road surface type, road classification), environmental conditions (day of crash occurred, time of crash), and injury localization. This outcome may contribute to improving traffic safety of Bangladesh. Nature Publishing Group UK 2022-12-08 /pmc/articles/PMC9732348/ /pubmed/36481807 http://dx.doi.org/10.1038/s41598-022-25361-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Bhuiyan, Hanif Ara, Jinat Hasib, Khan Md. Sourav, Md Imran Hossain Karim, Faria Benta Sik-Lanyi, Cecilia Governatori, Guido Rakotonirainy, Andry Yasmin, Shamsunnahar Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title | Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title_full | Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title_fullStr | Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title_full_unstemmed | Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title_short | Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
title_sort | crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9732348/ https://www.ncbi.nlm.nih.gov/pubmed/36481807 http://dx.doi.org/10.1038/s41598-022-25361-5 |
work_keys_str_mv | AT bhuiyanhanif crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT arajinat crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT hasibkhanmd crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT souravmdimranhossain crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT karimfariabenta crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT siklanyicecilia crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT governatoriguido crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT rakotonirainyandry crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry AT yasminshamsunnahar crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry |