Cargando…

Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country

Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes i...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhuiyan, Hanif, Ara, Jinat, Hasib, Khan Md., Sourav, Md Imran Hossain, Karim, Faria Benta, Sik-Lanyi, Cecilia, Governatori, Guido, Rakotonirainy, Andry, Yasmin, Shamsunnahar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9732348/
https://www.ncbi.nlm.nih.gov/pubmed/36481807
http://dx.doi.org/10.1038/s41598-022-25361-5
_version_ 1784846111250317312
author Bhuiyan, Hanif
Ara, Jinat
Hasib, Khan Md.
Sourav, Md Imran Hossain
Karim, Faria Benta
Sik-Lanyi, Cecilia
Governatori, Guido
Rakotonirainy, Andry
Yasmin, Shamsunnahar
author_facet Bhuiyan, Hanif
Ara, Jinat
Hasib, Khan Md.
Sourav, Md Imran Hossain
Karim, Faria Benta
Sik-Lanyi, Cecilia
Governatori, Guido
Rakotonirainy, Andry
Yasmin, Shamsunnahar
author_sort Bhuiyan, Hanif
collection PubMed
description Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes is necessary for reducing road crashes and ensuring transportation safety. This paper presents an analytical approach to identifying significant contributing factors of Bangladesh road crashes by evaluating the road crash data, considering three different severity levels (non-fetal, severe, and extremely severe). Generally, official crash databases are compiled from police-reported crash records. Though the official datasets are focusing on compiling a wide array of attributes, an assorted number of unreported issues can be observed that demands an alternative source of crash data. Therefore, this proposed approach considers compiling crash data from newspapers in Bangladesh which could be complimentary to the official crash database. To conduct the analysis, first, we filtered the useful features from compiled crash data using three popular feature selection techniques: chi-square, Two-way ANOVA, and Regression analysis. Then, we employed three machine learning classifiers: Decision Tree, Random Forest, and Naïve Bayes over the extracted features. A confusion matrix was considered to evaluate the proposed model, including classification accuracy, sensitivity, and specificity. The predictive machine learning model, namely, Random Forest using Label Encoder with chi-square and Two-way ANOVA feature selection process, seems the best option for crash severity prediction that provides high prediction accuracy. The resulting model highlights nine out of fourteen independent features as responsible factors. Significant features associated with crash severities include driver characteristics (gender, license type, seat belts), vehicle characteristics (vehicle type), road characteristics (road surface type, road classification), environmental conditions (day of crash occurred, time of crash), and injury localization. This outcome may contribute to improving traffic safety of Bangladesh.
format Online
Article
Text
id pubmed-9732348
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97323482022-12-10 Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country Bhuiyan, Hanif Ara, Jinat Hasib, Khan Md. Sourav, Md Imran Hossain Karim, Faria Benta Sik-Lanyi, Cecilia Governatori, Guido Rakotonirainy, Andry Yasmin, Shamsunnahar Sci Rep Article Road traffic injuries are one of the primary reasons for death, especially in developing countries like Bangladesh. Safety in land transport is one of the major concerns for road safety authorities and other policymakers. For this reason, contributory factors identification associated with crashes is necessary for reducing road crashes and ensuring transportation safety. This paper presents an analytical approach to identifying significant contributing factors of Bangladesh road crashes by evaluating the road crash data, considering three different severity levels (non-fetal, severe, and extremely severe). Generally, official crash databases are compiled from police-reported crash records. Though the official datasets are focusing on compiling a wide array of attributes, an assorted number of unreported issues can be observed that demands an alternative source of crash data. Therefore, this proposed approach considers compiling crash data from newspapers in Bangladesh which could be complimentary to the official crash database. To conduct the analysis, first, we filtered the useful features from compiled crash data using three popular feature selection techniques: chi-square, Two-way ANOVA, and Regression analysis. Then, we employed three machine learning classifiers: Decision Tree, Random Forest, and Naïve Bayes over the extracted features. A confusion matrix was considered to evaluate the proposed model, including classification accuracy, sensitivity, and specificity. The predictive machine learning model, namely, Random Forest using Label Encoder with chi-square and Two-way ANOVA feature selection process, seems the best option for crash severity prediction that provides high prediction accuracy. The resulting model highlights nine out of fourteen independent features as responsible factors. Significant features associated with crash severities include driver characteristics (gender, license type, seat belts), vehicle characteristics (vehicle type), road characteristics (road surface type, road classification), environmental conditions (day of crash occurred, time of crash), and injury localization. This outcome may contribute to improving traffic safety of Bangladesh. Nature Publishing Group UK 2022-12-08 /pmc/articles/PMC9732348/ /pubmed/36481807 http://dx.doi.org/10.1038/s41598-022-25361-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Bhuiyan, Hanif
Ara, Jinat
Hasib, Khan Md.
Sourav, Md Imran Hossain
Karim, Faria Benta
Sik-Lanyi, Cecilia
Governatori, Guido
Rakotonirainy, Andry
Yasmin, Shamsunnahar
Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title_full Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title_fullStr Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title_full_unstemmed Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title_short Crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
title_sort crash severity analysis and risk factors identification based on an alternate data source: a case study of developing country
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9732348/
https://www.ncbi.nlm.nih.gov/pubmed/36481807
http://dx.doi.org/10.1038/s41598-022-25361-5
work_keys_str_mv AT bhuiyanhanif crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT arajinat crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT hasibkhanmd crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT souravmdimranhossain crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT karimfariabenta crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT siklanyicecilia crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT governatoriguido crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT rakotonirainyandry crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry
AT yasminshamsunnahar crashseverityanalysisandriskfactorsidentificationbasedonanalternatedatasourceacasestudyofdevelopingcountry