Cargando…

A comparative study on machine learning based algorithms for prediction of motorcycle crash severity

Motorcycle crash severity is under-researched in Ghana. Thus, the probable risk factors and association between these factors and motorcycle crash severity outcomes is not known. Traditional statistical models have intrinsic assumptions and pre-defined correlations that, if flouted, can generate ina...

Descripción completa

Detalles Bibliográficos
Autores principales: Wahab, Lukuman, Jiang, Haobin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448880/
https://www.ncbi.nlm.nih.gov/pubmed/30947250
http://dx.doi.org/10.1371/journal.pone.0214966
_version_ 1783408743056146432
author Wahab, Lukuman
Jiang, Haobin
author_facet Wahab, Lukuman
Jiang, Haobin
author_sort Wahab, Lukuman
collection PubMed
description Motorcycle crash severity is under-researched in Ghana. Thus, the probable risk factors and association between these factors and motorcycle crash severity outcomes is not known. Traditional statistical models have intrinsic assumptions and pre-defined correlations that, if flouted, can generate inaccurate results. In this study, machine learning based algorithms were employed to predict and classify motorcycle crash severity. Machine learning based techniques are non-parametric models without the presumption of relationships between endogenous and exogenous variables. The main aim of this research is to evaluate and compare different approaches to modeling motorcycle crash severity as well as investigating the effect of risk factors on the injury outcomes of motorcycle crashes. Motorcycle crash dataset between 2011 and 2015 was extracted from the National Road Traffic Crash Database at the Building and Road Research Institute (BRRI) in Ghana. The dataset was classified into four injury severity categories: fatal, hospitalized, injured, and damage-only. Three machine learning based models were developed: J48 Decision Tree Classifier, Random Forest (RF) and Instance-Based learning with parameter k (IBk) were employed to model the severity of injury in a motorcycle crash. These machine learning algorithms were validated using 10-fold cross-validation technique. The three machine learning based algorithms were compared with one another and the statistical model: multinomial logit model (MNLM). Also, the relative importance analysis of the attribute was conducted to determine the impact of these attributes on injury severity outcomes. The results of the study reveal that the predictions of machine learning algorithms are superior to the MNLM in accuracy and effectiveness, and the RF-based algorithms show the overall best agreement with the experimental data out of the three machine learning algorithms, for its global optimization and extrapolation ability. Location type, time of the crash, settlement type, collision partner, collision type, road separation, road surface type, the day of the week, and road shoulder condition were found as the critical determinants of motorcycle crash injury severity.
format Online
Article
Text
id pubmed-6448880
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-64488802019-04-19 A comparative study on machine learning based algorithms for prediction of motorcycle crash severity Wahab, Lukuman Jiang, Haobin PLoS One Research Article Motorcycle crash severity is under-researched in Ghana. Thus, the probable risk factors and association between these factors and motorcycle crash severity outcomes is not known. Traditional statistical models have intrinsic assumptions and pre-defined correlations that, if flouted, can generate inaccurate results. In this study, machine learning based algorithms were employed to predict and classify motorcycle crash severity. Machine learning based techniques are non-parametric models without the presumption of relationships between endogenous and exogenous variables. The main aim of this research is to evaluate and compare different approaches to modeling motorcycle crash severity as well as investigating the effect of risk factors on the injury outcomes of motorcycle crashes. Motorcycle crash dataset between 2011 and 2015 was extracted from the National Road Traffic Crash Database at the Building and Road Research Institute (BRRI) in Ghana. The dataset was classified into four injury severity categories: fatal, hospitalized, injured, and damage-only. Three machine learning based models were developed: J48 Decision Tree Classifier, Random Forest (RF) and Instance-Based learning with parameter k (IBk) were employed to model the severity of injury in a motorcycle crash. These machine learning algorithms were validated using 10-fold cross-validation technique. The three machine learning based algorithms were compared with one another and the statistical model: multinomial logit model (MNLM). Also, the relative importance analysis of the attribute was conducted to determine the impact of these attributes on injury severity outcomes. The results of the study reveal that the predictions of machine learning algorithms are superior to the MNLM in accuracy and effectiveness, and the RF-based algorithms show the overall best agreement with the experimental data out of the three machine learning algorithms, for its global optimization and extrapolation ability. Location type, time of the crash, settlement type, collision partner, collision type, road separation, road surface type, the day of the week, and road shoulder condition were found as the critical determinants of motorcycle crash injury severity. Public Library of Science 2019-04-04 /pmc/articles/PMC6448880/ /pubmed/30947250 http://dx.doi.org/10.1371/journal.pone.0214966 Text en © 2019 Wahab, Jiang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wahab, Lukuman
Jiang, Haobin
A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title_full A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title_fullStr A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title_full_unstemmed A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title_short A comparative study on machine learning based algorithms for prediction of motorcycle crash severity
title_sort comparative study on machine learning based algorithms for prediction of motorcycle crash severity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448880/
https://www.ncbi.nlm.nih.gov/pubmed/30947250
http://dx.doi.org/10.1371/journal.pone.0214966
work_keys_str_mv AT wahablukuman acomparativestudyonmachinelearningbasedalgorithmsforpredictionofmotorcyclecrashseverity
AT jianghaobin acomparativestudyonmachinelearningbasedalgorithmsforpredictionofmotorcyclecrashseverity
AT wahablukuman comparativestudyonmachinelearningbasedalgorithmsforpredictionofmotorcyclecrashseverity
AT jianghaobin comparativestudyonmachinelearningbasedalgorithmsforpredictionofmotorcyclecrashseverity