Cargando…

Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Chen, Liu, Lin, Xu, Chengcheng, Lv, Weitao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388263/ https://www.ncbi.nlm.nih.gov/pubmed/30691063 http://dx.doi.org/10.3390/ijerph16030334

_version_	1783397730767339520
author	Wang, Chen Liu, Lin Xu, Chengcheng Lv, Weitao
author_facet	Wang, Chen Liu, Lin Xu, Chengcheng Lv, Weitao
author_sort	Wang, Chen
collection	PubMed
description	The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers’ two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013–2014, 2014–2015, 2015–2016, and 2016–2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016–2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1–2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively.
format	Online Article Text
id	pubmed-6388263
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-63882632019-02-27 Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework Wang, Chen Liu, Lin Xu, Chengcheng Lv, Weitao Int J Environ Res Public Health Article The objective of this paper is to predict the future driving risk of crash-involved drivers in Kunshan, China. A systematic machine learning framework is proposed to deal with three critical technical issues: 1. defining driving risk; 2. developing risky driving factors; 3. developing a reliable and explicable machine learning model. High-risk (HR) and low-risk (LR) drivers were defined by five different scenarios. A number of features were extracted from seven-year crash/violation records. Drivers’ two-year prior crash/violation information was used to predict their driving risk in the subsequent two years. Using a one-year rolling time window, prediction models were developed for four consecutive time periods: 2013–2014, 2014–2015, 2015–2016, and 2016–2017. Four tree-based ensemble learning techniques were attempted, including random forest (RF), Adaboost with decision tree, gradient boosting decision tree (GBDT), and extreme gradient boosting decision tree (XGboost). A temporal transferability test and a follow-up study were applied to validate the trained models. The best scenario defining driving risk was multi-dimensional, encompassing crash recurrence, severity, and fault commitment. GBDT appeared to be the best model choice across all time periods, with an acceptable average precision (AP) of 0.68 on the most recent datasets (i.e., 2016–2017). Seven of nine top features were related to risky driving behaviors, which presented non-linear relationships with driving risk. Model transferability held within relatively short time intervals (1–2 years). Appropriate risk definition, complicated violation/crash features, and advanced machine learning techniques need to be considered for risk prediction task. The proposed machine learning approach is promising, so that safety interventions can be launched more effectively. MDPI 2019-01-25 2019-02 /pmc/articles/PMC6388263/ /pubmed/30691063 http://dx.doi.org/10.3390/ijerph16030334 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Wang, Chen Liu, Lin Xu, Chengcheng Lv, Weitao Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title	Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title_full	Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title_fullStr	Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title_full_unstemmed	Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title_short	Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework
title_sort	predicting future driving risk of crash-involved drivers based on a systematic machine learning framework
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6388263/ https://www.ncbi.nlm.nih.gov/pubmed/30691063 http://dx.doi.org/10.3390/ijerph16030334
work_keys_str_mv	AT wangchen predictingfuturedrivingriskofcrashinvolveddriversbasedonasystematicmachinelearningframework AT liulin predictingfuturedrivingriskofcrashinvolveddriversbasedonasystematicmachinelearningframework AT xuchengcheng predictingfuturedrivingriskofcrashinvolveddriversbasedonasystematicmachinelearningframework AT lvweitao predictingfuturedrivingriskofcrashinvolveddriversbasedonasystematicmachinelearningframework

Predicting Future Driving Risk of Crash-Involved Drivers Based on a Systematic Machine Learning Framework

Ejemplares similares