Cargando…

Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning

Predicting driving behavior and crash risk in real-time is a problem that has been heavily researched in the past years. Although in-vehicle interventions and gamification features in post-trip dashboards have emerged, the connection between real-time driving behavior prediction and the triggering o...

Descripción completa

Detalles Bibliográficos
Autores principales: Garefalakis, Thodoris, Katrakazas, Christos, Yannis, George
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9319394/
https://www.ncbi.nlm.nih.gov/pubmed/35890990
http://dx.doi.org/10.3390/s22145309
_version_ 1784755538389630976
author Garefalakis, Thodoris
Katrakazas, Christos
Yannis, George
author_facet Garefalakis, Thodoris
Katrakazas, Christos
Yannis, George
author_sort Garefalakis, Thodoris
collection PubMed
description Predicting driving behavior and crash risk in real-time is a problem that has been heavily researched in the past years. Although in-vehicle interventions and gamification features in post-trip dashboards have emerged, the connection between real-time driving behavior prediction and the triggering of such interventions is yet to be realized. This is the focus of the European Horizon2020 project “i-DREAMS”, which aims at defining, developing, testing and validating a ‘Safety Tolerance Zone’ (STZ) in order to prevent drivers from risky driving behaviors using interventions both in real-time and post-trip. However, the data-driven conceptualization of STZ levels is a challenging task, and data class imbalance might hinder this process. Following the project principles and taking the aforementioned challenges into consideration, this paper proposes a framework to identify the level of risky driving behavior as well as the duration of the time spent in each risk level by private car drivers. This aim is accomplished by four classification algorithms, namely Support Vector Machines (SVMs), Random Forest (RFs), AdaBoost, and Multilayer Perceptron (MLP) Neural Networks and imbalanced learning using the Adaptive Synthetic technique (ADASYN) in order to deal with the unbalanced distribution of the dataset in the STZ levels. Moreover, as an alternative approach of risk prediction, three regression algorithms, namely Ridge, Lasso, and Elastic Net are used to predict time duration. The results showed that RF and MLP outperformed the rest of the classifiers with 84% and 82% overall accuracy, respectively, and that the maximum speed of the vehicle during a 30 s interval, is the most crucial predictor for identifying the driving time at each safety level.
format Online
Article
Text
id pubmed-9319394
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93193942022-07-27 Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning Garefalakis, Thodoris Katrakazas, Christos Yannis, George Sensors (Basel) Article Predicting driving behavior and crash risk in real-time is a problem that has been heavily researched in the past years. Although in-vehicle interventions and gamification features in post-trip dashboards have emerged, the connection between real-time driving behavior prediction and the triggering of such interventions is yet to be realized. This is the focus of the European Horizon2020 project “i-DREAMS”, which aims at defining, developing, testing and validating a ‘Safety Tolerance Zone’ (STZ) in order to prevent drivers from risky driving behaviors using interventions both in real-time and post-trip. However, the data-driven conceptualization of STZ levels is a challenging task, and data class imbalance might hinder this process. Following the project principles and taking the aforementioned challenges into consideration, this paper proposes a framework to identify the level of risky driving behavior as well as the duration of the time spent in each risk level by private car drivers. This aim is accomplished by four classification algorithms, namely Support Vector Machines (SVMs), Random Forest (RFs), AdaBoost, and Multilayer Perceptron (MLP) Neural Networks and imbalanced learning using the Adaptive Synthetic technique (ADASYN) in order to deal with the unbalanced distribution of the dataset in the STZ levels. Moreover, as an alternative approach of risk prediction, three regression algorithms, namely Ridge, Lasso, and Elastic Net are used to predict time duration. The results showed that RF and MLP outperformed the rest of the classifiers with 84% and 82% overall accuracy, respectively, and that the maximum speed of the vehicle during a 30 s interval, is the most crucial predictor for identifying the driving time at each safety level. MDPI 2022-07-15 /pmc/articles/PMC9319394/ /pubmed/35890990 http://dx.doi.org/10.3390/s22145309 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Garefalakis, Thodoris
Katrakazas, Christos
Yannis, George
Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title_full Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title_fullStr Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title_full_unstemmed Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title_short Data-Driven Estimation of a Driving Safety Tolerance Zone Using Imbalanced Machine Learning
title_sort data-driven estimation of a driving safety tolerance zone using imbalanced machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9319394/
https://www.ncbi.nlm.nih.gov/pubmed/35890990
http://dx.doi.org/10.3390/s22145309
work_keys_str_mv AT garefalakisthodoris datadrivenestimationofadrivingsafetytolerancezoneusingimbalancedmachinelearning
AT katrakazaschristos datadrivenestimationofadrivingsafetytolerancezoneusingimbalancedmachinelearning
AT yannisgeorge datadrivenestimationofadrivingsafetytolerancezoneusingimbalancedmachinelearning