Cargando…

Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework

In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as thos...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Chen, Kou, Siyuan, Song, Yanchao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514428/
http://dx.doi.org/10.3390/e21111084
_version_ 1783586585888948224
author Wang, Chen
Kou, Siyuan
Song, Yanchao
author_facet Wang, Chen
Kou, Siyuan
Song, Yanchao
author_sort Wang, Chen
collection PubMed
description In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as those with at-fault crash involvement, while others (i.e., non-at-fault or without crash involvement) were considered as non-high-risk (NHR) riders, based on quasi-induced exposure theory. Then, for e-bike riders, their demographics and previous violation-related features were developed based on the crash/violation records. After that, a systematic machine learning (ML) framework was proposed so as to capture the complex risk patterns of those e-bike riders. An ensemble sampling method was selected to deal with the imbalanced datasets. Four tree-structured machine learning methods were compared, and a gradient boost decision tree (GBDT) appeared to be the best. The feature importance and partial dependence were further examined. Interesting findings include the following: (1) tree-structured ML models are able to capture complex risk patterns and interpret them properly; (2) spatial-temporal violation features were found as important indicators of high-risk e-bike riders; and (3) violation behavior features appeared to be more effective than violation punishment-related features, in terms of identifying high-risk e-bike riders. In general, the proposed ML framework is able to identify the complex crash risk pattern of e-bike riders. This paper provides useful insights for policy-makers and traffic practitioners regarding e-bike safety improvement in China.
format Online
Article
Text
id pubmed-7514428
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75144282020-11-09 Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework Wang, Chen Kou, Siyuan Song, Yanchao Entropy (Basel) Article In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as those with at-fault crash involvement, while others (i.e., non-at-fault or without crash involvement) were considered as non-high-risk (NHR) riders, based on quasi-induced exposure theory. Then, for e-bike riders, their demographics and previous violation-related features were developed based on the crash/violation records. After that, a systematic machine learning (ML) framework was proposed so as to capture the complex risk patterns of those e-bike riders. An ensemble sampling method was selected to deal with the imbalanced datasets. Four tree-structured machine learning methods were compared, and a gradient boost decision tree (GBDT) appeared to be the best. The feature importance and partial dependence were further examined. Interesting findings include the following: (1) tree-structured ML models are able to capture complex risk patterns and interpret them properly; (2) spatial-temporal violation features were found as important indicators of high-risk e-bike riders; and (3) violation behavior features appeared to be more effective than violation punishment-related features, in terms of identifying high-risk e-bike riders. In general, the proposed ML framework is able to identify the complex crash risk pattern of e-bike riders. This paper provides useful insights for policy-makers and traffic practitioners regarding e-bike safety improvement in China. MDPI 2019-11-06 /pmc/articles/PMC7514428/ http://dx.doi.org/10.3390/e21111084 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Chen
Kou, Siyuan
Song, Yanchao
Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title_full Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title_fullStr Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title_full_unstemmed Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title_short Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
title_sort identify risk pattern of e-bike riders in china based on machine learning framework
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514428/
http://dx.doi.org/10.3390/e21111084
work_keys_str_mv AT wangchen identifyriskpatternofebikeridersinchinabasedonmachinelearningframework
AT kousiyuan identifyriskpatternofebikeridersinchinabasedonmachinelearningframework
AT songyanchao identifyriskpatternofebikeridersinchinabasedonmachinelearningframework