Cargando…
Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework
In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as thos...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514428/ http://dx.doi.org/10.3390/e21111084 |
_version_ | 1783586585888948224 |
---|---|
author | Wang, Chen Kou, Siyuan Song, Yanchao |
author_facet | Wang, Chen Kou, Siyuan Song, Yanchao |
author_sort | Wang, Chen |
collection | PubMed |
description | In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as those with at-fault crash involvement, while others (i.e., non-at-fault or without crash involvement) were considered as non-high-risk (NHR) riders, based on quasi-induced exposure theory. Then, for e-bike riders, their demographics and previous violation-related features were developed based on the crash/violation records. After that, a systematic machine learning (ML) framework was proposed so as to capture the complex risk patterns of those e-bike riders. An ensemble sampling method was selected to deal with the imbalanced datasets. Four tree-structured machine learning methods were compared, and a gradient boost decision tree (GBDT) appeared to be the best. The feature importance and partial dependence were further examined. Interesting findings include the following: (1) tree-structured ML models are able to capture complex risk patterns and interpret them properly; (2) spatial-temporal violation features were found as important indicators of high-risk e-bike riders; and (3) violation behavior features appeared to be more effective than violation punishment-related features, in terms of identifying high-risk e-bike riders. In general, the proposed ML framework is able to identify the complex crash risk pattern of e-bike riders. This paper provides useful insights for policy-makers and traffic practitioners regarding e-bike safety improvement in China. |
format | Online Article Text |
id | pubmed-7514428 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75144282020-11-09 Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework Wang, Chen Kou, Siyuan Song, Yanchao Entropy (Basel) Article In this paper, the risk pattern of e-bike riders in China was examined, based on tree-structured machine learning techniques. Three-year crash/violation data were acquired from the Kunshan traffic police department, China. Firstly, high-risk (HR) electric bicycle (e-bike) riders were defined as those with at-fault crash involvement, while others (i.e., non-at-fault or without crash involvement) were considered as non-high-risk (NHR) riders, based on quasi-induced exposure theory. Then, for e-bike riders, their demographics and previous violation-related features were developed based on the crash/violation records. After that, a systematic machine learning (ML) framework was proposed so as to capture the complex risk patterns of those e-bike riders. An ensemble sampling method was selected to deal with the imbalanced datasets. Four tree-structured machine learning methods were compared, and a gradient boost decision tree (GBDT) appeared to be the best. The feature importance and partial dependence were further examined. Interesting findings include the following: (1) tree-structured ML models are able to capture complex risk patterns and interpret them properly; (2) spatial-temporal violation features were found as important indicators of high-risk e-bike riders; and (3) violation behavior features appeared to be more effective than violation punishment-related features, in terms of identifying high-risk e-bike riders. In general, the proposed ML framework is able to identify the complex crash risk pattern of e-bike riders. This paper provides useful insights for policy-makers and traffic practitioners regarding e-bike safety improvement in China. MDPI 2019-11-06 /pmc/articles/PMC7514428/ http://dx.doi.org/10.3390/e21111084 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Chen Kou, Siyuan Song, Yanchao Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title | Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title_full | Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title_fullStr | Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title_full_unstemmed | Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title_short | Identify Risk Pattern of E-Bike Riders in China Based on Machine Learning Framework |
title_sort | identify risk pattern of e-bike riders in china based on machine learning framework |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7514428/ http://dx.doi.org/10.3390/e21111084 |
work_keys_str_mv | AT wangchen identifyriskpatternofebikeridersinchinabasedonmachinelearningframework AT kousiyuan identifyriskpatternofebikeridersinchinabasedonmachinelearningframework AT songyanchao identifyriskpatternofebikeridersinchinabasedonmachinelearningframework |