Cargando…

Traffic accident duration prediction using text mining and ensemble learning on expressways

Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Jiaona, Tao, Weijun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744849/
https://www.ncbi.nlm.nih.gov/pubmed/36509866
http://dx.doi.org/10.1038/s41598-022-25988-4
_version_ 1784849016501043200
author Chen, Jiaona
Tao, Weijun
author_facet Chen, Jiaona
Tao, Weijun
author_sort Chen, Jiaona
collection PubMed
description Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the characteristics of an accident that are initially available. This paper uses text data fusing and ensemble learning algorithms to build a model to predict an accident’s duration, and a preprocessing scheme of accident duration text data is established. Next, the random forest (RF) algorithm is applied to select feature variables of text data related to the traffic incident duration. Last, a text feature vector is introduced to models such as decision tree, k nearest neighbor, support vector regression, random forest, Gradient Boosting Decision Tree, and Xtreme Gradient Boosting. Our results show that the improved RF model has good prediction accuracy with RMSE, MAPE and R(2). From this, the textual factors important to determining the duration of the accident are identified. Further, we investigated that the cumulative importance of 60% is sufficient for traffic accident prediction using text data. These results provide insights into minimizing traffic congestion related to accidents and contribute to the input optimization in text prediction.
format Online
Article
Text
id pubmed-9744849
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97448492022-12-14 Traffic accident duration prediction using text mining and ensemble learning on expressways Chen, Jiaona Tao, Weijun Sci Rep Article Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the characteristics of an accident that are initially available. This paper uses text data fusing and ensemble learning algorithms to build a model to predict an accident’s duration, and a preprocessing scheme of accident duration text data is established. Next, the random forest (RF) algorithm is applied to select feature variables of text data related to the traffic incident duration. Last, a text feature vector is introduced to models such as decision tree, k nearest neighbor, support vector regression, random forest, Gradient Boosting Decision Tree, and Xtreme Gradient Boosting. Our results show that the improved RF model has good prediction accuracy with RMSE, MAPE and R(2). From this, the textual factors important to determining the duration of the accident are identified. Further, we investigated that the cumulative importance of 60% is sufficient for traffic accident prediction using text data. These results provide insights into minimizing traffic congestion related to accidents and contribute to the input optimization in text prediction. Nature Publishing Group UK 2022-12-12 /pmc/articles/PMC9744849/ /pubmed/36509866 http://dx.doi.org/10.1038/s41598-022-25988-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chen, Jiaona
Tao, Weijun
Traffic accident duration prediction using text mining and ensemble learning on expressways
title Traffic accident duration prediction using text mining and ensemble learning on expressways
title_full Traffic accident duration prediction using text mining and ensemble learning on expressways
title_fullStr Traffic accident duration prediction using text mining and ensemble learning on expressways
title_full_unstemmed Traffic accident duration prediction using text mining and ensemble learning on expressways
title_short Traffic accident duration prediction using text mining and ensemble learning on expressways
title_sort traffic accident duration prediction using text mining and ensemble learning on expressways
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744849/
https://www.ncbi.nlm.nih.gov/pubmed/36509866
http://dx.doi.org/10.1038/s41598-022-25988-4
work_keys_str_mv AT chenjiaona trafficaccidentdurationpredictionusingtextminingandensemblelearningonexpressways
AT taoweijun trafficaccidentdurationpredictionusingtextminingandensemblelearningonexpressways