Cargando…
Traffic accident duration prediction using text mining and ensemble learning on expressways
Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the ch...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744849/ https://www.ncbi.nlm.nih.gov/pubmed/36509866 http://dx.doi.org/10.1038/s41598-022-25988-4 |
_version_ | 1784849016501043200 |
---|---|
author | Chen, Jiaona Tao, Weijun |
author_facet | Chen, Jiaona Tao, Weijun |
author_sort | Chen, Jiaona |
collection | PubMed |
description | Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the characteristics of an accident that are initially available. This paper uses text data fusing and ensemble learning algorithms to build a model to predict an accident’s duration, and a preprocessing scheme of accident duration text data is established. Next, the random forest (RF) algorithm is applied to select feature variables of text data related to the traffic incident duration. Last, a text feature vector is introduced to models such as decision tree, k nearest neighbor, support vector regression, random forest, Gradient Boosting Decision Tree, and Xtreme Gradient Boosting. Our results show that the improved RF model has good prediction accuracy with RMSE, MAPE and R(2). From this, the textual factors important to determining the duration of the accident are identified. Further, we investigated that the cumulative importance of 60% is sufficient for traffic accident prediction using text data. These results provide insights into minimizing traffic congestion related to accidents and contribute to the input optimization in text prediction. |
format | Online Article Text |
id | pubmed-9744849 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-97448492022-12-14 Traffic accident duration prediction using text mining and ensemble learning on expressways Chen, Jiaona Tao, Weijun Sci Rep Article Predicting traffic accident duration is necessary for ensuring traffic safety. Several attempts have been made to achieve high prediction accuracy, but researchers have not considered traffic accident text data in much detail. The limited text data of the first report on an incident describes the characteristics of an accident that are initially available. This paper uses text data fusing and ensemble learning algorithms to build a model to predict an accident’s duration, and a preprocessing scheme of accident duration text data is established. Next, the random forest (RF) algorithm is applied to select feature variables of text data related to the traffic incident duration. Last, a text feature vector is introduced to models such as decision tree, k nearest neighbor, support vector regression, random forest, Gradient Boosting Decision Tree, and Xtreme Gradient Boosting. Our results show that the improved RF model has good prediction accuracy with RMSE, MAPE and R(2). From this, the textual factors important to determining the duration of the accident are identified. Further, we investigated that the cumulative importance of 60% is sufficient for traffic accident prediction using text data. These results provide insights into minimizing traffic congestion related to accidents and contribute to the input optimization in text prediction. Nature Publishing Group UK 2022-12-12 /pmc/articles/PMC9744849/ /pubmed/36509866 http://dx.doi.org/10.1038/s41598-022-25988-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Chen, Jiaona Tao, Weijun Traffic accident duration prediction using text mining and ensemble learning on expressways |
title | Traffic accident duration prediction using text mining and ensemble learning on expressways |
title_full | Traffic accident duration prediction using text mining and ensemble learning on expressways |
title_fullStr | Traffic accident duration prediction using text mining and ensemble learning on expressways |
title_full_unstemmed | Traffic accident duration prediction using text mining and ensemble learning on expressways |
title_short | Traffic accident duration prediction using text mining and ensemble learning on expressways |
title_sort | traffic accident duration prediction using text mining and ensemble learning on expressways |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744849/ https://www.ncbi.nlm.nih.gov/pubmed/36509866 http://dx.doi.org/10.1038/s41598-022-25988-4 |
work_keys_str_mv | AT chenjiaona trafficaccidentdurationpredictionusingtextminingandensemblelearningonexpressways AT taoweijun trafficaccidentdurationpredictionusingtextminingandensemblelearningonexpressways |