Cargando…

LLM Multimodal Traffic Accident Forecasting

With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) dr...

Descripción completa

Detalles Bibliográficos
Autores principales: de Zarzà, I., de Curtò, J., Roig, Gemma, Calafate, Carlos T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674612/
https://www.ncbi.nlm.nih.gov/pubmed/38005612
http://dx.doi.org/10.3390/s23229225
_version_ 1785149728505200640
author de Zarzà, I.
de Curtò, J.
Roig, Gemma
Calafate, Carlos T.
author_facet de Zarzà, I.
de Curtò, J.
Roig, Gemma
Calafate, Carlos T.
author_sort de Zarzà, I.
collection PubMed
description With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) driving assistants with actionable visual and language cues. Using a rich dataset detailing accident occurrences, we juxtaposed the Transformer model against traditional time series models like ARIMA and the more recent Prophet model. Additionally, through detailed analysis, we delved deep into feature importance using principal component analysis (PCA) loadings, uncovering key factors contributing to accidents. We introduce the idea of using real-time interventions with large language models (LLMs) in autonomous driving with the use of lightweight compact LLMs like LLaMA-2 and Zephyr-7b- [Formula: see text]. Our exploration extends to the realm of multimodality, through the use of Large Language-and-Vision Assistant (LLaVA)—a bridge between visual and linguistic cues by means of a Visual Language Model (VLM)—in conjunction with deep probabilistic reasoning, enhancing the real-time responsiveness of autonomous driving systems. In this study, we elucidate the advantages of employing large multimodal models within DL and deep probabilistic programming for enhancing the performance and usability of time series forecasting and feature weight importance, particularly in a self-driving scenario. This work paves the way for safer, smarter cities, underpinned by data-driven decision making.
format Online
Article
Text
id pubmed-10674612
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106746122023-11-16 LLM Multimodal Traffic Accident Forecasting de Zarzà, I. de Curtò, J. Roig, Gemma Calafate, Carlos T. Sensors (Basel) Article With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) driving assistants with actionable visual and language cues. Using a rich dataset detailing accident occurrences, we juxtaposed the Transformer model against traditional time series models like ARIMA and the more recent Prophet model. Additionally, through detailed analysis, we delved deep into feature importance using principal component analysis (PCA) loadings, uncovering key factors contributing to accidents. We introduce the idea of using real-time interventions with large language models (LLMs) in autonomous driving with the use of lightweight compact LLMs like LLaMA-2 and Zephyr-7b- [Formula: see text]. Our exploration extends to the realm of multimodality, through the use of Large Language-and-Vision Assistant (LLaVA)—a bridge between visual and linguistic cues by means of a Visual Language Model (VLM)—in conjunction with deep probabilistic reasoning, enhancing the real-time responsiveness of autonomous driving systems. In this study, we elucidate the advantages of employing large multimodal models within DL and deep probabilistic programming for enhancing the performance and usability of time series forecasting and feature weight importance, particularly in a self-driving scenario. This work paves the way for safer, smarter cities, underpinned by data-driven decision making. MDPI 2023-11-16 /pmc/articles/PMC10674612/ /pubmed/38005612 http://dx.doi.org/10.3390/s23229225 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
de Zarzà, I.
de Curtò, J.
Roig, Gemma
Calafate, Carlos T.
LLM Multimodal Traffic Accident Forecasting
title LLM Multimodal Traffic Accident Forecasting
title_full LLM Multimodal Traffic Accident Forecasting
title_fullStr LLM Multimodal Traffic Accident Forecasting
title_full_unstemmed LLM Multimodal Traffic Accident Forecasting
title_short LLM Multimodal Traffic Accident Forecasting
title_sort llm multimodal traffic accident forecasting
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674612/
https://www.ncbi.nlm.nih.gov/pubmed/38005612
http://dx.doi.org/10.3390/s23229225
work_keys_str_mv AT dezarzai llmmultimodaltrafficaccidentforecasting
AT decurtoj llmmultimodaltrafficaccidentforecasting
AT roiggemma llmmultimodaltrafficaccidentforecasting
AT calafatecarlost llmmultimodaltrafficaccidentforecasting