Cargando…

LLM Multimodal Traffic Accident Forecasting

With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) dr...

Descripción completa

Detalles Bibliográficos
Autores principales:	de Zarzà, I., de Curtò, J., Roig, Gemma, Calafate, Carlos T.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674612/ https://www.ncbi.nlm.nih.gov/pubmed/38005612 http://dx.doi.org/10.3390/s23229225

_version_	1785149728505200640
author	de Zarzà, I. de Curtò, J. Roig, Gemma Calafate, Carlos T.
author_facet	de Zarzà, I. de Curtò, J. Roig, Gemma Calafate, Carlos T.
author_sort	de Zarzà, I.
collection	PubMed
description	With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) driving assistants with actionable visual and language cues. Using a rich dataset detailing accident occurrences, we juxtaposed the Transformer model against traditional time series models like ARIMA and the more recent Prophet model. Additionally, through detailed analysis, we delved deep into feature importance using principal component analysis (PCA) loadings, uncovering key factors contributing to accidents. We introduce the idea of using real-time interventions with large language models (LLMs) in autonomous driving with the use of lightweight compact LLMs like LLaMA-2 and Zephyr-7b- [Formula: see text]. Our exploration extends to the realm of multimodality, through the use of Large Language-and-Vision Assistant (LLaVA)—a bridge between visual and linguistic cues by means of a Visual Language Model (VLM)—in conjunction with deep probabilistic reasoning, enhancing the real-time responsiveness of autonomous driving systems. In this study, we elucidate the advantages of employing large multimodal models within DL and deep probabilistic programming for enhancing the performance and usability of time series forecasting and feature weight importance, particularly in a self-driving scenario. This work paves the way for safer, smarter cities, underpinned by data-driven decision making.
format	Online Article Text
id	pubmed-10674612
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-106746122023-11-16 LLM Multimodal Traffic Accident Forecasting de Zarzà, I. de Curtò, J. Roig, Gemma Calafate, Carlos T. Sensors (Basel) Article With the rise in traffic congestion in urban centers, predicting accidents has become paramount for city planning and public safety. This work comprehensively studied the efficacy of modern deep learning (DL) methods in forecasting traffic accidents and enhancing Level-4 and Level-5 (L-4 and L-5) driving assistants with actionable visual and language cues. Using a rich dataset detailing accident occurrences, we juxtaposed the Transformer model against traditional time series models like ARIMA and the more recent Prophet model. Additionally, through detailed analysis, we delved deep into feature importance using principal component analysis (PCA) loadings, uncovering key factors contributing to accidents. We introduce the idea of using real-time interventions with large language models (LLMs) in autonomous driving with the use of lightweight compact LLMs like LLaMA-2 and Zephyr-7b- [Formula: see text]. Our exploration extends to the realm of multimodality, through the use of Large Language-and-Vision Assistant (LLaVA)—a bridge between visual and linguistic cues by means of a Visual Language Model (VLM)—in conjunction with deep probabilistic reasoning, enhancing the real-time responsiveness of autonomous driving systems. In this study, we elucidate the advantages of employing large multimodal models within DL and deep probabilistic programming for enhancing the performance and usability of time series forecasting and feature weight importance, particularly in a self-driving scenario. This work paves the way for safer, smarter cities, underpinned by data-driven decision making. MDPI 2023-11-16 /pmc/articles/PMC10674612/ /pubmed/38005612 http://dx.doi.org/10.3390/s23229225 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article de Zarzà, I. de Curtò, J. Roig, Gemma Calafate, Carlos T. LLM Multimodal Traffic Accident Forecasting
title	LLM Multimodal Traffic Accident Forecasting
title_full	LLM Multimodal Traffic Accident Forecasting
title_fullStr	LLM Multimodal Traffic Accident Forecasting
title_full_unstemmed	LLM Multimodal Traffic Accident Forecasting
title_short	LLM Multimodal Traffic Accident Forecasting
title_sort	llm multimodal traffic accident forecasting
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674612/ https://www.ncbi.nlm.nih.gov/pubmed/38005612 http://dx.doi.org/10.3390/s23229225
work_keys_str_mv	AT dezarzai llmmultimodaltrafficaccidentforecasting AT decurtoj llmmultimodaltrafficaccidentforecasting AT roiggemma llmmultimodaltrafficaccidentforecasting AT calafatecarlost llmmultimodaltrafficaccidentforecasting

LLM Multimodal Traffic Accident Forecasting

Ejemplares similares