Cargando…

A3CarScene: An audio-visual dataset for driving scene understanding

Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor s...

Descripción completa

Detalles Bibliográficos
Autores principales: Cantarini, Michela, Gabrielli, Leonardo, Mancini, Adriano, Squartini, Stefano, Longo, Roberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10148019/
https://www.ncbi.nlm.nih.gov/pubmed/37128585
http://dx.doi.org/10.1016/j.dib.2023.109146
_version_ 1785034909938614272
author Cantarini, Michela
Gabrielli, Leonardo
Mancini, Adriano
Squartini, Stefano
Longo, Roberto
author_facet Cantarini, Michela
Gabrielli, Leonardo
Mancini, Adriano
Squartini, Stefano
Longo, Roberto
author_sort Cantarini, Michela
collection PubMed
description Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed). This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving.
format Online
Article
Text
id pubmed-10148019
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-101480192023-04-30 A3CarScene: An audio-visual dataset for driving scene understanding Cantarini, Michela Gabrielli, Leonardo Mancini, Adriano Squartini, Stefano Longo, Roberto Data Brief Data Article Accurate perception and awareness of the environment surrounding the automobile is a challenge in automotive research. This article presents A3CarScene, a dataset recorded while driving a research vehicle equipped with audio and video sensors on public roads in the Marche Region, Italy. The sensor suite includes eight microphones installed inside and outside the passenger compartment and two dashcams mounted on the front and rear windows. Approximately 31 h of data for each device were collected during October and November 2022 by driving about 1500 km along diverse roads and landscapes, in variable weather conditions, in daytime and nighttime hours. All key information for the scene understanding process of automated vehicles has been accurately annotated. For each route, annotations with beginning and end timestamps report the type of road traveled (motorway, trunk, primary, secondary, tertiary, residential, and service roads), the degree of urbanization of the area (city, town, suburban area, village, exurban and rural areas), the weather conditions (clear, cloudy, overcast, and rainy), the level of lighting (daytime, evening, night, and tunnel), the type (asphalt or cobblestones) and moisture status (dry or wet) of the road pavement, and the state of the windows (open or closed). This large-scale dataset is valuable for developing new driving assistance technologies based on audio or video data alone or in a multimodal manner and for improving the performance of systems currently in use. The data acquisition process with sensors in multiple locations allows for the assessment of the best installation placement concerning the task. Deep learning engineers can use this dataset to build new baselines, as a comparative benchmark, and to extend existing databases for autonomous driving. Elsevier 2023-04-12 /pmc/articles/PMC10148019/ /pubmed/37128585 http://dx.doi.org/10.1016/j.dib.2023.109146 Text en © 2023 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Cantarini, Michela
Gabrielli, Leonardo
Mancini, Adriano
Squartini, Stefano
Longo, Roberto
A3CarScene: An audio-visual dataset for driving scene understanding
title A3CarScene: An audio-visual dataset for driving scene understanding
title_full A3CarScene: An audio-visual dataset for driving scene understanding
title_fullStr A3CarScene: An audio-visual dataset for driving scene understanding
title_full_unstemmed A3CarScene: An audio-visual dataset for driving scene understanding
title_short A3CarScene: An audio-visual dataset for driving scene understanding
title_sort a3carscene: an audio-visual dataset for driving scene understanding
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10148019/
https://www.ncbi.nlm.nih.gov/pubmed/37128585
http://dx.doi.org/10.1016/j.dib.2023.109146
work_keys_str_mv AT cantarinimichela a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT gabriellileonardo a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT manciniadriano a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT squartinistefano a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding
AT longoroberto a3carsceneanaudiovisualdatasetfordrivingsceneunderstanding