Cargando…

GTASynth: 3D synthetic data of outdoor non-urban environments.

Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in n...

Descripción completa

Detalles Bibliográficos
Autores principales:	Curnis, Giovanni, Fontana, Simone, Sorrenti, Domenico G.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2022
Materias:	Data Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241091/ https://www.ncbi.nlm.nih.gov/pubmed/35781982 http://dx.doi.org/10.1016/j.dib.2022.108412

_version_	1784737718379479040
author	Curnis, Giovanni Fontana, Simone Sorrenti, Domenico G.
author_facet	Curnis, Giovanni Fontana, Simone Sorrenti, Domenico G.
author_sort	Curnis, Giovanni
collection	PubMed
description	Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in non-urban outdoor environments poses several challenges. First of all, off-road robots are usually very expensive. Above all, producing an accurate ground truth is problematic. Even the best sensor available, i.e. RTK GPS, cannot guarantee the required accuracy in every condition. Hence the scarcity of this kind of dataset for point clouds registration or SLAM in off-road conditions. For these reasons, we propose a synthetic dataset generated using Grand Theft Auto V (GTAV), a video game that accurately simulates sensing in outdoor environments. The data production technique is based on DeepGTAV-PreSIL [1]: a simulated LiDAR and a camera are installed on a vehicle which is driven through the GTAV map. Since one of the goals of our work is to produce a large amount of data to train neural networks which will then be used with real data, we have chosen the characteristics of the sensors to accurately simulate real ones. The proposed dataset is composed of 16.207 point clouds and images, divided into five sequences representing different environments, such as fields, woods and mountains. For each pair of point clouds and images we also provide the ground truth pose of the vehicle at the acquisition.
format	Online Article Text
id	pubmed-9241091
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-92410912022-06-30 GTASynth: 3D synthetic data of outdoor non-urban environments. Curnis, Giovanni Fontana, Simone Sorrenti, Domenico G. Data Brief Data Article Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in non-urban outdoor environments poses several challenges. First of all, off-road robots are usually very expensive. Above all, producing an accurate ground truth is problematic. Even the best sensor available, i.e. RTK GPS, cannot guarantee the required accuracy in every condition. Hence the scarcity of this kind of dataset for point clouds registration or SLAM in off-road conditions. For these reasons, we propose a synthetic dataset generated using Grand Theft Auto V (GTAV), a video game that accurately simulates sensing in outdoor environments. The data production technique is based on DeepGTAV-PreSIL [1]: a simulated LiDAR and a camera are installed on a vehicle which is driven through the GTAV map. Since one of the goals of our work is to produce a large amount of data to train neural networks which will then be used with real data, we have chosen the characteristics of the sensors to accurately simulate real ones. The proposed dataset is composed of 16.207 point clouds and images, divided into five sequences representing different environments, such as fields, woods and mountains. For each pair of point clouds and images we also provide the ground truth pose of the vehicle at the acquisition. Elsevier 2022-06-22 /pmc/articles/PMC9241091/ /pubmed/35781982 http://dx.doi.org/10.1016/j.dib.2022.108412 Text en © 2022 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Data Article Curnis, Giovanni Fontana, Simone Sorrenti, Domenico G. GTASynth: 3D synthetic data of outdoor non-urban environments.
title	GTASynth: 3D synthetic data of outdoor non-urban environments.
title_full	GTASynth: 3D synthetic data of outdoor non-urban environments.
title_fullStr	GTASynth: 3D synthetic data of outdoor non-urban environments.
title_full_unstemmed	GTASynth: 3D synthetic data of outdoor non-urban environments.
title_short	GTASynth: 3D synthetic data of outdoor non-urban environments.
title_sort	gtasynth: 3d synthetic data of outdoor non-urban environments.
topic	Data Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241091/ https://www.ncbi.nlm.nih.gov/pubmed/35781982 http://dx.doi.org/10.1016/j.dib.2022.108412
work_keys_str_mv	AT curnisgiovanni gtasynth3dsyntheticdataofoutdoornonurbanenvironments AT fontanasimone gtasynth3dsyntheticdataofoutdoornonurbanenvironments AT sorrentidomenicog gtasynth3dsyntheticdataofoutdoornonurbanenvironments

GTASynth: 3D synthetic data of outdoor non-urban environments.

Ejemplares similares