Cargando…

GTASynth: 3D synthetic data of outdoor non-urban environments.

Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in n...

Descripción completa

Detalles Bibliográficos
Autores principales: Curnis, Giovanni, Fontana, Simone, Sorrenti, Domenico G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241091/
https://www.ncbi.nlm.nih.gov/pubmed/35781982
http://dx.doi.org/10.1016/j.dib.2022.108412
_version_ 1784737718379479040
author Curnis, Giovanni
Fontana, Simone
Sorrenti, Domenico G.
author_facet Curnis, Giovanni
Fontana, Simone
Sorrenti, Domenico G.
author_sort Curnis, Giovanni
collection PubMed
description Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in non-urban outdoor environments poses several challenges. First of all, off-road robots are usually very expensive. Above all, producing an accurate ground truth is problematic. Even the best sensor available, i.e. RTK GPS, cannot guarantee the required accuracy in every condition. Hence the scarcity of this kind of dataset for point clouds registration or SLAM in off-road conditions. For these reasons, we propose a synthetic dataset generated using Grand Theft Auto V (GTAV), a video game that accurately simulates sensing in outdoor environments. The data production technique is based on DeepGTAV-PreSIL [1]: a simulated LiDAR and a camera are installed on a vehicle which is driven through the GTAV map. Since one of the goals of our work is to produce a large amount of data to train neural networks which will then be used with real data, we have chosen the characteristics of the sensors to accurately simulate real ones. The proposed dataset is composed of 16.207 point clouds and images, divided into five sequences representing different environments, such as fields, woods and mountains. For each pair of point clouds and images we also provide the ground truth pose of the vehicle at the acquisition.
format Online
Article
Text
id pubmed-9241091
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-92410912022-06-30 GTASynth: 3D synthetic data of outdoor non-urban environments. Curnis, Giovanni Fontana, Simone Sorrenti, Domenico G. Data Brief Data Article Developing point clouds registration, SLAM or place recognition algorithms requires data with a high quality ground truth (usually composed of a position and orientation). Moreover, many machine learning algorithms require large amounts of data for training. However, acquiring this kind of data in non-urban outdoor environments poses several challenges. First of all, off-road robots are usually very expensive. Above all, producing an accurate ground truth is problematic. Even the best sensor available, i.e. RTK GPS, cannot guarantee the required accuracy in every condition. Hence the scarcity of this kind of dataset for point clouds registration or SLAM in off-road conditions. For these reasons, we propose a synthetic dataset generated using Grand Theft Auto V (GTAV), a video game that accurately simulates sensing in outdoor environments. The data production technique is based on DeepGTAV-PreSIL [1]: a simulated LiDAR and a camera are installed on a vehicle which is driven through the GTAV map. Since one of the goals of our work is to produce a large amount of data to train neural networks which will then be used with real data, we have chosen the characteristics of the sensors to accurately simulate real ones. The proposed dataset is composed of 16.207 point clouds and images, divided into five sequences representing different environments, such as fields, woods and mountains. For each pair of point clouds and images we also provide the ground truth pose of the vehicle at the acquisition. Elsevier 2022-06-22 /pmc/articles/PMC9241091/ /pubmed/35781982 http://dx.doi.org/10.1016/j.dib.2022.108412 Text en © 2022 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Curnis, Giovanni
Fontana, Simone
Sorrenti, Domenico G.
GTASynth: 3D synthetic data of outdoor non-urban environments.
title GTASynth: 3D synthetic data of outdoor non-urban environments.
title_full GTASynth: 3D synthetic data of outdoor non-urban environments.
title_fullStr GTASynth: 3D synthetic data of outdoor non-urban environments.
title_full_unstemmed GTASynth: 3D synthetic data of outdoor non-urban environments.
title_short GTASynth: 3D synthetic data of outdoor non-urban environments.
title_sort gtasynth: 3d synthetic data of outdoor non-urban environments.
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241091/
https://www.ncbi.nlm.nih.gov/pubmed/35781982
http://dx.doi.org/10.1016/j.dib.2022.108412
work_keys_str_mv AT curnisgiovanni gtasynth3dsyntheticdataofoutdoornonurbanenvironments
AT fontanasimone gtasynth3dsyntheticdataofoutdoornonurbanenvironments
AT sorrentidomenicog gtasynth3dsyntheticdataofoutdoornonurbanenvironments