Cargando…

Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization

Limited training data is one of the biggest challenges in the industrial application of deep learning. Generating synthetic training images is a promising solution in computer vision; however, minimizing the domain gap between synthetic and real-world images remains a problem. Therefore, based on a...

Descripción completa

Detalles Bibliográficos
Autores principales: Eversberg, Leon, Lambrecht, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659618/
https://www.ncbi.nlm.nih.gov/pubmed/34883900
http://dx.doi.org/10.3390/s21237901
_version_ 1784613006076805120
author Eversberg, Leon
Lambrecht, Jens
author_facet Eversberg, Leon
Lambrecht, Jens
author_sort Eversberg, Leon
collection PubMed
description Limited training data is one of the biggest challenges in the industrial application of deep learning. Generating synthetic training images is a promising solution in computer vision; however, minimizing the domain gap between synthetic and real-world images remains a problem. Therefore, based on a real-world application, we explored the generation of images with physics-based rendering for an industrial object detection task. Setting up the render engine’s environment requires a lot of choices and parameters. One fundamental question is whether to apply the concept of domain randomization or use domain knowledge to try and achieve photorealism. To answer this question, we compared different strategies for setting up lighting, background, object texture, additional foreground objects and bounding box computation in a data-centric approach. We compared the resulting average precision from generated images with different levels of realism and variability. In conclusion, we found that domain randomization is a viable strategy for the detection of industrial objects. However, domain knowledge can be used for object-related aspects to improve detection performance. Based on our results, we provide guidelines and an open-source tool for the generation of synthetic images for new industrial applications.
format Online
Article
Text
id pubmed-8659618
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86596182021-12-10 Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization Eversberg, Leon Lambrecht, Jens Sensors (Basel) Article Limited training data is one of the biggest challenges in the industrial application of deep learning. Generating synthetic training images is a promising solution in computer vision; however, minimizing the domain gap between synthetic and real-world images remains a problem. Therefore, based on a real-world application, we explored the generation of images with physics-based rendering for an industrial object detection task. Setting up the render engine’s environment requires a lot of choices and parameters. One fundamental question is whether to apply the concept of domain randomization or use domain knowledge to try and achieve photorealism. To answer this question, we compared different strategies for setting up lighting, background, object texture, additional foreground objects and bounding box computation in a data-centric approach. We compared the resulting average precision from generated images with different levels of realism and variability. In conclusion, we found that domain randomization is a viable strategy for the detection of industrial objects. However, domain knowledge can be used for object-related aspects to improve detection performance. Based on our results, we provide guidelines and an open-source tool for the generation of synthetic images for new industrial applications. MDPI 2021-11-26 /pmc/articles/PMC8659618/ /pubmed/34883900 http://dx.doi.org/10.3390/s21237901 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Eversberg, Leon
Lambrecht, Jens
Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title_full Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title_fullStr Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title_full_unstemmed Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title_short Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization
title_sort generating images with physics-based rendering for an industrial object detection task: realism versus domain randomization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659618/
https://www.ncbi.nlm.nih.gov/pubmed/34883900
http://dx.doi.org/10.3390/s21237901
work_keys_str_mv AT eversbergleon generatingimageswithphysicsbasedrenderingforanindustrialobjectdetectiontaskrealismversusdomainrandomization
AT lambrechtjens generatingimageswithphysicsbasedrenderingforanindustrialobjectdetectiontaskrealismversusdomainrandomization