Cargando…

Synthetic dataset generation for object-to-model deep learning in industrial applications

The availability of large image data sets has been a crucial factor in the success of deep learning-based classification and detection methods. Yet, while data sets for everyday objects are widely available, data for specific industrial use-cases (e.g., identifying packaged products in a warehouse)...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Matthew Z., Kunii, Kiyohito, Baylis, Max, Ong, Wai Hong, Kroupa, Pavel, Koller, Swen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924434/
https://www.ncbi.nlm.nih.gov/pubmed/33816875
http://dx.doi.org/10.7717/peerj-cs.222
_version_ 1783659088920444928
author Wong, Matthew Z.
Kunii, Kiyohito
Baylis, Max
Ong, Wai Hong
Kroupa, Pavel
Koller, Swen
author_facet Wong, Matthew Z.
Kunii, Kiyohito
Baylis, Max
Ong, Wai Hong
Kroupa, Pavel
Koller, Swen
author_sort Wong, Matthew Z.
collection PubMed
description The availability of large image data sets has been a crucial factor in the success of deep learning-based classification and detection methods. Yet, while data sets for everyday objects are widely available, data for specific industrial use-cases (e.g., identifying packaged products in a warehouse) remains scarce. In such cases, the data sets have to be created from scratch, placing a crucial bottleneck on the deployment of deep learning techniques in industrial applications. We present work carried out in collaboration with a leading UK online supermarket, with the aim of creating a computer vision system capable of detecting and identifying unique supermarket products in a warehouse setting. To this end, we demonstrate a framework for using data synthesis to create an end-to-end deep learning pipeline, beginning with real-world objects and culminating in a trained model. Our method is based on the generation of a synthetic dataset from 3D models obtained by applying photogrammetry techniques to real-world objects. Using 100K synthetic images for 10 classes, an InceptionV3 convolutional neural network was trained, which achieved accuracy of 96% on a separately acquired test set of real supermarket product images. The image generation process supports automatic pixel annotation. This eliminates the prohibitively expensive manual annotation typically required for detection tasks. Based on this readily available data, a one-stage RetinaNet detector was trained on the synthetic, annotated images to produce a detector that can accurately localize and classify the specimen products in real-time.
format Online
Article
Text
id pubmed-7924434
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-79244342021-04-02 Synthetic dataset generation for object-to-model deep learning in industrial applications Wong, Matthew Z. Kunii, Kiyohito Baylis, Max Ong, Wai Hong Kroupa, Pavel Koller, Swen PeerJ Comput Sci Computer Vision The availability of large image data sets has been a crucial factor in the success of deep learning-based classification and detection methods. Yet, while data sets for everyday objects are widely available, data for specific industrial use-cases (e.g., identifying packaged products in a warehouse) remains scarce. In such cases, the data sets have to be created from scratch, placing a crucial bottleneck on the deployment of deep learning techniques in industrial applications. We present work carried out in collaboration with a leading UK online supermarket, with the aim of creating a computer vision system capable of detecting and identifying unique supermarket products in a warehouse setting. To this end, we demonstrate a framework for using data synthesis to create an end-to-end deep learning pipeline, beginning with real-world objects and culminating in a trained model. Our method is based on the generation of a synthetic dataset from 3D models obtained by applying photogrammetry techniques to real-world objects. Using 100K synthetic images for 10 classes, an InceptionV3 convolutional neural network was trained, which achieved accuracy of 96% on a separately acquired test set of real supermarket product images. The image generation process supports automatic pixel annotation. This eliminates the prohibitively expensive manual annotation typically required for detection tasks. Based on this readily available data, a one-stage RetinaNet detector was trained on the synthetic, annotated images to produce a detector that can accurately localize and classify the specimen products in real-time. PeerJ Inc. 2019-10-14 /pmc/articles/PMC7924434/ /pubmed/33816875 http://dx.doi.org/10.7717/peerj-cs.222 Text en © 2019 Wong et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Computer Vision
Wong, Matthew Z.
Kunii, Kiyohito
Baylis, Max
Ong, Wai Hong
Kroupa, Pavel
Koller, Swen
Synthetic dataset generation for object-to-model deep learning in industrial applications
title Synthetic dataset generation for object-to-model deep learning in industrial applications
title_full Synthetic dataset generation for object-to-model deep learning in industrial applications
title_fullStr Synthetic dataset generation for object-to-model deep learning in industrial applications
title_full_unstemmed Synthetic dataset generation for object-to-model deep learning in industrial applications
title_short Synthetic dataset generation for object-to-model deep learning in industrial applications
title_sort synthetic dataset generation for object-to-model deep learning in industrial applications
topic Computer Vision
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924434/
https://www.ncbi.nlm.nih.gov/pubmed/33816875
http://dx.doi.org/10.7717/peerj-cs.222
work_keys_str_mv AT wongmatthewz syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications
AT kuniikiyohito syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications
AT baylismax syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications
AT ongwaihong syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications
AT kroupapavel syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications
AT kollerswen syntheticdatasetgenerationforobjecttomodeldeeplearninginindustrialapplications