
HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments

To assess the potential of current neural network architectures to reliably identify packaged products within a retail environment, we created an open-source dataset of 295 shelf images of vending machines with 10’035 labelled instances of 109 products. The dataset contains photos of vending machine...

Bibliographic Details
Main authors: Fuchs, K., Grundmann, T., Haldimann, M., Fleisch, E.
Format: Online Article Text
Language: English
Published: Elsevier 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7494663/
https://www.ncbi.nlm.nih.gov/pubmed/32984473
http://dx.doi.org/10.1016/j.dib.2020.106280
author Fuchs, K.
Grundmann, T.
Haldimann, M.
Fleisch, E.
collection PubMed
description To assess the potential of current neural network architectures to reliably identify packaged products within a retail environment, we created an open-source dataset of 295 shelf images of vending machines with 10’035 labelled instances of 109 products. The dataset contains photos of vending machines from the provider Selecta, the largest European operator of vending machines. The vending machines are a mix of machines in public and private office spaces and contain both food and beverage products. The product instances in the vending machine images are labelled with bounding boxes, where a bounding box encapsulates the entire product with as little overlap as possible. The label for each bounding box is structured and human-readable, comprising brand, product name and size as well as the GTIN of the product. The GTIN is the global standard for identifying products in the retail environment and therefore increases the dataset's value for the retail industry. Unlike typical object detection datasets, which choose higher-level labels such as "can" or "bottle" for a much wider variety of objects, this dataset uses a far more detailed label that depends less on the shape of the product than on its exact design. The dataset falls into the category of object detection datasets with a large number of objects, which, alongside the GTIN label, is a main differentiator from other object detection datasets.
format Online
Article
Text
id pubmed-7494663
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-7494663 2020-09-24 HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments Fuchs, K. Grundmann, T. Haldimann, M. Fleisch, E. Data Brief Data Article
Elsevier 2020-09-08 /pmc/articles/PMC7494663/ /pubmed/32984473 http://dx.doi.org/10.1016/j.dib.2020.106280 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7494663/
https://www.ncbi.nlm.nih.gov/pubmed/32984473
http://dx.doi.org/10.1016/j.dib.2020.106280