HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments
To assess the potential of current neural network architectures to reliably identify packaged products within a retail environment, we created an open-source dataset of 295 shelf images of vending machines with 10’035 labelled instances of 109 products. The dataset contains photos of vending machine...
Main authors: | Fuchs, K., Grundmann, T., Haldimann, M., Fleisch, E. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2020 |
Subjects: | Data Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7494663/ https://www.ncbi.nlm.nih.gov/pubmed/32984473 http://dx.doi.org/10.1016/j.dib.2020.106280 |
_version_ | 1783582773526659072 |
---|---|
author | Fuchs, K. Grundmann, T. Haldimann, M. Fleisch, E. |
author_sort | Fuchs, K. |
collection | PubMed |
description | To assess the potential of current neural network architectures to reliably identify packaged products within a retail environment, we created an open-source dataset of 295 shelf images of vending machines with 10’035 labelled instances of 109 products. The dataset contains photos of vending machines operated by Selecta, the largest European operator of vending machines. The machines are located in a mix of public spaces and private office spaces and contain both food and beverage products. The product instances in the vending machine images are labelled with bounding boxes, where each bounding box encapsulates the entire product with as little overlap as possible. The label corresponding to each bounding box is structured and human-readable, and includes the brand, product name and size as well as the GTIN of the product. The GTIN is the global standard for identifying products in the retail environment and therefore increases the value of the dataset for the retail industry. Unlike typical object detection datasets, which choose higher-level labels such as "can" or "bottle" for a much wider variety of objects, this dataset uses a far more detailed label that depends less on the shape of the product than on its exact design. The dataset falls into the category of object detection datasets with a large number of objects per image, which, next to the GTIN label, is a main differentiator from other object detection datasets. |
format | Online Article Text |
id | pubmed-7494663 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-7494663 2020-09-24 HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments Fuchs, K. Grundmann, T. Haldimann, M. Fleisch, E. Data Brief Data Article Elsevier 2020-09-08 /pmc/articles/PMC7494663/ /pubmed/32984473 http://dx.doi.org/10.1016/j.dib.2020.106280 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Fuchs, K. Grundmann, T. Haldimann, M. Fleisch, E. HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments |
title | HoloSelecta dataset: 10’035 GTIN-labelled product instances in vending machines for object detection of packaged products in retail environments |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7494663/ https://www.ncbi.nlm.nih.gov/pubmed/32984473 http://dx.doi.org/10.1016/j.dib.2020.106280 |
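
The description in this record says that each bounding box carries a structured label with brand, product name, size and GTIN. A minimal Python sketch of how such a label might be represented and sanity-checked follows. The class name `ProductLabel`, its field names and the pixel-coordinate box layout are assumptions made for illustration and are not taken from the dataset's actual annotation files. The check-digit routine follows the standard GS1 modulo-10 rule for GTIN-8, GTIN-12, GTIN-13 and GTIN-14.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProductLabel:
    """Hypothetical container for one labelled product instance."""
    brand: str
    product_name: str
    size: str
    gtin: str
    # Bounding box corners in pixel coordinates (assumed layout).
    x_min: int
    y_min: int
    x_max: int
    y_max: int


def gtin_check_digit_ok(gtin: str) -> bool:
    """Return True if the last digit matches the GS1 modulo-10 check digit."""
    if not (gtin.isascii() and gtin.isdigit()):
        return False
    if len(gtin) not in (8, 12, 13, 14):
        return False
    digits = [int(c) for c in gtin]
    body, check = digits[:-1], digits[-1]
    # Weights alternate 3, 1, 3, ... starting from the rightmost body digit.
    total = sum(d * (3 if i % 2 == 0 else 1)
                for i, d in enumerate(reversed(body)))
    return (10 - total % 10) % 10 == check


def label_is_plausible(label: ProductLabel) -> bool:
    """Check that the box has positive area and the GTIN is well formed."""
    has_area = label.x_max > label.x_min and label.y_max > label.y_min
    return has_area and gtin_check_digit_ok(label.gtin)
```

A check of this kind only catches malformed identifiers and degenerate boxes. It does not confirm that a GTIN belongs to one of the 109 products in the dataset.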