Cargando…

Using Variational Multi-view Learning for Classification of Grocery Items

An essential task for computer vision-based assistive technologies is to help visually impaired people to recognize objects in constrained environments, for instance, recognizing food items in grocery stores. In this paper, we introduce a novel dataset with natural images of groceries—fruits, vegeta...

Descripción completa

Detalles Bibliográficos
Autores principales: Klasson, Marcus, Zhang, Cheng, Kjellström, Hedvig
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691398/
https://www.ncbi.nlm.nih.gov/pubmed/33294874
http://dx.doi.org/10.1016/j.patter.2020.100143
_version_ 1783614281921593344
author Klasson, Marcus
Zhang, Cheng
Kjellström, Hedvig
author_facet Klasson, Marcus
Zhang, Cheng
Kjellström, Hedvig
author_sort Klasson, Marcus
collection PubMed
description An essential task for computer vision-based assistive technologies is to help visually impaired people to recognize objects in constrained environments, for instance, recognizing food items in grocery stores. In this paper, we introduce a novel dataset with natural images of groceries—fruits, vegetables, and packaged products—where all images have been taken inside grocery stores to resemble a shopping scenario. Additionally, we download iconic images and text descriptions for each item that can be utilized for better representation learning of groceries. We select a multi-view generative model, which can combine the different item information into lower-dimensional representations. The experiments show that utilizing the additional information yields higher accuracies on classifying grocery items than only using the natural images. We observe that iconic images help to construct representations separated by visual differences of the items, while text descriptions enable the model to distinguish between visually similar items by different ingredients.
format Online
Article
Text
id pubmed-7691398
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-76913982020-12-07 Using Variational Multi-view Learning for Classification of Grocery Items Klasson, Marcus Zhang, Cheng Kjellström, Hedvig Patterns (N Y) Article An essential task for computer vision-based assistive technologies is to help visually impaired people to recognize objects in constrained environments, for instance, recognizing food items in grocery stores. In this paper, we introduce a novel dataset with natural images of groceries—fruits, vegetables, and packaged products—where all images have been taken inside grocery stores to resemble a shopping scenario. Additionally, we download iconic images and text descriptions for each item that can be utilized for better representation learning of groceries. We select a multi-view generative model, which can combine the different item information into lower-dimensional representations. The experiments show that utilizing the additional information yields higher accuracies on classifying grocery items than only using the natural images. We observe that iconic images help to construct representations separated by visual differences of the items, while text descriptions enable the model to distinguish between visually similar items by different ingredients. Elsevier 2020-11-13 /pmc/articles/PMC7691398/ /pubmed/33294874 http://dx.doi.org/10.1016/j.patter.2020.100143 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Klasson, Marcus
Zhang, Cheng
Kjellström, Hedvig
Using Variational Multi-view Learning for Classification of Grocery Items
title Using Variational Multi-view Learning for Classification of Grocery Items
title_full Using Variational Multi-view Learning for Classification of Grocery Items
title_fullStr Using Variational Multi-view Learning for Classification of Grocery Items
title_full_unstemmed Using Variational Multi-view Learning for Classification of Grocery Items
title_short Using Variational Multi-view Learning for Classification of Grocery Items
title_sort using variational multi-view learning for classification of grocery items
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7691398/
https://www.ncbi.nlm.nih.gov/pubmed/33294874
http://dx.doi.org/10.1016/j.patter.2020.100143
work_keys_str_mv AT klassonmarcus usingvariationalmultiviewlearningforclassificationofgroceryitems
AT zhangcheng usingvariationalmultiviewlearningforclassificationofgroceryitems
AT kjellstromhedvig usingvariationalmultiviewlearningforclassificationofgroceryitems