Cargando…

GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model

Recent studies on unsupervised object detection based on spatial attention have achieved promising results. Models, such as AIR and SPAIR, output “what” and “where” latent variables that represent the attributes and locations of objects in a scene, respectively. Most of the previous studies concentr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Weijin, Shen, Yao, Liu, Mingqian, Aguirre Sanchez, Lizeth Patricia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313923/
https://www.ncbi.nlm.nih.gov/pubmed/35898790
http://dx.doi.org/10.1155/2022/7254462
_version_ 1784754193452498944
author Zhu, Weijin
Shen, Yao
Liu, Mingqian
Aguirre Sanchez, Lizeth Patricia
author_facet Zhu, Weijin
Shen, Yao
Liu, Mingqian
Aguirre Sanchez, Lizeth Patricia
author_sort Zhu, Weijin
collection PubMed
description Recent studies on unsupervised object detection based on spatial attention have achieved promising results. Models, such as AIR and SPAIR, output “what” and “where” latent variables that represent the attributes and locations of objects in a scene, respectively. Most of the previous studies concentrate on the “where” localization performance. However, we claim that acquiring “what” object attributes is also essential for representation learning. This study presents a framework, GMAIR, for unsupervised object detection. It incorporates spatial attention and a Gaussian mixture in a unified deep generative model. GMAIR can locate objects in a scene and simultaneously cluster them without supervision. Furthermore, we analyze the “what” latent variables and clustering process. Finally, we evaluate our model on MultiMNIST and Fruit2D datasets. We show that GMAIR achieves competitive results on localization and clustering compared with state-of-the-art methods.
format Online
Article
Text
id pubmed-9313923
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-93139232022-07-26 GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model Zhu, Weijin Shen, Yao Liu, Mingqian Aguirre Sanchez, Lizeth Patricia Comput Intell Neurosci Research Article Recent studies on unsupervised object detection based on spatial attention have achieved promising results. Models, such as AIR and SPAIR, output “what” and “where” latent variables that represent the attributes and locations of objects in a scene, respectively. Most of the previous studies concentrate on the “where” localization performance. However, we claim that acquiring “what” object attributes is also essential for representation learning. This study presents a framework, GMAIR, for unsupervised object detection. It incorporates spatial attention and a Gaussian mixture in a unified deep generative model. GMAIR can locate objects in a scene and simultaneously cluster them without supervision. Furthermore, we analyze the “what” latent variables and clustering process. Finally, we evaluate our model on MultiMNIST and Fruit2D datasets. We show that GMAIR achieves competitive results on localization and clustering compared with state-of-the-art methods. Hindawi 2022-07-18 /pmc/articles/PMC9313923/ /pubmed/35898790 http://dx.doi.org/10.1155/2022/7254462 Text en Copyright © 2022 Weijin Zhu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhu, Weijin
Shen, Yao
Liu, Mingqian
Aguirre Sanchez, Lizeth Patricia
GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title_full GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title_fullStr GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title_full_unstemmed GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title_short GMAIR: Unsupervised Object Detection Based on Spatial Attention and Gaussian Mixture Model
title_sort gmair: unsupervised object detection based on spatial attention and gaussian mixture model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9313923/
https://www.ncbi.nlm.nih.gov/pubmed/35898790
http://dx.doi.org/10.1155/2022/7254462
work_keys_str_mv AT zhuweijin gmairunsupervisedobjectdetectionbasedonspatialattentionandgaussianmixturemodel
AT shenyao gmairunsupervisedobjectdetectionbasedonspatialattentionandgaussianmixturemodel
AT liumingqian gmairunsupervisedobjectdetectionbasedonspatialattentionandgaussianmixturemodel
AT aguirresanchezlizethpatricia gmairunsupervisedobjectdetectionbasedonspatialattentionandgaussianmixturemodel