Cargando…

Sparsity-Regularized HMAX for Visual Recognition

About ten years ago, HMAX was proposed as a simple and biologically feasible model for object recognition, based on how the visual cortex processes information. However, the model does not encompass sparse firing, which is a hallmark of neurons at all stages of the visual pathway. The current paper...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hu, Xiaolin, Zhang, Jianwei, Li, Jianmin, Zhang, Bo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879257/ https://www.ncbi.nlm.nih.gov/pubmed/24392078 http://dx.doi.org/10.1371/journal.pone.0081813

_version_	1782297947273166848
author	Hu, Xiaolin Zhang, Jianwei Li, Jianmin Zhang, Bo
author_facet	Hu, Xiaolin Zhang, Jianwei Li, Jianmin Zhang, Bo
author_sort	Hu, Xiaolin
collection	PubMed
description	About ten years ago, HMAX was proposed as a simple and biologically feasible model for object recognition, based on how the visual cortex processes information. However, the model does not encompass sparse firing, which is a hallmark of neurons at all stages of the visual pathway. The current paper presents an improved model, called sparse HMAX, which integrates sparse firing. This model is able to learn higher-level features of objects on unlabeled training images. Unlike most other deep learning models that explicitly address global structure of images in every layer, sparse HMAX addresses local to global structure gradually along the hierarchy by applying patch-based learning to the output of the previous layer. As a consequence, the learning method can be standard sparse coding (SSC) or independent component analysis (ICA), two techniques deeply rooted in neuroscience. What makes SSC and ICA applicable at higher levels is the introduction of linear higher-order statistical regularities by max pooling. After training, high-level units display sparse, invariant selectivity for particular individuals or for image categories like those observed in human inferior temporal cortex (ITC) and medial temporal lobe (MTL). Finally, on an image classification benchmark, sparse HMAX outperforms the original HMAX by a large margin, suggesting its great potential for computer vision.
format	Online Article Text
id	pubmed-3879257
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-38792572014-01-03 Sparsity-Regularized HMAX for Visual Recognition Hu, Xiaolin Zhang, Jianwei Li, Jianmin Zhang, Bo PLoS One Research Article About ten years ago, HMAX was proposed as a simple and biologically feasible model for object recognition, based on how the visual cortex processes information. However, the model does not encompass sparse firing, which is a hallmark of neurons at all stages of the visual pathway. The current paper presents an improved model, called sparse HMAX, which integrates sparse firing. This model is able to learn higher-level features of objects on unlabeled training images. Unlike most other deep learning models that explicitly address global structure of images in every layer, sparse HMAX addresses local to global structure gradually along the hierarchy by applying patch-based learning to the output of the previous layer. As a consequence, the learning method can be standard sparse coding (SSC) or independent component analysis (ICA), two techniques deeply rooted in neuroscience. What makes SSC and ICA applicable at higher levels is the introduction of linear higher-order statistical regularities by max pooling. After training, high-level units display sparse, invariant selectivity for particular individuals or for image categories like those observed in human inferior temporal cortex (ITC) and medial temporal lobe (MTL). Finally, on an image classification benchmark, sparse HMAX outperforms the original HMAX by a large margin, suggesting its great potential for computer vision. Public Library of Science 2014-01-02 /pmc/articles/PMC3879257/ /pubmed/24392078 http://dx.doi.org/10.1371/journal.pone.0081813 Text en © 2014 Hu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Hu, Xiaolin Zhang, Jianwei Li, Jianmin Zhang, Bo Sparsity-Regularized HMAX for Visual Recognition
title	Sparsity-Regularized HMAX for Visual Recognition
title_full	Sparsity-Regularized HMAX for Visual Recognition
title_fullStr	Sparsity-Regularized HMAX for Visual Recognition
title_full_unstemmed	Sparsity-Regularized HMAX for Visual Recognition
title_short	Sparsity-Regularized HMAX for Visual Recognition
title_sort	sparsity-regularized hmax for visual recognition
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879257/ https://www.ncbi.nlm.nih.gov/pubmed/24392078 http://dx.doi.org/10.1371/journal.pone.0081813
work_keys_str_mv	AT huxiaolin sparsityregularizedhmaxforvisualrecognition AT zhangjianwei sparsityregularizedhmaxforvisualrecognition AT lijianmin sparsityregularizedhmaxforvisualrecognition AT zhangbo sparsityregularizedhmaxforvisualrecognition

Sparsity-Regularized HMAX for Visual Recognition

Ejemplares similares