Cargando…

Enhanced HMAX model with feedforward feature learning for multiclass categorization

In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and f...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yinlin, Wu, Wei, Zhang, Bo, Li, Fengfu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4595662/
https://www.ncbi.nlm.nih.gov/pubmed/26500532
http://dx.doi.org/10.3389/fncom.2015.00123
_version_ 1782393648519839744
author Li, Yinlin
Wu, Wei
Zhang, Bo
Li, Fengfu
author_facet Li, Yinlin
Wu, Wei
Zhang, Bo
Li, Fengfu
author_sort Li, Yinlin
collection PubMed
description In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and functions of V1 to posterior inferotemporal (PIT) layer of the primate visual cortex, which could generate a series of position- and scale- invariant features. However, it could be improved with attention modulation and memory processing, which are two important properties of the primate visual cortex. Thus, in this paper, based on recent biological research on the primate visual cortex, we still mimic the first 100–150 ms of visual cognition to enhance the HMAX model, which mainly focuses on the unsupervised feedforward feature learning process. The main modifications are as follows: (1) To mimic the attention modulation mechanism of V1 layer, a bottom-up saliency map is computed in the S1 layer of the HMAX model, which can support the initial feature extraction for memory processing; (2) To mimic the learning, clustering and short-term memory to long-term memory conversion abilities of V2 and IT, an unsupervised iterative clustering method is used to learn clusters with multiscale middle level patches, which are taken as long-term memory; (3) Inspired by the multiple feature encoding mode of the primate visual cortex, information including color, orientation, and spatial position are encoded in different layers of the HMAX model progressively. By adding a softmax layer at the top of the model, multiclass categorization experiments can be conducted, and the results on Caltech101 show that the enhanced model with a smaller memory size exhibits higher accuracy than the original HMAX model, and could also achieve better accuracy than other unsupervised feature learning methods in multiclass categorization task.
format Online
Article
Text
id pubmed-4595662
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-45956622015-10-23 Enhanced HMAX model with feedforward feature learning for multiclass categorization Li, Yinlin Wu, Wei Zhang, Bo Li, Fengfu Front Comput Neurosci Neuroscience In recent years, the interdisciplinary research between neuroscience and computer vision has promoted the development in both fields. Many biologically inspired visual models are proposed, and among them, the Hierarchical Max-pooling model (HMAX) is a feedforward model mimicking the structures and functions of V1 to posterior inferotemporal (PIT) layer of the primate visual cortex, which could generate a series of position- and scale- invariant features. However, it could be improved with attention modulation and memory processing, which are two important properties of the primate visual cortex. Thus, in this paper, based on recent biological research on the primate visual cortex, we still mimic the first 100–150 ms of visual cognition to enhance the HMAX model, which mainly focuses on the unsupervised feedforward feature learning process. The main modifications are as follows: (1) To mimic the attention modulation mechanism of V1 layer, a bottom-up saliency map is computed in the S1 layer of the HMAX model, which can support the initial feature extraction for memory processing; (2) To mimic the learning, clustering and short-term memory to long-term memory conversion abilities of V2 and IT, an unsupervised iterative clustering method is used to learn clusters with multiscale middle level patches, which are taken as long-term memory; (3) Inspired by the multiple feature encoding mode of the primate visual cortex, information including color, orientation, and spatial position are encoded in different layers of the HMAX model progressively. By adding a softmax layer at the top of the model, multiclass categorization experiments can be conducted, and the results on Caltech101 show that the enhanced model with a smaller memory size exhibits higher accuracy than the original HMAX model, and could also achieve better accuracy than other unsupervised feature learning methods in multiclass categorization task. Frontiers Media S.A. 2015-10-07 /pmc/articles/PMC4595662/ /pubmed/26500532 http://dx.doi.org/10.3389/fncom.2015.00123 Text en Copyright © 2015 Li, Wu, Zhang and Li. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Li, Yinlin
Wu, Wei
Zhang, Bo
Li, Fengfu
Enhanced HMAX model with feedforward feature learning for multiclass categorization
title Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_full Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_fullStr Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_full_unstemmed Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_short Enhanced HMAX model with feedforward feature learning for multiclass categorization
title_sort enhanced hmax model with feedforward feature learning for multiclass categorization
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4595662/
https://www.ncbi.nlm.nih.gov/pubmed/26500532
http://dx.doi.org/10.3389/fncom.2015.00123
work_keys_str_mv AT liyinlin enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization
AT wuwei enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization
AT zhangbo enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization
AT lifengfu enhancedhmaxmodelwithfeedforwardfeaturelearningformulticlasscategorization