Cargando…

Invariant visual object recognition: biologically plausible approaches

Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet p...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, Leigh, Rolls, Edmund T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572081/
https://www.ncbi.nlm.nih.gov/pubmed/26335743
http://dx.doi.org/10.1007/s00422-015-0658-2
_version_ 1782390390020636672
author Robinson, Leigh
Rolls, Edmund T.
author_facet Robinson, Leigh
Rolls, Edmund T.
author_sort Robinson, Leigh
collection PubMed
description Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet performs object classification with random exemplars comparably to HMAX, except that the final layer C neurons of HMAX have a very non-sparse representation (unlike that in the brain) that provides little information in the single-neuron responses about the object class. Experiment 2 shows that VisNet forms invariant representations when trained with different views of each object, whereas HMAX performs poorly when assessed with a biologically plausible pattern association network, as HMAX has no mechanism to learn view invariance. Experiment 3 shows that VisNet neurons do not respond to scrambled images of faces, and thus encode shape information. HMAX neurons responded with similarly high rates to the unscrambled and scrambled faces, indicating that low-level features including texture may be relevant to HMAX performance. Experiment 4 shows that VisNet can learn to recognize objects even when the view provided by the object changes catastrophically as it transforms, whereas HMAX has no learning mechanism in its S–C hierarchy that provides for view-invariant learning. This highlights some requirements for the neurobiological mechanisms of high-level vision, and how some different approaches perform, in order to help understand the fundamental underlying principles of invariant visual object recognition in the ventral visual stream. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00422-015-0658-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4572081
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-45720812015-09-23 Invariant visual object recognition: biologically plausible approaches Robinson, Leigh Rolls, Edmund T. Biol Cybern Original Article Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet performs object classification with random exemplars comparably to HMAX, except that the final layer C neurons of HMAX have a very non-sparse representation (unlike that in the brain) that provides little information in the single-neuron responses about the object class. Experiment 2 shows that VisNet forms invariant representations when trained with different views of each object, whereas HMAX performs poorly when assessed with a biologically plausible pattern association network, as HMAX has no mechanism to learn view invariance. Experiment 3 shows that VisNet neurons do not respond to scrambled images of faces, and thus encode shape information. HMAX neurons responded with similarly high rates to the unscrambled and scrambled faces, indicating that low-level features including texture may be relevant to HMAX performance. Experiment 4 shows that VisNet can learn to recognize objects even when the view provided by the object changes catastrophically as it transforms, whereas HMAX has no learning mechanism in its S–C hierarchy that provides for view-invariant learning. This highlights some requirements for the neurobiological mechanisms of high-level vision, and how some different approaches perform, in order to help understand the fundamental underlying principles of invariant visual object recognition in the ventral visual stream. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00422-015-0658-2) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2015-09-03 2015 /pmc/articles/PMC4572081/ /pubmed/26335743 http://dx.doi.org/10.1007/s00422-015-0658-2 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Article
Robinson, Leigh
Rolls, Edmund T.
Invariant visual object recognition: biologically plausible approaches
title Invariant visual object recognition: biologically plausible approaches
title_full Invariant visual object recognition: biologically plausible approaches
title_fullStr Invariant visual object recognition: biologically plausible approaches
title_full_unstemmed Invariant visual object recognition: biologically plausible approaches
title_short Invariant visual object recognition: biologically plausible approaches
title_sort invariant visual object recognition: biologically plausible approaches
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572081/
https://www.ncbi.nlm.nih.gov/pubmed/26335743
http://dx.doi.org/10.1007/s00422-015-0658-2
work_keys_str_mv AT robinsonleigh invariantvisualobjectrecognitionbiologicallyplausibleapproaches
AT rollsedmundt invariantvisualobjectrecognitionbiologicallyplausibleapproaches