Cargando…

Invariant visual object recognition: biologically plausible approaches

Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet p...

Descripción completa

Detalles Bibliográficos
Autores principales:	Robinson, Leigh, Rolls, Edmund T.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Berlin Heidelberg 2015
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572081/ https://www.ncbi.nlm.nih.gov/pubmed/26335743 http://dx.doi.org/10.1007/s00422-015-0658-2

_version_	1782390390020636672
author	Robinson, Leigh Rolls, Edmund T.
author_facet	Robinson, Leigh Rolls, Edmund T.
author_sort	Robinson, Leigh
collection	PubMed
description	Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet performs object classification with random exemplars comparably to HMAX, except that the final layer C neurons of HMAX have a very non-sparse representation (unlike that in the brain) that provides little information in the single-neuron responses about the object class. Experiment 2 shows that VisNet forms invariant representations when trained with different views of each object, whereas HMAX performs poorly when assessed with a biologically plausible pattern association network, as HMAX has no mechanism to learn view invariance. Experiment 3 shows that VisNet neurons do not respond to scrambled images of faces, and thus encode shape information. HMAX neurons responded with similarly high rates to the unscrambled and scrambled faces, indicating that low-level features including texture may be relevant to HMAX performance. Experiment 4 shows that VisNet can learn to recognize objects even when the view provided by the object changes catastrophically as it transforms, whereas HMAX has no learning mechanism in its S–C hierarchy that provides for view-invariant learning. This highlights some requirements for the neurobiological mechanisms of high-level vision, and how some different approaches perform, in order to help understand the fundamental underlying principles of invariant visual object recognition in the ventral visual stream. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00422-015-0658-2) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-4572081
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Springer Berlin Heidelberg
record_format	MEDLINE/PubMed
spelling	pubmed-45720812015-09-23 Invariant visual object recognition: biologically plausible approaches Robinson, Leigh Rolls, Edmund T. Biol Cybern Original Article Key properties of inferior temporal cortex neurons are described, and then, the biological plausibility of two leading approaches to invariant visual object recognition in the ventral visual system is assessed to investigate whether they account for these properties. Experiment 1 shows that VisNet performs object classification with random exemplars comparably to HMAX, except that the final layer C neurons of HMAX have a very non-sparse representation (unlike that in the brain) that provides little information in the single-neuron responses about the object class. Experiment 2 shows that VisNet forms invariant representations when trained with different views of each object, whereas HMAX performs poorly when assessed with a biologically plausible pattern association network, as HMAX has no mechanism to learn view invariance. Experiment 3 shows that VisNet neurons do not respond to scrambled images of faces, and thus encode shape information. HMAX neurons responded with similarly high rates to the unscrambled and scrambled faces, indicating that low-level features including texture may be relevant to HMAX performance. Experiment 4 shows that VisNet can learn to recognize objects even when the view provided by the object changes catastrophically as it transforms, whereas HMAX has no learning mechanism in its S–C hierarchy that provides for view-invariant learning. This highlights some requirements for the neurobiological mechanisms of high-level vision, and how some different approaches perform, in order to help understand the fundamental underlying principles of invariant visual object recognition in the ventral visual stream. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s00422-015-0658-2) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2015-09-03 2015 /pmc/articles/PMC4572081/ /pubmed/26335743 http://dx.doi.org/10.1007/s00422-015-0658-2 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Original Article Robinson, Leigh Rolls, Edmund T. Invariant visual object recognition: biologically plausible approaches
title	Invariant visual object recognition: biologically plausible approaches
title_full	Invariant visual object recognition: biologically plausible approaches
title_fullStr	Invariant visual object recognition: biologically plausible approaches
title_full_unstemmed	Invariant visual object recognition: biologically plausible approaches
title_short	Invariant visual object recognition: biologically plausible approaches
title_sort	invariant visual object recognition: biologically plausible approaches
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4572081/ https://www.ncbi.nlm.nih.gov/pubmed/26335743 http://dx.doi.org/10.1007/s00422-015-0658-2
work_keys_str_mv	AT robinsonleigh invariantvisualobjectrecognitionbiologicallyplausibleapproaches AT rollsedmundt invariantvisualobjectrecognitionbiologicallyplausibleapproaches

Invariant visual object recognition: biologically plausible approaches

Ejemplares similares