Cargando…

Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects

When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, eithe...

Descripción completa

Detalles Bibliográficos
Autores principales: Webb, Tristan J., Rolls, Edmund T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978248/
https://www.ncbi.nlm.nih.gov/pubmed/24744725
http://dx.doi.org/10.3389/fncom.2014.00037
_version_ 1782310532856938496
author Webb, Tristan J.
Rolls, Edmund T.
author_facet Webb, Tristan J.
Rolls, Edmund T.
author_sort Webb, Tristan J.
collection PubMed
description When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, either blowing out or hanging languidly, we can usually recognize the flag, independently of its deformation; or we can recognize the deformation independently of the identity of the flag. We hypothesize that these types of recognition can be implemented by the primate visual system using temporo-spatial continuity as objects transform as a learning principle. In particular, we hypothesize that pose or deformation can be learned under conditions in which large numbers of different people are successively seen in the same pose, or objects in the same deformation. We also hypothesize that person-specific representations that are independent of pose, and object-specific representations that are independent of deformation and view, could be built, when individual people or objects are observed successively transforming from one pose or deformation and view to another. These hypotheses were tested in a simulation of the ventral visual system, VisNet, that uses temporal continuity, implemented in a synaptic learning rule with a short-term memory trace of previous neuronal activity, to learn invariant representations. It was found that depending on the statistics of the visual input, either pose-specific or deformation-specific representations could be built that were invariant with respect to individual and view; or that identity-specific representations could be built that were invariant with respect to pose or deformation and view. We propose that this is how pose-specific and pose-invariant, and deformation-specific and deformation-invariant, perceptual representations are built in the brain.
format Online
Article
Text
id pubmed-3978248
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-39782482014-04-17 Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects Webb, Tristan J. Rolls, Edmund T. Front Comput Neurosci Neuroscience When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, either blowing out or hanging languidly, we can usually recognize the flag, independently of its deformation; or we can recognize the deformation independently of the identity of the flag. We hypothesize that these types of recognition can be implemented by the primate visual system using temporo-spatial continuity as objects transform as a learning principle. In particular, we hypothesize that pose or deformation can be learned under conditions in which large numbers of different people are successively seen in the same pose, or objects in the same deformation. We also hypothesize that person-specific representations that are independent of pose, and object-specific representations that are independent of deformation and view, could be built, when individual people or objects are observed successively transforming from one pose or deformation and view to another. These hypotheses were tested in a simulation of the ventral visual system, VisNet, that uses temporal continuity, implemented in a synaptic learning rule with a short-term memory trace of previous neuronal activity, to learn invariant representations. It was found that depending on the statistics of the visual input, either pose-specific or deformation-specific representations could be built that were invariant with respect to individual and view; or that identity-specific representations could be built that were invariant with respect to pose or deformation and view. We propose that this is how pose-specific and pose-invariant, and deformation-specific and deformation-invariant, perceptual representations are built in the brain. Frontiers Media S.A. 2014-04-01 /pmc/articles/PMC3978248/ /pubmed/24744725 http://dx.doi.org/10.3389/fncom.2014.00037 Text en Copyright © 2014 Webb and Rolls. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Webb, Tristan J.
Rolls, Edmund T.
Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title_full Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title_fullStr Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title_full_unstemmed Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title_short Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
title_sort deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978248/
https://www.ncbi.nlm.nih.gov/pubmed/24744725
http://dx.doi.org/10.3389/fncom.2014.00037
work_keys_str_mv AT webbtristanj deformationspecificanddeformationinvariantvisualobjectrecognitionposevsidentityrecognitionofpeopleanddeformingobjects
AT rollsedmundt deformationspecificanddeformationinvariantvisualobjectrecognitionposevsidentityrecognitionofpeopleanddeformingobjects