Cargando…
Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects
When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, eithe...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978248/ https://www.ncbi.nlm.nih.gov/pubmed/24744725 http://dx.doi.org/10.3389/fncom.2014.00037 |
_version_ | 1782310532856938496 |
---|---|
author | Webb, Tristan J. Rolls, Edmund T. |
author_facet | Webb, Tristan J. Rolls, Edmund T. |
author_sort | Webb, Tristan J. |
collection | PubMed |
description | When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, either blowing out or hanging languidly, we can usually recognize the flag, independently of its deformation; or we can recognize the deformation independently of the identity of the flag. We hypothesize that these types of recognition can be implemented by the primate visual system using temporo-spatial continuity as objects transform as a learning principle. In particular, we hypothesize that pose or deformation can be learned under conditions in which large numbers of different people are successively seen in the same pose, or objects in the same deformation. We also hypothesize that person-specific representations that are independent of pose, and object-specific representations that are independent of deformation and view, could be built, when individual people or objects are observed successively transforming from one pose or deformation and view to another. These hypotheses were tested in a simulation of the ventral visual system, VisNet, that uses temporal continuity, implemented in a synaptic learning rule with a short-term memory trace of previous neuronal activity, to learn invariant representations. It was found that depending on the statistics of the visual input, either pose-specific or deformation-specific representations could be built that were invariant with respect to individual and view; or that identity-specific representations could be built that were invariant with respect to pose or deformation and view. We propose that this is how pose-specific and pose-invariant, and deformation-specific and deformation-invariant, perceptual representations are built in the brain. |
format | Online Article Text |
id | pubmed-3978248 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-39782482014-04-17 Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects Webb, Tristan J. Rolls, Edmund T. Front Comput Neurosci Neuroscience When we see a human sitting down, standing up, or walking, we can recognize one of these poses independently of the individual, or we can recognize the individual person, independently of the pose. The same issues arise for deforming objects. For example, if we see a flag deformed by the wind, either blowing out or hanging languidly, we can usually recognize the flag, independently of its deformation; or we can recognize the deformation independently of the identity of the flag. We hypothesize that these types of recognition can be implemented by the primate visual system using temporo-spatial continuity as objects transform as a learning principle. In particular, we hypothesize that pose or deformation can be learned under conditions in which large numbers of different people are successively seen in the same pose, or objects in the same deformation. We also hypothesize that person-specific representations that are independent of pose, and object-specific representations that are independent of deformation and view, could be built, when individual people or objects are observed successively transforming from one pose or deformation and view to another. These hypotheses were tested in a simulation of the ventral visual system, VisNet, that uses temporal continuity, implemented in a synaptic learning rule with a short-term memory trace of previous neuronal activity, to learn invariant representations. It was found that depending on the statistics of the visual input, either pose-specific or deformation-specific representations could be built that were invariant with respect to individual and view; or that identity-specific representations could be built that were invariant with respect to pose or deformation and view. We propose that this is how pose-specific and pose-invariant, and deformation-specific and deformation-invariant, perceptual representations are built in the brain. Frontiers Media S.A. 2014-04-01 /pmc/articles/PMC3978248/ /pubmed/24744725 http://dx.doi.org/10.3389/fncom.2014.00037 Text en Copyright © 2014 Webb and Rolls. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Webb, Tristan J. Rolls, Edmund T. Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title | Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title_full | Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title_fullStr | Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title_full_unstemmed | Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title_short | Deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
title_sort | deformation-specific and deformation-invariant visual object recognition: pose vs. identity recognition of people and deforming objects |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978248/ https://www.ncbi.nlm.nih.gov/pubmed/24744725 http://dx.doi.org/10.3389/fncom.2014.00037 |
work_keys_str_mv | AT webbtristanj deformationspecificanddeformationinvariantvisualobjectrecognitionposevsidentityrecognitionofpeopleanddeformingobjects AT rollsedmundt deformationspecificanddeformationinvariantvisualobjectrecognitionposevsidentityrecognitionofpeopleanddeformingobjects |