Cargando…

Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems

Searching for and recognizing objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyze and model how the dorsal and ventral visual streams both contribute to this. Saliency...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rolls, Edmund T., Webb, Tristan J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2014
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130325/ https://www.ncbi.nlm.nih.gov/pubmed/25161619 http://dx.doi.org/10.3389/fncom.2014.00085

_version_	1782330315755225088
author	Rolls, Edmund T. Webb, Tristan J.
author_facet	Rolls, Edmund T. Webb, Tristan J.
author_sort	Rolls, Edmund T.
collection	PubMed
description	Searching for and recognizing objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyze and model how the dorsal and ventral visual streams both contribute to this. Saliency detection in the dorsal visual system including area LIP is modeled by graph-based visual saliency, and allows the eyes to fixate potential objects within several degrees. Visual information at the fixated location subtending approximately 9° corresponding to the receptive fields of IT neurons is then passed through a four layer hierarchical model of the ventral cortical visual system, VisNet. We show that VisNet can be trained using a synaptic modification rule with a short-term memory trace of recent neuronal activity to capture both the required view and translation invariances to allow in the model approximately 90% correct object recognition for 4 objects shown in any view across a range of 135° anywhere in a scene. The model was able to generalize correctly within the four trained views and the 25 trained translations. This approach analyses the principles by which complementary computations in the dorsal and ventral visual cortical streams enable objects to be located and recognized in complex natural scenes.
format	Online Article Text
id	pubmed-4130325
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-41303252014-08-26 Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems Rolls, Edmund T. Webb, Tristan J. Front Comput Neurosci Neuroscience Searching for and recognizing objects in complex natural scenes is implemented by multiple saccades until the eyes reach within the reduced receptive field sizes of inferior temporal cortex (IT) neurons. We analyze and model how the dorsal and ventral visual streams both contribute to this. Saliency detection in the dorsal visual system including area LIP is modeled by graph-based visual saliency, and allows the eyes to fixate potential objects within several degrees. Visual information at the fixated location subtending approximately 9° corresponding to the receptive fields of IT neurons is then passed through a four layer hierarchical model of the ventral cortical visual system, VisNet. We show that VisNet can be trained using a synaptic modification rule with a short-term memory trace of recent neuronal activity to capture both the required view and translation invariances to allow in the model approximately 90% correct object recognition for 4 objects shown in any view across a range of 135° anywhere in a scene. The model was able to generalize correctly within the four trained views and the 25 trained translations. This approach analyses the principles by which complementary computations in the dorsal and ventral visual cortical streams enable objects to be located and recognized in complex natural scenes. Frontiers Media S.A. 2014-08-12 /pmc/articles/PMC4130325/ /pubmed/25161619 http://dx.doi.org/10.3389/fncom.2014.00085 Text en Copyright © 2014 Rolls and Webb. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Rolls, Edmund T. Webb, Tristan J. Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title	Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_full	Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_fullStr	Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_full_unstemmed	Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_short	Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
title_sort	finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4130325/ https://www.ncbi.nlm.nih.gov/pubmed/25161619 http://dx.doi.org/10.3389/fncom.2014.00085
work_keys_str_mv	AT rollsedmundt findingandrecognizingobjectsinnaturalscenescomplementarycomputationsinthedorsalandventralvisualsystems AT webbtristanj findingandrecognizingobjectsinnaturalscenescomplementarycomputationsinthedorsalandventralvisualsystems

Finding and recognizing objects in natural scenes: complementary computations in the dorsal and ventral visual systems

Ejemplares similares