Cargando…

Object detection through search with a foveated visual system

Humans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabol...

Descripción completa

Detalles Bibliográficos
Autores principales: Akbas, Emre, Eckstein, Miguel P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5669499/
https://www.ncbi.nlm.nih.gov/pubmed/28991906
http://dx.doi.org/10.1371/journal.pcbi.1005743
_version_ 1783275856483844096
author Akbas, Emre
Eckstein, Miguel P.
author_facet Akbas, Emre
Eckstein, Miguel P.
author_sort Akbas, Emre
collection PubMed
description Humans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabolic cost. But what are the performance costs of such processing strategy relative to a scheme that processes the visual field at high spatial resolution? Here we first focus on visual search and combine object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We develop a foveated object detector that processes the entire scene with varying resolution, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. We compared the foveated object detector against a non-foveated version of the same object detector which processes the entire image at homogeneous high spatial resolution. We evaluated the accuracy of the foveated and non-foveated object detectors identifying 20 different objects classes in scenes from a standard computer vision data set (the PASCAL VOC 2007 dataset). We show that the foveated object detector can approximate the performance of the object detector with homogeneous high spatial resolution processing while bringing significant computational cost savings. Additionally, we assessed the impact of foveation on the computation of bottom-up saliency. An implementation of a simple foveated bottom-up saliency model with eye movements showed agreement in the selection of top salient regions of scenes with those selected by a non-foveated high resolution saliency model. Together, our results might help explain the evolution of foveated visual systems with eye movements as a solution that preserves perceptual performance in visual search while resulting in computational and metabolic savings to the brain.
format Online
Article
Text
id pubmed-5669499
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56694992017-11-18 Object detection through search with a foveated visual system Akbas, Emre Eckstein, Miguel P. PLoS Comput Biol Research Article Humans and many other species sense visual information with varying spatial resolution across the visual field (foveated vision) and deploy eye movements to actively sample regions of interests in scenes. The advantage of such varying resolution architecture is a reduced computational, hence metabolic cost. But what are the performance costs of such processing strategy relative to a scheme that processes the visual field at high spatial resolution? Here we first focus on visual search and combine object detectors from computer vision with a recent model of peripheral pooling regions found at the V1 layer of the human visual system. We develop a foveated object detector that processes the entire scene with varying resolution, uses retino-specific object detection classifiers to guide eye movements, aligns its fovea with regions of interest in the input image and integrates observations across multiple fixations. We compared the foveated object detector against a non-foveated version of the same object detector which processes the entire image at homogeneous high spatial resolution. We evaluated the accuracy of the foveated and non-foveated object detectors identifying 20 different objects classes in scenes from a standard computer vision data set (the PASCAL VOC 2007 dataset). We show that the foveated object detector can approximate the performance of the object detector with homogeneous high spatial resolution processing while bringing significant computational cost savings. Additionally, we assessed the impact of foveation on the computation of bottom-up saliency. An implementation of a simple foveated bottom-up saliency model with eye movements showed agreement in the selection of top salient regions of scenes with those selected by a non-foveated high resolution saliency model. Together, our results might help explain the evolution of foveated visual systems with eye movements as a solution that preserves perceptual performance in visual search while resulting in computational and metabolic savings to the brain. Public Library of Science 2017-10-09 /pmc/articles/PMC5669499/ /pubmed/28991906 http://dx.doi.org/10.1371/journal.pcbi.1005743 Text en © 2017 Akbas, Eckstein http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Akbas, Emre
Eckstein, Miguel P.
Object detection through search with a foveated visual system
title Object detection through search with a foveated visual system
title_full Object detection through search with a foveated visual system
title_fullStr Object detection through search with a foveated visual system
title_full_unstemmed Object detection through search with a foveated visual system
title_short Object detection through search with a foveated visual system
title_sort object detection through search with a foveated visual system
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5669499/
https://www.ncbi.nlm.nih.gov/pubmed/28991906
http://dx.doi.org/10.1371/journal.pcbi.1005743
work_keys_str_mv AT akbasemre objectdetectionthroughsearchwithafoveatedvisualsystem
AT ecksteinmiguelp objectdetectionthroughsearchwithafoveatedvisualsystem