Cargando…

A conceptual framework of computations in mid-level vision

If a picture is worth a thousand words, as an English idiom goes, what should those words—or, rather, descriptors—capture? What format of image representation would be sufficiently rich if we were to reconstruct the essence of images from their descriptors? In this paper, we set out to develop a con...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kubilius, Jonas, Wagemans, Johan, Op de Beeck, Hans P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2014
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4264474/ https://www.ncbi.nlm.nih.gov/pubmed/25566044 http://dx.doi.org/10.3389/fncom.2014.00158

_version_	1782348751792242688
author	Kubilius, Jonas Wagemans, Johan Op de Beeck, Hans P.
author_facet	Kubilius, Jonas Wagemans, Johan Op de Beeck, Hans P.
author_sort	Kubilius, Jonas
collection	PubMed
description	If a picture is worth a thousand words, as an English idiom goes, what should those words—or, rather, descriptors—capture? What format of image representation would be sufficiently rich if we were to reconstruct the essence of images from their descriptors? In this paper, we set out to develop a conceptual framework that would be: (i) biologically plausible in order to provide a better mechanistic understanding of our visual system; (ii) sufficiently robust to apply in practice on realistic images; and (iii) able to tap into underlying structure of our visual world. We bring forward three key ideas. First, we argue that surface-based representations are constructed based on feature inference from the input in the intermediate processing layers of the visual system. Such representations are computed in a largely pre-semantic (prior to categorization) and pre-attentive manner using multiple cues (orientation, color, polarity, variation in orientation, and so on), and explicitly retain configural relations between features. The constructed surfaces may be partially overlapping to compensate for occlusions and are ordered in depth (figure-ground organization). Second, we propose that such intermediate representations could be formed by a hierarchical computation of similarity between features in local image patches and pooling of highly-similar units, and reestimated via recurrent loops according to the task demands. Finally, we suggest to use datasets composed of realistically rendered artificial objects and surfaces in order to better understand a model's behavior and its limitations.
format	Online Article Text
id	pubmed-4264474
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-42644742015-01-06 A conceptual framework of computations in mid-level vision Kubilius, Jonas Wagemans, Johan Op de Beeck, Hans P. Front Comput Neurosci Neuroscience If a picture is worth a thousand words, as an English idiom goes, what should those words—or, rather, descriptors—capture? What format of image representation would be sufficiently rich if we were to reconstruct the essence of images from their descriptors? In this paper, we set out to develop a conceptual framework that would be: (i) biologically plausible in order to provide a better mechanistic understanding of our visual system; (ii) sufficiently robust to apply in practice on realistic images; and (iii) able to tap into underlying structure of our visual world. We bring forward three key ideas. First, we argue that surface-based representations are constructed based on feature inference from the input in the intermediate processing layers of the visual system. Such representations are computed in a largely pre-semantic (prior to categorization) and pre-attentive manner using multiple cues (orientation, color, polarity, variation in orientation, and so on), and explicitly retain configural relations between features. The constructed surfaces may be partially overlapping to compensate for occlusions and are ordered in depth (figure-ground organization). Second, we propose that such intermediate representations could be formed by a hierarchical computation of similarity between features in local image patches and pooling of highly-similar units, and reestimated via recurrent loops according to the task demands. Finally, we suggest to use datasets composed of realistically rendered artificial objects and surfaces in order to better understand a model's behavior and its limitations. Frontiers Media S.A. 2014-12-12 /pmc/articles/PMC4264474/ /pubmed/25566044 http://dx.doi.org/10.3389/fncom.2014.00158 Text en Copyright © 2014 Kubilius, Wagemans and Op de Beeck. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Kubilius, Jonas Wagemans, Johan Op de Beeck, Hans P. A conceptual framework of computations in mid-level vision
title	A conceptual framework of computations in mid-level vision
title_full	A conceptual framework of computations in mid-level vision
title_fullStr	A conceptual framework of computations in mid-level vision
title_full_unstemmed	A conceptual framework of computations in mid-level vision
title_short	A conceptual framework of computations in mid-level vision
title_sort	conceptual framework of computations in mid-level vision
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4264474/ https://www.ncbi.nlm.nih.gov/pubmed/25566044 http://dx.doi.org/10.3389/fncom.2014.00158
work_keys_str_mv	AT kubiliusjonas aconceptualframeworkofcomputationsinmidlevelvision AT wagemansjohan aconceptualframeworkofcomputationsinmidlevelvision AT opdebeeckhansp aconceptualframeworkofcomputationsinmidlevelvision AT kubiliusjonas conceptualframeworkofcomputationsinmidlevelvision AT wagemansjohan conceptualframeworkofcomputationsinmidlevelvision AT opdebeeckhansp conceptualframeworkofcomputationsinmidlevelvision

A conceptual framework of computations in mid-level vision

Ejemplares similares