Cargando…

Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision

The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with...

Descripción completa

Detalles Bibliográficos
Autores principales:	Avberšek, Lev Kiar, Zeman, Astrid, Op de Beeck, Hans
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Association for Research in Vision and Ophthalmology 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/ https://www.ncbi.nlm.nih.gov/pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14

_version_	1784571424632995840
author	Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans
author_facet	Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans
author_sort	Avberšek, Lev Kiar
collection	PubMed
description	The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations.
format	Online Article Text
id	pubmed-8458991
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	The Association for Research in Vision and Ophthalmology
record_format	MEDLINE/PubMed
spelling	pubmed-84589912021-10-05 Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans J Vis Article The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations. The Association for Research in Vision and Ophthalmology 2021-09-17 /pmc/articles/PMC8458991/ /pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14 Text en Copyright 2021 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle	Article Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title	Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_full	Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_fullStr	Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_full_unstemmed	Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_short	Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_sort	training for object recognition with increasing spatial frequency: a comparison of deep learning with human vision
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/ https://www.ncbi.nlm.nih.gov/pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14
work_keys_str_mv	AT avberseklevkiar trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision AT zemanastrid trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision AT opdebeeckhans trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision

Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision

Ejemplares similares