Cargando…

Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision

The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with...

Descripción completa

Detalles Bibliográficos
Autores principales: Avberšek, Lev Kiar, Zeman, Astrid, Op de Beeck, Hans
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/
https://www.ncbi.nlm.nih.gov/pubmed/34533580
http://dx.doi.org/10.1167/jov.21.10.14
_version_ 1784571424632995840
author Avberšek, Lev Kiar
Zeman, Astrid
Op de Beeck, Hans
author_facet Avberšek, Lev Kiar
Zeman, Astrid
Op de Beeck, Hans
author_sort Avberšek, Lev Kiar
collection PubMed
description The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations.
format Online
Article
Text
id pubmed-8458991
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-84589912021-10-05 Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans J Vis Article The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations. The Association for Research in Vision and Ophthalmology 2021-09-17 /pmc/articles/PMC8458991/ /pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14 Text en Copyright 2021 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle Article
Avberšek, Lev Kiar
Zeman, Astrid
Op de Beeck, Hans
Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_full Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_fullStr Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_full_unstemmed Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_short Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
title_sort training for object recognition with increasing spatial frequency: a comparison of deep learning with human vision
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/
https://www.ncbi.nlm.nih.gov/pubmed/34533580
http://dx.doi.org/10.1167/jov.21.10.14
work_keys_str_mv AT avberseklevkiar trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision
AT zemanastrid trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision
AT opdebeeckhans trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision