Cargando…
Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision
The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Association for Research in Vision and Ophthalmology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/ https://www.ncbi.nlm.nih.gov/pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14 |
_version_ | 1784571424632995840 |
---|---|
author | Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans |
author_facet | Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans |
author_sort | Avberšek, Lev Kiar |
collection | PubMed |
description | The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations. |
format | Online Article Text |
id | pubmed-8458991 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | The Association for Research in Vision and Ophthalmology |
record_format | MEDLINE/PubMed |
spelling | pubmed-84589912021-10-05 Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans J Vis Article The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations. The Association for Research in Vision and Ophthalmology 2021-09-17 /pmc/articles/PMC8458991/ /pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14 Text en Copyright 2021 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License. |
spellingShingle | Article Avberšek, Lev Kiar Zeman, Astrid Op de Beeck, Hans Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title | Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title_full | Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title_fullStr | Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title_full_unstemmed | Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title_short | Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision |
title_sort | training for object recognition with increasing spatial frequency: a comparison of deep learning with human vision |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8458991/ https://www.ncbi.nlm.nih.gov/pubmed/34533580 http://dx.doi.org/10.1167/jov.21.10.14 |
work_keys_str_mv | AT avberseklevkiar trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision AT zemanastrid trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision AT opdebeeckhans trainingforobjectrecognitionwithincreasingspatialfrequencyacomparisonofdeeplearningwithhumanvision |