Cargando…

Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks

Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each con...

Descripción completa

Detalles Bibliográficos
Autores principales: Mocz, Viola, Jeong, Su Keun, Chun, Marvin, Xu, Yaoda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/
https://www.ncbi.nlm.nih.gov/pubmed/36909506
http://dx.doi.org/10.1101/2023.02.28.530472
_version_ 1784904437618180096
author Mocz, Viola
Jeong, Su Keun
Chun, Marvin
Xu, Yaoda
author_facet Mocz, Viola
Jeong, Su Keun
Chun, Marvin
Xu, Yaoda
author_sort Mocz, Viola
collection PubMed
description Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain.
format Online
Article
Text
id pubmed-10002658
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100026582023-03-11 Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda bioRxiv Article Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain. Cold Spring Harbor Laboratory 2023-03-01 /pmc/articles/PMC10002658/ /pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Mocz, Viola
Jeong, Su Keun
Chun, Marvin
Xu, Yaoda
Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_full Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_fullStr Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_full_unstemmed Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_short Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_sort representing multiple visual objects in the human brain and convolutional neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/
https://www.ncbi.nlm.nih.gov/pubmed/36909506
http://dx.doi.org/10.1101/2023.02.28.530472
work_keys_str_mv AT moczviola representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks
AT jeongsukeun representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks
AT chunmarvin representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks
AT xuyaoda representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks