Cargando…
Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each con...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/ https://www.ncbi.nlm.nih.gov/pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472 |
_version_ | 1784904437618180096 |
---|---|
author | Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda |
author_facet | Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda |
author_sort | Mocz, Viola |
collection | PubMed |
description | Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain. |
format | Online Article Text |
id | pubmed-10002658 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100026582023-03-11 Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda bioRxiv Article Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain. Cold Spring Harbor Laboratory 2023-03-01 /pmc/articles/PMC10002658/ /pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title | Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title_full | Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title_fullStr | Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title_full_unstemmed | Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title_short | Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks |
title_sort | representing multiple visual objects in the human brain and convolutional neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/ https://www.ncbi.nlm.nih.gov/pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472 |
work_keys_str_mv | AT moczviola representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT jeongsukeun representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT chunmarvin representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT xuyaoda representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks |