Cargando…

Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks

Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each con...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mocz, Viola, Jeong, Su Keun, Chun, Marvin, Xu, Yaoda
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cold Spring Harbor Laboratory 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/ https://www.ncbi.nlm.nih.gov/pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472

_version_	1784904437618180096
author	Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda
author_facet	Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda
author_sort	Mocz, Viola
collection	PubMed
description	Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain.
format	Online Article Text
id	pubmed-10002658
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cold Spring Harbor Laboratory
record_format	MEDLINE/PubMed
spelling	pubmed-100026582023-03-11 Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda bioRxiv Article Objects in the real world often appear with other objects. To recover the identity of an object whether or not other objects are encoded concurrently, in primate object-processing regions, neural responses to an object pair have been shown to be well approximated by the average responses to each constituent object shown alone, indicating the whole is equal to the average of its parts. This is present at the single unit level in the slope of response amplitudes of macaque IT neurons to paired and single objects, and at the population level in response patterns of fMRI voxels in human ventral object processing regions (e.g., LO). Here we show that averaging exists in both single fMRI voxels and voxel population responses in human LO, with better averaging in single voxels leading to better averaging in fMRI response patterns, demonstrating a close correspondence of averaging at the fMRI unit and population levels. To understand if a similar averaging mechanism exists in convolutional neural networks (CNNs) pretrained for object classification, we examined five CNNs with varying architecture, depth and the presence/absence of recurrent processing. We observed averaging at the CNN unit level but rarely at the population level, with CNN unit response distribution in most cases did not resemble human LO or macaque IT responses. The whole is thus not equal to the average of its parts in CNNs, potentially rendering the individual objects in a pair less accessible in CNNs during visual processing than they are in the human brain. Cold Spring Harbor Laboratory 2023-03-01 /pmc/articles/PMC10002658/ /pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle	Article Mocz, Viola Jeong, Su Keun Chun, Marvin Xu, Yaoda Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title	Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_full	Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_fullStr	Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_full_unstemmed	Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_short	Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks
title_sort	representing multiple visual objects in the human brain and convolutional neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002658/ https://www.ncbi.nlm.nih.gov/pubmed/36909506 http://dx.doi.org/10.1101/2023.02.28.530472
work_keys_str_mv	AT moczviola representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT jeongsukeun representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT chunmarvin representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks AT xuyaoda representingmultiplevisualobjectsinthehumanbrainandconvolutionalneuralnetworks

Representing Multiple Visual Objects in the Human Brain and Convolutional Neural Networks

Ejemplares similares