Cargando…

A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes

The promise of artificial intelligence in understanding biological vision relies on the comparison of computational models with brain data with the goal of capturing functional principles of visual information processing. Convolutional neural networks (CNN) have successfully matched the transformati...

Descripción completa

Detalles Bibliográficos
Autores principales:	Svanera, Michele, Morgan, Andrew T., Petro, Lucy S., Muckli, Lars
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Association for Research in Vision and Ophthalmology 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8288063/ https://www.ncbi.nlm.nih.gov/pubmed/34259828 http://dx.doi.org/10.1167/jov.21.7.5

_version_	1783724027588640768
author	Svanera, Michele Morgan, Andrew T. Petro, Lucy S. Muckli, Lars
author_facet	Svanera, Michele Morgan, Andrew T. Petro, Lucy S. Muckli, Lars
author_sort	Svanera, Michele
collection	PubMed
description	The promise of artificial intelligence in understanding biological vision relies on the comparison of computational models with brain data with the goal of capturing functional principles of visual information processing. Convolutional neural networks (CNN) have successfully matched the transformations in hierarchical processing occurring along the brain's feedforward visual pathway, extending into ventral temporal cortex. However, we are still to learn if CNNs can successfully describe feedback processes in early visual cortex. Here, we investigated similarities between human early visual cortex and a CNN with encoder/decoder architecture, trained with self-supervised learning to fill occlusions and reconstruct an unseen image. Using representational similarity analysis (RSA), we compared 3T functional magnetic resonance imaging (fMRI) data from a nonstimulated patch of early visual cortex in human participants viewing partially occluded images, with the different CNN layer activations from the same images. Results show that our self-supervised image-completion network outperforms a classical object-recognition supervised network (VGG16) in terms of similarity to fMRI data. This work provides additional evidence that optimal models of the visual system might come from less feedforward architectures trained with less supervision. We also find that CNN decoder pathway activations are more similar to brain processing compared to encoder activations, suggesting an integration of mid- and low/middle-level features in early visual cortex. Challenging an artificial intelligence model to learn natural image representations via self-supervised learning and comparing them with brain data can help us to constrain our understanding of information processing, such as neuronal predictive coding.
format	Online Article Text
id	pubmed-8288063
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	The Association for Research in Vision and Ophthalmology
record_format	MEDLINE/PubMed
spelling	pubmed-82880632021-07-26 A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes Svanera, Michele Morgan, Andrew T. Petro, Lucy S. Muckli, Lars J Vis Article The promise of artificial intelligence in understanding biological vision relies on the comparison of computational models with brain data with the goal of capturing functional principles of visual information processing. Convolutional neural networks (CNN) have successfully matched the transformations in hierarchical processing occurring along the brain's feedforward visual pathway, extending into ventral temporal cortex. However, we are still to learn if CNNs can successfully describe feedback processes in early visual cortex. Here, we investigated similarities between human early visual cortex and a CNN with encoder/decoder architecture, trained with self-supervised learning to fill occlusions and reconstruct an unseen image. Using representational similarity analysis (RSA), we compared 3T functional magnetic resonance imaging (fMRI) data from a nonstimulated patch of early visual cortex in human participants viewing partially occluded images, with the different CNN layer activations from the same images. Results show that our self-supervised image-completion network outperforms a classical object-recognition supervised network (VGG16) in terms of similarity to fMRI data. This work provides additional evidence that optimal models of the visual system might come from less feedforward architectures trained with less supervision. We also find that CNN decoder pathway activations are more similar to brain processing compared to encoder activations, suggesting an integration of mid- and low/middle-level features in early visual cortex. Challenging an artificial intelligence model to learn natural image representations via self-supervised learning and comparing them with brain data can help us to constrain our understanding of information processing, such as neuronal predictive coding. The Association for Research in Vision and Ophthalmology 2021-07-14 /pmc/articles/PMC8288063/ /pubmed/34259828 http://dx.doi.org/10.1167/jov.21.7.5 Text en Copyright 2021 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle	Article Svanera, Michele Morgan, Andrew T. Petro, Lucy S. Muckli, Lars A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title	A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title_full	A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title_fullStr	A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title_full_unstemmed	A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title_short	A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes
title_sort	self-supervised deep neural network for image completion resembles early visual cortex fmri activity patterns for occluded scenes
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8288063/ https://www.ncbi.nlm.nih.gov/pubmed/34259828 http://dx.doi.org/10.1167/jov.21.7.5
work_keys_str_mv	AT svaneramichele aselfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT morganandrewt aselfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT petrolucys aselfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT mucklilars aselfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT svaneramichele selfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT morganandrewt selfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT petrolucys selfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes AT mucklilars selfsuperviseddeepneuralnetworkforimagecompletionresemblesearlyvisualcortexfmriactivitypatternsforoccludedscenes

A self-supervised deep neural network for image completion resembles early visual cortex fMRI activity patterns for occluded scenes

Ejemplares similares