Cargando…

Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain

Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interf...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Chi, Duan, Xiao-Han, Wang, Lin-Yuan, Li, Yong-Li, Yan, Bin, Hu, Guo-En, Zhang, Ru-Yuan, Tong, Li
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Neuroinformatics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8375771/ https://www.ncbi.nlm.nih.gov/pubmed/34421567 http://dx.doi.org/10.3389/fninf.2021.677925

_version_	1783740362522624000
author	Zhang, Chi Duan, Xiao-Han Wang, Lin-Yuan Li, Yong-Li Yan, Bin Hu, Guo-En Zhang, Ru-Yuan Tong, Li
author_facet	Zhang, Chi Duan, Xiao-Han Wang, Lin-Yuan Li, Yong-Li Yan, Bin Hu, Guo-En Zhang, Ru-Yuan Tong, Li
author_sort	Zhang, Chi
collection	PubMed
description	Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcomes in the two systems. Humans can successfully recognize AI images as the same categories as their corresponding regular images but perceive AN images as meaningless noise. In contrast, CNNs can recognize AN images similar as corresponding regular images but classify AI images into wrong categories with surprisingly high confidence. We use functional magnetic resonance imaging to measure brain activity evoked by regular and adversarial images in the human brain, and compare it to the activity of artificial neurons in a prototypical CNN—AlexNet. In the human brain, we find that the representational similarity between regular and adversarial images largely echoes their perceptual similarity in all early visual areas. In AlexNet, however, the neural representations of adversarial images are inconsistent with network outputs in all intermediate processing layers, providing no neural foundations for the similarities at the perceptual level. Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images. These remarkable differences between the human brain and AlexNet in representation-perception association suggest that future CNNs should emulate both behavior and the internal neural presentations of the human brain.
format	Online Article Text
id	pubmed-8375771
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-83757712021-08-20 Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain Zhang, Chi Duan, Xiao-Han Wang, Lin-Yuan Li, Yong-Li Yan, Bin Hu, Guo-En Zhang, Ru-Yuan Tong, Li Front Neuroinform Neuroinformatics Despite the remarkable similarities between convolutional neural networks (CNN) and the human brain, CNNs still fall behind humans in many visual tasks, indicating that there still exist considerable differences between the two systems. Here, we leverage adversarial noise (AN) and adversarial interference (AI) images to quantify the consistency between neural representations and perceptual outcomes in the two systems. Humans can successfully recognize AI images as the same categories as their corresponding regular images but perceive AN images as meaningless noise. In contrast, CNNs can recognize AN images similar as corresponding regular images but classify AI images into wrong categories with surprisingly high confidence. We use functional magnetic resonance imaging to measure brain activity evoked by regular and adversarial images in the human brain, and compare it to the activity of artificial neurons in a prototypical CNN—AlexNet. In the human brain, we find that the representational similarity between regular and adversarial images largely echoes their perceptual similarity in all early visual areas. In AlexNet, however, the neural representations of adversarial images are inconsistent with network outputs in all intermediate processing layers, providing no neural foundations for the similarities at the perceptual level. Furthermore, we show that voxel-encoding models trained on regular images can successfully generalize to the neural responses to AI images but not AN images. These remarkable differences between the human brain and AlexNet in representation-perception association suggest that future CNNs should emulate both behavior and the internal neural presentations of the human brain. Frontiers Media S.A. 2021-08-05 /pmc/articles/PMC8375771/ /pubmed/34421567 http://dx.doi.org/10.3389/fninf.2021.677925 Text en Copyright © 2021 Zhang, Duan, Wang, Li, Yan, Hu, Zhang and Tong. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroinformatics Zhang, Chi Duan, Xiao-Han Wang, Lin-Yuan Li, Yong-Li Yan, Bin Hu, Guo-En Zhang, Ru-Yuan Tong, Li Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title	Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title_full	Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title_fullStr	Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title_full_unstemmed	Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title_short	Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain
title_sort	dissociable neural representations of adversarially perturbed images in convolutional neural networks and the human brain
topic	Neuroinformatics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8375771/ https://www.ncbi.nlm.nih.gov/pubmed/34421567 http://dx.doi.org/10.3389/fninf.2021.677925
work_keys_str_mv	AT zhangchi dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT duanxiaohan dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT wanglinyuan dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT liyongli dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT yanbin dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT huguoen dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT zhangruyuan dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain AT tongli dissociableneuralrepresentationsofadversariallyperturbedimagesinconvolutionalneuralnetworksandthehumanbrain

Dissociable Neural Representations of Adversarially Perturbed Images in Convolutional Neural Networks and the Human Brain

Ejemplares similares