Cargando…

Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex

Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zeman, Astrid A., Ritchie, J. Brendan, Bracci, Stefania, Op de Beeck, Hans
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/ https://www.ncbi.nlm.nih.gov/pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0

_version_	1783496902314033152
author	Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans
author_facet	Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans
author_sort	Zeman, Astrid A.
collection	PubMed
description	Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system.
format	Online Article Text
id	pubmed-7016009
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-70160092020-02-21 Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans Sci Rep Article Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system. Nature Publishing Group UK 2020-02-12 /pmc/articles/PMC7016009/ /pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title	Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_full	Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_fullStr	Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_full_unstemmed	Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_short	Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_sort	orthogonal representations of object shape and category in deep convolutional neural networks and human visual cortex
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/ https://www.ncbi.nlm.nih.gov/pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0
work_keys_str_mv	AT zemanastrida orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT ritchiejbrendan orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT braccistefania orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT opdebeeckhans orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex

Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex

Ejemplares similares