Cargando…

Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex

Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tu...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeman, Astrid A., Ritchie, J. Brendan, Bracci, Stefania, Op de Beeck, Hans
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/
https://www.ncbi.nlm.nih.gov/pubmed/32051467
http://dx.doi.org/10.1038/s41598-020-59175-0
_version_ 1783496902314033152
author Zeman, Astrid A.
Ritchie, J. Brendan
Bracci, Stefania
Op de Beeck, Hans
author_facet Zeman, Astrid A.
Ritchie, J. Brendan
Bracci, Stefania
Op de Beeck, Hans
author_sort Zeman, Astrid A.
collection PubMed
description Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system.
format Online
Article
Text
id pubmed-7016009
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-70160092020-02-21 Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans Sci Rep Article Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system. Nature Publishing Group UK 2020-02-12 /pmc/articles/PMC7016009/ /pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Zeman, Astrid A.
Ritchie, J. Brendan
Bracci, Stefania
Op de Beeck, Hans
Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_full Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_fullStr Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_full_unstemmed Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_short Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
title_sort orthogonal representations of object shape and category in deep convolutional neural networks and human visual cortex
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/
https://www.ncbi.nlm.nih.gov/pubmed/32051467
http://dx.doi.org/10.1038/s41598-020-59175-0
work_keys_str_mv AT zemanastrida orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex
AT ritchiejbrendan orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex
AT braccistefania orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex
AT opdebeeckhans orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex