Cargando…
Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex
Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/ https://www.ncbi.nlm.nih.gov/pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0 |
_version_ | 1783496902314033152 |
---|---|
author | Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans |
author_facet | Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans |
author_sort | Zeman, Astrid A. |
collection | PubMed |
description | Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system. |
format | Online Article Text |
id | pubmed-7016009 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-70160092020-02-21 Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans Sci Rep Article Deep Convolutional Neural Networks (CNNs) are gaining traction as the benchmark model of visual object recognition, with performance now surpassing humans. While CNNs can accurately assign one image to potentially thousands of categories, network performance could be the result of layers that are tuned to represent the visual shape of objects, rather than object category, since both are often confounded in natural images. Using two stimulus sets that explicitly dissociate shape from category, we correlate these two types of information with each layer of multiple CNNs. We also compare CNN output with fMRI activation along the human visual ventral stream by correlating artificial with neural representations. We find that CNNs encode category information independently from shape, peaking at the final fully connected layer in all tested CNN architectures. Comparing CNNs with fMRI brain data, early visual cortex (V1) and early layers of CNNs encode shape information. Anterior ventral temporal cortex encodes category information, which correlates best with the final layer of CNNs. The interaction between shape and category that is found along the human visual ventral pathway is echoed in multiple deep networks. Our results suggest CNNs represent category information independently from shape, much like the human visual system. Nature Publishing Group UK 2020-02-12 /pmc/articles/PMC7016009/ /pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Zeman, Astrid A. Ritchie, J. Brendan Bracci, Stefania Op de Beeck, Hans Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title | Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title_full | Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title_fullStr | Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title_full_unstemmed | Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title_short | Orthogonal Representations of Object Shape and Category in Deep Convolutional Neural Networks and Human Visual Cortex |
title_sort | orthogonal representations of object shape and category in deep convolutional neural networks and human visual cortex |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016009/ https://www.ncbi.nlm.nih.gov/pubmed/32051467 http://dx.doi.org/10.1038/s41598-020-59175-0 |
work_keys_str_mv | AT zemanastrida orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT ritchiejbrendan orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT braccistefania orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex AT opdebeeckhans orthogonalrepresentationsofobjectshapeandcategoryindeepconvolutionalneuralnetworksandhumanvisualcortex |