Cargando…

Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition

The primate visual system achieves remarkable visual object recognition performance even in brief presentations, and under changes to object exemplar, geometric transformations, and background variation (a.k.a. core visual object recognition). This remarkable performance is mediated by the represent...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cadieu, Charles F., Hong, Ha, Yamins, Daniel L. K., Pinto, Nicolas, Ardila, Diego, Solomon, Ethan A., Majaj, Najib J., DiCarlo, James J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270441/ https://www.ncbi.nlm.nih.gov/pubmed/25521294 http://dx.doi.org/10.1371/journal.pcbi.1003963

_version_	1782349497767034880
author	Cadieu, Charles F. Hong, Ha Yamins, Daniel L. K. Pinto, Nicolas Ardila, Diego Solomon, Ethan A. Majaj, Najib J. DiCarlo, James J.
author_facet	Cadieu, Charles F. Hong, Ha Yamins, Daniel L. K. Pinto, Nicolas Ardila, Diego Solomon, Ethan A. Majaj, Najib J. DiCarlo, James J.
author_sort	Cadieu, Charles F.
collection	PubMed
description	The primate visual system achieves remarkable visual object recognition performance even in brief presentations, and under changes to object exemplar, geometric transformations, and background variation (a.k.a. core visual object recognition). This remarkable performance is mediated by the representation formed in inferior temporal (IT) cortex. In parallel, recent advances in machine learning have led to ever higher performing models of object recognition using artificial deep neural networks (DNNs). It remains unclear, however, whether the representational performance of DNNs rivals that of the brain. To accurately produce such a comparison, a major difficulty has been a unifying metric that accounts for experimental limitations, such as the amount of noise, the number of neural recording sites, and the number of trials, and computational limitations, such as the complexity of the decoding classifier and the number of classifier training examples. In this work, we perform a direct comparison that corrects for these experimental limitations and computational considerations. As part of our methodology, we propose an extension of “kernel analysis” that measures the generalization accuracy as a function of representational complexity. Our evaluations show that, unlike previous bio-inspired models, the latest DNNs rival the representational performance of IT cortex on this visual object recognition task. Furthermore, we show that models that perform well on measures of representational performance also perform well on measures of representational similarity to IT, and on measures of predicting individual IT multi-unit responses. Whether these DNNs rely on computational mechanisms similar to the primate visual system is yet to be determined, but, unlike all previous bio-inspired models, that possibility cannot be ruled out merely on representational performance grounds.
format	Online Article Text
id	pubmed-4270441
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-42704412014-12-26 Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition Cadieu, Charles F. Hong, Ha Yamins, Daniel L. K. Pinto, Nicolas Ardila, Diego Solomon, Ethan A. Majaj, Najib J. DiCarlo, James J. PLoS Comput Biol Research Article The primate visual system achieves remarkable visual object recognition performance even in brief presentations, and under changes to object exemplar, geometric transformations, and background variation (a.k.a. core visual object recognition). This remarkable performance is mediated by the representation formed in inferior temporal (IT) cortex. In parallel, recent advances in machine learning have led to ever higher performing models of object recognition using artificial deep neural networks (DNNs). It remains unclear, however, whether the representational performance of DNNs rivals that of the brain. To accurately produce such a comparison, a major difficulty has been a unifying metric that accounts for experimental limitations, such as the amount of noise, the number of neural recording sites, and the number of trials, and computational limitations, such as the complexity of the decoding classifier and the number of classifier training examples. In this work, we perform a direct comparison that corrects for these experimental limitations and computational considerations. As part of our methodology, we propose an extension of “kernel analysis” that measures the generalization accuracy as a function of representational complexity. Our evaluations show that, unlike previous bio-inspired models, the latest DNNs rival the representational performance of IT cortex on this visual object recognition task. Furthermore, we show that models that perform well on measures of representational performance also perform well on measures of representational similarity to IT, and on measures of predicting individual IT multi-unit responses. Whether these DNNs rely on computational mechanisms similar to the primate visual system is yet to be determined, but, unlike all previous bio-inspired models, that possibility cannot be ruled out merely on representational performance grounds. Public Library of Science 2014-12-18 /pmc/articles/PMC4270441/ /pubmed/25521294 http://dx.doi.org/10.1371/journal.pcbi.1003963 Text en © 2014 Cadieu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Cadieu, Charles F. Hong, Ha Yamins, Daniel L. K. Pinto, Nicolas Ardila, Diego Solomon, Ethan A. Majaj, Najib J. DiCarlo, James J. Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title	Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title_full	Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title_fullStr	Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title_full_unstemmed	Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title_short	Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition
title_sort	deep neural networks rival the representation of primate it cortex for core visual object recognition
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4270441/ https://www.ncbi.nlm.nih.gov/pubmed/25521294 http://dx.doi.org/10.1371/journal.pcbi.1003963
work_keys_str_mv	AT cadieucharlesf deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT hongha deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT yaminsdaniellk deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT pintonicolas deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT ardiladiego deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT solomonethana deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT majajnajibj deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition AT dicarlojamesj deepneuralnetworksrivaltherepresentationofprimateitcortexforcorevisualobjectrecognition

Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition

Ejemplares similares