Cargando…

Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons

In order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE,...

Descripción completa

Detalles Bibliográficos
Autores principales: Higgins, Irina, Chang, Le, Langston, Victoria, Hassabis, Demis, Summerfield, Christopher, Tsao, Doris, Botvinick, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8578601/
https://www.ncbi.nlm.nih.gov/pubmed/34753913
http://dx.doi.org/10.1038/s41467-021-26751-5
_version_ 1784596263339032576
author Higgins, Irina
Chang, Le
Langston, Victoria
Hassabis, Demis
Summerfield, Christopher
Tsao, Doris
Botvinick, Matthew
author_facet Higgins, Irina
Chang, Le
Langston, Victoria
Hassabis, Demis
Summerfield, Christopher
Tsao, Doris
Botvinick, Matthew
author_sort Higgins, Irina
collection PubMed
description In order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE, which disentangles sensory data into interpretable latent factors, such as gender or age. Our results demonstrate a strong correspondence between the generative factors discovered by β-VAE and those coded by single IT neurons, beyond that found for the baselines, including the handcrafted state-of-the-art model of face perception, the Active Appearance Model, and deep classifiers. Moreover, β-VAE is able to reconstruct novel face images using signals from just a handful of cells. Together our results imply that optimising the disentangling objective leads to representations that closely resemble those in the IT at the single unit level. This points at disentangling as a plausible learning objective for the visual brain.
format Online
Article
Text
id pubmed-8578601
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85786012021-11-15 Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons Higgins, Irina Chang, Le Langston, Victoria Hassabis, Demis Summerfield, Christopher Tsao, Doris Botvinick, Matthew Nat Commun Article In order to better understand how the brain perceives faces, it is important to know what objective drives learning in the ventral visual stream. To answer this question, we model neural responses to faces in the macaque inferotemporal (IT) cortex with a deep self-supervised generative model, β-VAE, which disentangles sensory data into interpretable latent factors, such as gender or age. Our results demonstrate a strong correspondence between the generative factors discovered by β-VAE and those coded by single IT neurons, beyond that found for the baselines, including the handcrafted state-of-the-art model of face perception, the Active Appearance Model, and deep classifiers. Moreover, β-VAE is able to reconstruct novel face images using signals from just a handful of cells. Together our results imply that optimising the disentangling objective leads to representations that closely resemble those in the IT at the single unit level. This points at disentangling as a plausible learning objective for the visual brain. Nature Publishing Group UK 2021-11-09 /pmc/articles/PMC8578601/ /pubmed/34753913 http://dx.doi.org/10.1038/s41467-021-26751-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Higgins, Irina
Chang, Le
Langston, Victoria
Hassabis, Demis
Summerfield, Christopher
Tsao, Doris
Botvinick, Matthew
Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_full Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_fullStr Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_full_unstemmed Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_short Unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
title_sort unsupervised deep learning identifies semantic disentanglement in single inferotemporal face patch neurons
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8578601/
https://www.ncbi.nlm.nih.gov/pubmed/34753913
http://dx.doi.org/10.1038/s41467-021-26751-5
work_keys_str_mv AT higginsirina unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT changle unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT langstonvictoria unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT hassabisdemis unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT summerfieldchristopher unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT tsaodoris unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons
AT botvinickmatthew unsuperviseddeeplearningidentifiessemanticdisentanglementinsingleinferotemporalfacepatchneurons