Cargando…

A self-supervised domain-general learning framework for human ventral stream representation

Anterior regions of the ventral visual stream encode substantial information about object categories. Are top-down category-level forces critical for arriving at this representation, or can this representation be formed purely through domain-general learning of natural image structure? Here we prese...

Descripción completa

Detalles Bibliográficos
Autores principales: Konkle, Talia, Alvarez, George A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8789817/
https://www.ncbi.nlm.nih.gov/pubmed/35078981
http://dx.doi.org/10.1038/s41467-022-28091-4
_version_ 1784639858331877376
author Konkle, Talia
Alvarez, George A.
author_facet Konkle, Talia
Alvarez, George A.
author_sort Konkle, Talia
collection PubMed
description Anterior regions of the ventral visual stream encode substantial information about object categories. Are top-down category-level forces critical for arriving at this representation, or can this representation be formed purely through domain-general learning of natural image structure? Here we present a fully self-supervised model which learns to represent individual images, rather than categories, such that views of the same image are embedded nearby in a low-dimensional feature space, distinctly from other recently encountered views. We find that category information implicitly emerges in the local similarity structure of this feature space. Further, these models learn hierarchical features which capture the structure of brain responses across the human ventral visual stream, on par with category-supervised models. These results provide computational support for a domain-general framework guiding the formation of visual representation, where the proximate goal is not explicitly about category information, but is instead to learn unique, compressed descriptions of the visual world.
format Online
Article
Text
id pubmed-8789817
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-87898172022-02-07 A self-supervised domain-general learning framework for human ventral stream representation Konkle, Talia Alvarez, George A. Nat Commun Article Anterior regions of the ventral visual stream encode substantial information about object categories. Are top-down category-level forces critical for arriving at this representation, or can this representation be formed purely through domain-general learning of natural image structure? Here we present a fully self-supervised model which learns to represent individual images, rather than categories, such that views of the same image are embedded nearby in a low-dimensional feature space, distinctly from other recently encountered views. We find that category information implicitly emerges in the local similarity structure of this feature space. Further, these models learn hierarchical features which capture the structure of brain responses across the human ventral visual stream, on par with category-supervised models. These results provide computational support for a domain-general framework guiding the formation of visual representation, where the proximate goal is not explicitly about category information, but is instead to learn unique, compressed descriptions of the visual world. Nature Publishing Group UK 2022-01-25 /pmc/articles/PMC8789817/ /pubmed/35078981 http://dx.doi.org/10.1038/s41467-022-28091-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Konkle, Talia
Alvarez, George A.
A self-supervised domain-general learning framework for human ventral stream representation
title A self-supervised domain-general learning framework for human ventral stream representation
title_full A self-supervised domain-general learning framework for human ventral stream representation
title_fullStr A self-supervised domain-general learning framework for human ventral stream representation
title_full_unstemmed A self-supervised domain-general learning framework for human ventral stream representation
title_short A self-supervised domain-general learning framework for human ventral stream representation
title_sort self-supervised domain-general learning framework for human ventral stream representation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8789817/
https://www.ncbi.nlm.nih.gov/pubmed/35078981
http://dx.doi.org/10.1038/s41467-022-28091-4
work_keys_str_mv AT konkletalia aselfsuperviseddomaingenerallearningframeworkforhumanventralstreamrepresentation
AT alvarezgeorgea aselfsuperviseddomaingenerallearningframeworkforhumanventralstreamrepresentation
AT konkletalia selfsuperviseddomaingenerallearningframeworkforhumanventralstreamrepresentation
AT alvarezgeorgea selfsuperviseddomaingenerallearningframeworkforhumanventralstreamrepresentation