Cargando…

Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars

Human vision, thought, and planning involve parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using neural networks, but a generative model for...

Descripción completa

Detalles Bibliográficos
Autores principales: Fisher, Ares, Rao, Rajesh P N
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637337/
https://www.ncbi.nlm.nih.gov/pubmed/37954157
http://dx.doi.org/10.1093/pnasnexus/pgad337
_version_ 1785133379101917184
author Fisher, Ares
Rao, Rajesh P N
author_facet Fisher, Ares
Rao, Rajesh P N
author_sort Fisher, Ares
collection PubMed
description Human vision, thought, and planning involve parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using neural networks, but a generative model formulation has been lacking. Generative models that leverage compositionality, recursion, and part-whole hierarchies are thought to underlie human concept learning and the ability to construct and represent flexible mental concepts. We introduce Recursive Neural Programs (RNPs), a neural generative model that addresses the part-whole hierarchy learning problem by modeling images as hierarchical trees of probabilistic sensory-motor programs. These programs recursively reuse learned sensory-motor primitives to model an image within different spatial reference frames, enabling hierarchical composition of objects from parts and implementing a grammar for images. We show that RNPs can learn part-whole hierarchies for a variety of image datasets, allowing rich compositionality and intuitive parts-based explanations of objects. Our model also suggests a cognitive framework for understanding how human brains can potentially learn and represent concepts in terms of recursively defined primitives and their relations with each other.
format Online
Article
Text
id pubmed-10637337
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106373372023-11-11 Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars Fisher, Ares Rao, Rajesh P N PNAS Nexus Physical Sciences and Engineering Human vision, thought, and planning involve parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using neural networks, but a generative model formulation has been lacking. Generative models that leverage compositionality, recursion, and part-whole hierarchies are thought to underlie human concept learning and the ability to construct and represent flexible mental concepts. We introduce Recursive Neural Programs (RNPs), a neural generative model that addresses the part-whole hierarchy learning problem by modeling images as hierarchical trees of probabilistic sensory-motor programs. These programs recursively reuse learned sensory-motor primitives to model an image within different spatial reference frames, enabling hierarchical composition of objects from parts and implementing a grammar for images. We show that RNPs can learn part-whole hierarchies for a variety of image datasets, allowing rich compositionality and intuitive parts-based explanations of objects. Our model also suggests a cognitive framework for understanding how human brains can potentially learn and represent concepts in terms of recursively defined primitives and their relations with each other. Oxford University Press 2023-10-14 /pmc/articles/PMC10637337/ /pubmed/37954157 http://dx.doi.org/10.1093/pnasnexus/pgad337 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of National Academy of Sciences. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Physical Sciences and Engineering
Fisher, Ares
Rao, Rajesh P N
Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title_full Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title_fullStr Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title_full_unstemmed Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title_short Recursive neural programs: A differentiable framework for learning compositional part-whole hierarchies and image grammars
title_sort recursive neural programs: a differentiable framework for learning compositional part-whole hierarchies and image grammars
topic Physical Sciences and Engineering
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637337/
https://www.ncbi.nlm.nih.gov/pubmed/37954157
http://dx.doi.org/10.1093/pnasnexus/pgad337
work_keys_str_mv AT fisherares recursiveneuralprogramsadifferentiableframeworkforlearningcompositionalpartwholehierarchiesandimagegrammars
AT raorajeshpn recursiveneuralprogramsadifferentiableframeworkforlearningcompositionalpartwholehierarchiesandimagegrammars