Cargando…

Capsule networks as recurrent models of grouping and segmentation

Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of visi...

Descripción completa

Detalles Bibliográficos
Autores principales: Doerig, Adrien, Schmittwilken, Lynn, Sayim, Bilge, Manassi, Mauro, Herzog, Michael H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394447/
https://www.ncbi.nlm.nih.gov/pubmed/32692780
http://dx.doi.org/10.1371/journal.pcbi.1008017
_version_ 1783565232958865408
author Doerig, Adrien
Schmittwilken, Lynn
Sayim, Bilge
Manassi, Mauro
Herzog, Michael H.
author_facet Doerig, Adrien
Schmittwilken, Lynn
Sayim, Bilge
Manassi, Mauro
Herzog, Michael H.
author_sort Doerig, Adrien
collection PubMed
description Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations.
format Online
Article
Text
id pubmed-7394447
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73944472020-08-13 Capsule networks as recurrent models of grouping and segmentation Doerig, Adrien Schmittwilken, Lynn Sayim, Bilge Manassi, Mauro Herzog, Michael H. PLoS Comput Biol Research Article Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations. Public Library of Science 2020-07-21 /pmc/articles/PMC7394447/ /pubmed/32692780 http://dx.doi.org/10.1371/journal.pcbi.1008017 Text en © 2020 Doerig et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Doerig, Adrien
Schmittwilken, Lynn
Sayim, Bilge
Manassi, Mauro
Herzog, Michael H.
Capsule networks as recurrent models of grouping and segmentation
title Capsule networks as recurrent models of grouping and segmentation
title_full Capsule networks as recurrent models of grouping and segmentation
title_fullStr Capsule networks as recurrent models of grouping and segmentation
title_full_unstemmed Capsule networks as recurrent models of grouping and segmentation
title_short Capsule networks as recurrent models of grouping and segmentation
title_sort capsule networks as recurrent models of grouping and segmentation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394447/
https://www.ncbi.nlm.nih.gov/pubmed/32692780
http://dx.doi.org/10.1371/journal.pcbi.1008017
work_keys_str_mv AT doerigadrien capsulenetworksasrecurrentmodelsofgroupingandsegmentation
AT schmittwilkenlynn capsulenetworksasrecurrentmodelsofgroupingandsegmentation
AT sayimbilge capsulenetworksasrecurrentmodelsofgroupingandsegmentation
AT manassimauro capsulenetworksasrecurrentmodelsofgroupingandsegmentation
AT herzogmichaelh capsulenetworksasrecurrentmodelsofgroupingandsegmentation