Cargando…
Capsule networks as recurrent models of grouping and segmentation
Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of visi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394447/ https://www.ncbi.nlm.nih.gov/pubmed/32692780 http://dx.doi.org/10.1371/journal.pcbi.1008017 |
_version_ | 1783565232958865408 |
---|---|
author | Doerig, Adrien Schmittwilken, Lynn Sayim, Bilge Manassi, Mauro Herzog, Michael H. |
author_facet | Doerig, Adrien Schmittwilken, Lynn Sayim, Bilge Manassi, Mauro Herzog, Michael H. |
author_sort | Doerig, Adrien |
collection | PubMed |
description | Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations. |
format | Online Article Text |
id | pubmed-7394447 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73944472020-08-13 Capsule networks as recurrent models of grouping and segmentation Doerig, Adrien Schmittwilken, Lynn Sayim, Bilge Manassi, Mauro Herzog, Michael H. PLoS Comput Biol Research Article Classically, visual processing is described as a cascade of local feedforward computations. Feedforward Convolutional Neural Networks (ffCNNs) have shown how powerful such models can be. However, using visual crowding as a well-controlled challenge, we previously showed that no classic model of vision, including ffCNNs, can explain human global shape processing. Here, we show that Capsule Neural Networks (CapsNets), combining ffCNNs with recurrent grouping and segmentation, solve this challenge. We also show that ffCNNs and standard recurrent CNNs do not, suggesting that the grouping and segmentation capabilities of CapsNets are crucial. Furthermore, we provide psychophysical evidence that grouping and segmentation are implemented recurrently in humans, and show that CapsNets reproduce these results well. We discuss why recurrence seems needed to implement grouping and segmentation efficiently. Together, we provide mutually reinforcing psychophysical and computational evidence that a recurrent grouping and segmentation process is essential to understand the visual system and create better models that harness global shape computations. Public Library of Science 2020-07-21 /pmc/articles/PMC7394447/ /pubmed/32692780 http://dx.doi.org/10.1371/journal.pcbi.1008017 Text en © 2020 Doerig et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Doerig, Adrien Schmittwilken, Lynn Sayim, Bilge Manassi, Mauro Herzog, Michael H. Capsule networks as recurrent models of grouping and segmentation |
title | Capsule networks as recurrent models of grouping and segmentation |
title_full | Capsule networks as recurrent models of grouping and segmentation |
title_fullStr | Capsule networks as recurrent models of grouping and segmentation |
title_full_unstemmed | Capsule networks as recurrent models of grouping and segmentation |
title_short | Capsule networks as recurrent models of grouping and segmentation |
title_sort | capsule networks as recurrent models of grouping and segmentation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7394447/ https://www.ncbi.nlm.nih.gov/pubmed/32692780 http://dx.doi.org/10.1371/journal.pcbi.1008017 |
work_keys_str_mv | AT doerigadrien capsulenetworksasrecurrentmodelsofgroupingandsegmentation AT schmittwilkenlynn capsulenetworksasrecurrentmodelsofgroupingandsegmentation AT sayimbilge capsulenetworksasrecurrentmodelsofgroupingandsegmentation AT manassimauro capsulenetworksasrecurrentmodelsofgroupingandsegmentation AT herzogmichaelh capsulenetworksasrecurrentmodelsofgroupingandsegmentation |