Cargando…

Depth in convolutional neural networks solves scene segmentation

Feed-forward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and even surpassing human performance in object recognition in natural scenes. This performance suggests that the analysis of a loose collection of image features could support the recognition of natural...

Descripción completa

Detalles Bibliográficos
Autores principales:	Seijdel, Noor, Tsakmakidis, Nikos, de Haan, Edward H. F., Bohte, Sander M., Scholte, H. Steven
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2020
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7406083/ https://www.ncbi.nlm.nih.gov/pubmed/32706770 http://dx.doi.org/10.1371/journal.pcbi.1008022

_version_	1783567373765181440
author	Seijdel, Noor Tsakmakidis, Nikos de Haan, Edward H. F. Bohte, Sander M. Scholte, H. Steven
author_facet	Seijdel, Noor Tsakmakidis, Nikos de Haan, Edward H. F. Bohte, Sander M. Scholte, H. Steven
author_sort	Seijdel, Noor
collection	PubMed
description	Feed-forward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and even surpassing human performance in object recognition in natural scenes. This performance suggests that the analysis of a loose collection of image features could support the recognition of natural object categories, without dedicated systems to solve specific visual subtasks. Research in humans however suggests that while feedforward activity may suffice for sparse scenes with isolated objects, additional visual operations ('routines') that aid the recognition process (e.g. segmentation or grouping) are needed for more complex scenes. Linking human visual processing to performance of DCNNs with increasing depth, we here explored if, how, and when object information is differentiated from the backgrounds they appear on. To this end, we controlled the information in both objects and backgrounds, as well as the relationship between them by adding noise, manipulating background congruence and systematically occluding parts of the image. Results indicate that with an increase in network depth, there is an increase in the distinction between object- and background information. For more shallow networks, results indicated a benefit of training on segmented objects. Overall, these results indicate that, de facto, scene segmentation can be performed by a network of sufficient depth. We conclude that the human brain could perform scene segmentation in the context of object identification without an explicit mechanism, by selecting or “binding” features that belong to the object and ignoring other features, in a manner similar to a very deep convolutional neural network.
format	Online Article Text
id	pubmed-7406083
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-74060832020-08-13 Depth in convolutional neural networks solves scene segmentation Seijdel, Noor Tsakmakidis, Nikos de Haan, Edward H. F. Bohte, Sander M. Scholte, H. Steven PLoS Comput Biol Research Article Feed-forward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and even surpassing human performance in object recognition in natural scenes. This performance suggests that the analysis of a loose collection of image features could support the recognition of natural object categories, without dedicated systems to solve specific visual subtasks. Research in humans however suggests that while feedforward activity may suffice for sparse scenes with isolated objects, additional visual operations ('routines') that aid the recognition process (e.g. segmentation or grouping) are needed for more complex scenes. Linking human visual processing to performance of DCNNs with increasing depth, we here explored if, how, and when object information is differentiated from the backgrounds they appear on. To this end, we controlled the information in both objects and backgrounds, as well as the relationship between them by adding noise, manipulating background congruence and systematically occluding parts of the image. Results indicate that with an increase in network depth, there is an increase in the distinction between object- and background information. For more shallow networks, results indicated a benefit of training on segmented objects. Overall, these results indicate that, de facto, scene segmentation can be performed by a network of sufficient depth. We conclude that the human brain could perform scene segmentation in the context of object identification without an explicit mechanism, by selecting or “binding” features that belong to the object and ignoring other features, in a manner similar to a very deep convolutional neural network. Public Library of Science 2020-07-24 /pmc/articles/PMC7406083/ /pubmed/32706770 http://dx.doi.org/10.1371/journal.pcbi.1008022 Text en © 2020 Seijdel et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Seijdel, Noor Tsakmakidis, Nikos de Haan, Edward H. F. Bohte, Sander M. Scholte, H. Steven Depth in convolutional neural networks solves scene segmentation
title	Depth in convolutional neural networks solves scene segmentation
title_full	Depth in convolutional neural networks solves scene segmentation
title_fullStr	Depth in convolutional neural networks solves scene segmentation
title_full_unstemmed	Depth in convolutional neural networks solves scene segmentation
title_short	Depth in convolutional neural networks solves scene segmentation
title_sort	depth in convolutional neural networks solves scene segmentation
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7406083/ https://www.ncbi.nlm.nih.gov/pubmed/32706770 http://dx.doi.org/10.1371/journal.pcbi.1008022
work_keys_str_mv	AT seijdelnoor depthinconvolutionalneuralnetworkssolvesscenesegmentation AT tsakmakidisnikos depthinconvolutionalneuralnetworkssolvesscenesegmentation AT dehaanedwardhf depthinconvolutionalneuralnetworkssolvesscenesegmentation AT bohtesanderm depthinconvolutionalneuralnetworkssolvesscenesegmentation AT scholtehsteven depthinconvolutionalneuralnetworkssolvesscenesegmentation

Depth in convolutional neural networks solves scene segmentation

Ejemplares similares