Cargando…

Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification

How does the visual system encode natural scenes? What are the basic structures of natural scenes? In current models of scene perception, there are two broad feature representations, global and local representations. Both representations are useful and have some successes; however, many observations...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Xiaoyuan, Yang, Zhiyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3787016/
https://www.ncbi.nlm.nih.gov/pubmed/24098789
http://dx.doi.org/10.1371/journal.pone.0076393
_version_ 1782477819685634048
author Zhu, Xiaoyuan
Yang, Zhiyong
author_facet Zhu, Xiaoyuan
Yang, Zhiyong
author_sort Zhu, Xiaoyuan
collection PubMed
description How does the visual system encode natural scenes? What are the basic structures of natural scenes? In current models of scene perception, there are two broad feature representations, global and local representations. Both representations are useful and have some successes; however, many observations on human scene perception seem to point to an intermediate-level representation. In this paper, we proposed natural scene structures, i.e., multi-scale spatial concatenations of local features, as an intermediate-level representation of natural scenes. To compile the natural scene structures, we first sampled a large number of multi-scale circular scene patches in a hexagonal configuration. We then performed independent component analysis on the patches and classified the independent components into a set of clusters using the K-means method. Finally, we obtained a set of natural scene structures, each of which is characterized by a set of dominant clusters of independent components. We examined a range of statistics of the natural scene structures, compiled from two widely used datasets of natural scenes, and modeled their spatial arrangements at larger spatial scales using adjacency matrices. We found that the natural scene structures include a full range of concatenations of visual features in natural scenes, and can be used to encode spatial information at various scales. We then selected a set of natural scene structures with high information, and used the occurring frequencies and the eigenvalues of the adjacency matrices to classify scenes in the datasets. We found that the performance of this model is comparable to or better than the state-of-the-art models on the two datasets. These results suggest that the natural scene structures are a useful intermediate-level representation of visual scenes for our understanding of natural scene perception.
format Online
Article
Text
id pubmed-3787016
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37870162013-10-04 Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification Zhu, Xiaoyuan Yang, Zhiyong PLoS One Research Article How does the visual system encode natural scenes? What are the basic structures of natural scenes? In current models of scene perception, there are two broad feature representations, global and local representations. Both representations are useful and have some successes; however, many observations on human scene perception seem to point to an intermediate-level representation. In this paper, we proposed natural scene structures, i.e., multi-scale spatial concatenations of local features, as an intermediate-level representation of natural scenes. To compile the natural scene structures, we first sampled a large number of multi-scale circular scene patches in a hexagonal configuration. We then performed independent component analysis on the patches and classified the independent components into a set of clusters using the K-means method. Finally, we obtained a set of natural scene structures, each of which is characterized by a set of dominant clusters of independent components. We examined a range of statistics of the natural scene structures, compiled from two widely used datasets of natural scenes, and modeled their spatial arrangements at larger spatial scales using adjacency matrices. We found that the natural scene structures include a full range of concatenations of visual features in natural scenes, and can be used to encode spatial information at various scales. We then selected a set of natural scene structures with high information, and used the occurring frequencies and the eigenvalues of the adjacency matrices to classify scenes in the datasets. We found that the performance of this model is comparable to or better than the state-of-the-art models on the two datasets. These results suggest that the natural scene structures are a useful intermediate-level representation of visual scenes for our understanding of natural scene perception. Public Library of Science 2013-09-30 /pmc/articles/PMC3787016/ /pubmed/24098789 http://dx.doi.org/10.1371/journal.pone.0076393 Text en © 2013 Zhu and Yang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhu, Xiaoyuan
Yang, Zhiyong
Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title_full Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title_fullStr Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title_full_unstemmed Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title_short Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification
title_sort multi-scale spatial concatenations of local features in natural scenes and scene classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3787016/
https://www.ncbi.nlm.nih.gov/pubmed/24098789
http://dx.doi.org/10.1371/journal.pone.0076393
work_keys_str_mv AT zhuxiaoyuan multiscalespatialconcatenationsoflocalfeaturesinnaturalscenesandsceneclassification
AT yangzhiyong multiscalespatialconcatenationsoflocalfeaturesinnaturalscenesandsceneclassification