Cargando…

Category systems for real-world scenes

Categorization performance is a popular metric of scene recognition and understanding in behavioral and computational research. However, categorical constructs and their labels can be somewhat arbitrary. Derived from exhaustive vocabularies of place names (e.g., Deng et al., 2009), or the judgements...

Descripción completa

Detalles Bibliográficos
Autores principales: Anderson, Matt D., Graf, Erich W., Elder, James H., Ehinger, Krista A., Adams, Wendy J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900870/
https://www.ncbi.nlm.nih.gov/pubmed/33595646
http://dx.doi.org/10.1167/jov.21.2.8
_version_ 1783654297368526848
author Anderson, Matt D.
Graf, Erich W.
Elder, James H.
Ehinger, Krista A.
Adams, Wendy J.
author_facet Anderson, Matt D.
Graf, Erich W.
Elder, James H.
Ehinger, Krista A.
Adams, Wendy J.
author_sort Anderson, Matt D.
collection PubMed
description Categorization performance is a popular metric of scene recognition and understanding in behavioral and computational research. However, categorical constructs and their labels can be somewhat arbitrary. Derived from exhaustive vocabularies of place names (e.g., Deng et al., 2009), or the judgements of small groups of researchers (e.g., Fei-Fei, Iyer, Koch, & Perona, 2007), these categories may not correspond with human-preferred taxonomies. Here, we propose clustering by increasing the rand index via coordinate ascent (CIRCA): an unsupervised, data-driven clustering method for deriving ground-truth scene categories. In Experiment 1, human participants organized 80 stereoscopic images of outdoor scenes from the Southampton-York Natural Scenes (SYNS) dataset (Adams et al., 2016) into discrete categories. In separate tasks, images were grouped according to i) semantic content, ii) three-dimensional spatial structure, or iii) two-dimensional image appearance. Participants provided text labels for each group. Using the CIRCA method, we determined the most representative category structure and then derived category labels for each task/dimension. In Experiment 2, we found that these categories generalized well to a larger set of SYNS images, and new observers. In Experiment 3, we tested the relationship between our category systems and the spatial envelope model (Oliva & Torralba, 2001). Finally, in Experiment 4, we validated CIRCA on a larger, independent dataset of same-different category judgements. The derived category systems outperformed the SUN taxonomy (Xiao, Hays, Ehinger, Oliva, & Torralba, 2010) and an alternative clustering method (Greene, 2019). In summary, we believe this novel categorization method can be applied to a wide range of datasets to derive optimal categorical groupings and labels from psychophysical judgements of stimulus similarity.
format Online
Article
Text
id pubmed-7900870
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-79008702021-03-03 Category systems for real-world scenes Anderson, Matt D. Graf, Erich W. Elder, James H. Ehinger, Krista A. Adams, Wendy J. J Vis Article Categorization performance is a popular metric of scene recognition and understanding in behavioral and computational research. However, categorical constructs and their labels can be somewhat arbitrary. Derived from exhaustive vocabularies of place names (e.g., Deng et al., 2009), or the judgements of small groups of researchers (e.g., Fei-Fei, Iyer, Koch, & Perona, 2007), these categories may not correspond with human-preferred taxonomies. Here, we propose clustering by increasing the rand index via coordinate ascent (CIRCA): an unsupervised, data-driven clustering method for deriving ground-truth scene categories. In Experiment 1, human participants organized 80 stereoscopic images of outdoor scenes from the Southampton-York Natural Scenes (SYNS) dataset (Adams et al., 2016) into discrete categories. In separate tasks, images were grouped according to i) semantic content, ii) three-dimensional spatial structure, or iii) two-dimensional image appearance. Participants provided text labels for each group. Using the CIRCA method, we determined the most representative category structure and then derived category labels for each task/dimension. In Experiment 2, we found that these categories generalized well to a larger set of SYNS images, and new observers. In Experiment 3, we tested the relationship between our category systems and the spatial envelope model (Oliva & Torralba, 2001). Finally, in Experiment 4, we validated CIRCA on a larger, independent dataset of same-different category judgements. The derived category systems outperformed the SUN taxonomy (Xiao, Hays, Ehinger, Oliva, & Torralba, 2010) and an alternative clustering method (Greene, 2019). In summary, we believe this novel categorization method can be applied to a wide range of datasets to derive optimal categorical groupings and labels from psychophysical judgements of stimulus similarity. The Association for Research in Vision and Ophthalmology 2021-02-17 /pmc/articles/PMC7900870/ /pubmed/33595646 http://dx.doi.org/10.1167/jov.21.2.8 Text en Copyright 2021 The Authors http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle Article
Anderson, Matt D.
Graf, Erich W.
Elder, James H.
Ehinger, Krista A.
Adams, Wendy J.
Category systems for real-world scenes
title Category systems for real-world scenes
title_full Category systems for real-world scenes
title_fullStr Category systems for real-world scenes
title_full_unstemmed Category systems for real-world scenes
title_short Category systems for real-world scenes
title_sort category systems for real-world scenes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7900870/
https://www.ncbi.nlm.nih.gov/pubmed/33595646
http://dx.doi.org/10.1167/jov.21.2.8
work_keys_str_mv AT andersonmattd categorysystemsforrealworldscenes
AT graferichw categorysystemsforrealworldscenes
AT elderjamesh categorysystemsforrealworldscenes
AT ehingerkristaa categorysystemsforrealworldscenes
AT adamswendyj categorysystemsforrealworldscenes