Cargando…

A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes

Humans can categorize objects in complex natural scenes within 100–150 ms. This amazing ability of rapid categorization has motivated many computational models. Most of these models require extensive training to obtain a decision boundary in a very high dimensional (e.g., ∼6,000 in a leading model)...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Xiaofu, Yang, Zhiyong, Tsien, Joe Z.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3102072/
https://www.ncbi.nlm.nih.gov/pubmed/21647443
http://dx.doi.org/10.1371/journal.pone.0020002
_version_ 1782204342118383616
author He, Xiaofu
Yang, Zhiyong
Tsien, Joe Z.
author_facet He, Xiaofu
Yang, Zhiyong
Tsien, Joe Z.
author_sort He, Xiaofu
collection PubMed
description Humans can categorize objects in complex natural scenes within 100–150 ms. This amazing ability of rapid categorization has motivated many computational models. Most of these models require extensive training to obtain a decision boundary in a very high dimensional (e.g., ∼6,000 in a leading model) feature space and often categorize objects in natural scenes by categorizing the context that co-occurs with objects when objects do not occupy large portions of the scenes. It is thus unclear how humans achieve rapid scene categorization. To address this issue, we developed a hierarchical probabilistic model for rapid object categorization in natural scenes. In this model, a natural object category is represented by a coarse hierarchical probability distribution (PD), which includes PDs of object geometry and spatial configuration of object parts. Object parts are encoded by PDs of a set of natural object structures, each of which is a concatenation of local object features. Rapid categorization is performed as statistical inference. Since the model uses a very small number (∼100) of structures for even complex object categories such as animals and cars, it requires little training and is robust in the presence of large variations within object categories and in their occurrences in natural scenes. Remarkably, we found that the model categorized animals in natural scenes and cars in street scenes with a near human-level performance. We also found that the model located animals and cars in natural scenes, thus overcoming a flaw in many other models which is to categorize objects in natural context by categorizing contextual features. These results suggest that coarse PDs of object categories based on natural object structures and statistical operations on these PDs may underlie the human ability to rapidly categorize scenes.
format Text
id pubmed-3102072
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31020722011-06-06 A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes He, Xiaofu Yang, Zhiyong Tsien, Joe Z. PLoS One Research Article Humans can categorize objects in complex natural scenes within 100–150 ms. This amazing ability of rapid categorization has motivated many computational models. Most of these models require extensive training to obtain a decision boundary in a very high dimensional (e.g., ∼6,000 in a leading model) feature space and often categorize objects in natural scenes by categorizing the context that co-occurs with objects when objects do not occupy large portions of the scenes. It is thus unclear how humans achieve rapid scene categorization. To address this issue, we developed a hierarchical probabilistic model for rapid object categorization in natural scenes. In this model, a natural object category is represented by a coarse hierarchical probability distribution (PD), which includes PDs of object geometry and spatial configuration of object parts. Object parts are encoded by PDs of a set of natural object structures, each of which is a concatenation of local object features. Rapid categorization is performed as statistical inference. Since the model uses a very small number (∼100) of structures for even complex object categories such as animals and cars, it requires little training and is robust in the presence of large variations within object categories and in their occurrences in natural scenes. Remarkably, we found that the model categorized animals in natural scenes and cars in street scenes with a near human-level performance. We also found that the model located animals and cars in natural scenes, thus overcoming a flaw in many other models which is to categorize objects in natural context by categorizing contextual features. These results suggest that coarse PDs of object categories based on natural object structures and statistical operations on these PDs may underlie the human ability to rapidly categorize scenes. Public Library of Science 2011-05-25 /pmc/articles/PMC3102072/ /pubmed/21647443 http://dx.doi.org/10.1371/journal.pone.0020002 Text en He et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
He, Xiaofu
Yang, Zhiyong
Tsien, Joe Z.
A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title_full A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title_fullStr A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title_full_unstemmed A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title_short A Hierarchical Probabilistic Model for Rapid Object Categorization in Natural Scenes
title_sort hierarchical probabilistic model for rapid object categorization in natural scenes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3102072/
https://www.ncbi.nlm.nih.gov/pubmed/21647443
http://dx.doi.org/10.1371/journal.pone.0020002
work_keys_str_mv AT hexiaofu ahierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes
AT yangzhiyong ahierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes
AT tsienjoez ahierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes
AT hexiaofu hierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes
AT yangzhiyong hierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes
AT tsienjoez hierarchicalprobabilisticmodelforrapidobjectcategorizationinnaturalscenes