Basic level scene understanding: categories, attributes and structures

A longstanding goal of computer vision is to build a system that can automatically understand a 3D scene from a single image. This requires extracting semantic concepts and 3D information from 2D images which can depict an enormous variety of environments that comprise our visual world. This paper s...

Bibliographic Details
Main Authors:	Xiao, Jianxiong; Hays, James; Russell, Bryan C.; Patterson, Genevieve; Ehinger, Krista A.; Torralba, Antonio; Oliva, Aude
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2013
Subjects:	Psychology
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3756302/ https://www.ncbi.nlm.nih.gov/pubmed/24009590 http://dx.doi.org/10.3389/fpsyg.2013.00506

_version_	1782282070736764928
author	Xiao, Jianxiong Hays, James Russell, Bryan C. Patterson, Genevieve Ehinger, Krista A. Torralba, Antonio Oliva, Aude
author_facet	Xiao, Jianxiong Hays, James Russell, Bryan C. Patterson, Genevieve Ehinger, Krista A. Torralba, Antonio Oliva, Aude
author_sort	Xiao, Jianxiong
collection	PubMed
description	A longstanding goal of computer vision is to build a system that can automatically understand a 3D scene from a single image. This requires extracting semantic concepts and 3D information from 2D images which can depict an enormous variety of environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the richly annotated SUN database which is a collection of annotated images spanning 908 different scene categories with object, attribute, and geometric labels for many scenes. This database allows us to systematically study the space of scenes and to establish a benchmark for scene and object recognition. We augment the categorical SUN database with 102 scene attributes for every image and explore attribute recognition. Finally, we present an integrated system to extract the 3D structure of the scene and objects depicted in an image.
format	Online Article Text
id	pubmed-3756302
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-37563022013-09-04 Basic level scene understanding: categories, attributes and structures Xiao, Jianxiong Hays, James Russell, Bryan C. Patterson, Genevieve Ehinger, Krista A. Torralba, Antonio Oliva, Aude Front Psychol Psychology A longstanding goal of computer vision is to build a system that can automatically understand a 3D scene from a single image. This requires extracting semantic concepts and 3D information from 2D images which can depict an enormous variety of environments that comprise our visual world. This paper summarizes our recent efforts toward these goals. First, we describe the richly annotated SUN database which is a collection of annotated images spanning 908 different scene categories with object, attribute, and geometric labels for many scenes. This database allows us to systematically study the space of scenes and to establish a benchmark for scene and object recognition. We augment the categorical SUN database with 102 scene attributes for every image and explore attribute recognition. Finally, we present an integrated system to extract the 3D structure of the scene and objects depicted in an image. Frontiers Media S.A. 2013-08-29 /pmc/articles/PMC3756302/ /pubmed/24009590 http://dx.doi.org/10.3389/fpsyg.2013.00506 Text en Copyright © 2013 Xiao, Hays, Russell, Patterson, Ehinger, Torralba and Oliva. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Xiao, Jianxiong Hays, James Russell, Bryan C. Patterson, Genevieve Ehinger, Krista A. Torralba, Antonio Oliva, Aude Basic level scene understanding: categories, attributes and structures
title	Basic level scene understanding: categories, attributes and structures
title_full	Basic level scene understanding: categories, attributes and structures
title_fullStr	Basic level scene understanding: categories, attributes and structures
title_full_unstemmed	Basic level scene understanding: categories, attributes and structures
title_short	Basic level scene understanding: categories, attributes and structures
title_sort	basic level scene understanding: categories, attributes and structures
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3756302/ https://www.ncbi.nlm.nih.gov/pubmed/24009590 http://dx.doi.org/10.3389/fpsyg.2013.00506
work_keys_str_mv	AT xiaojianxiong basiclevelsceneunderstandingcategoriesattributesandstructures AT haysjames basiclevelsceneunderstandingcategoriesattributesandstructures AT russellbryanc basiclevelsceneunderstandingcategoriesattributesandstructures AT pattersongenevieve basiclevelsceneunderstandingcategoriesattributesandstructures AT ehingerkristaa basiclevelsceneunderstandingcategoriesattributesandstructures AT torralbaantonio basiclevelsceneunderstandingcategoriesattributesandstructures AT olivaaude basiclevelsceneunderstandingcategoriesattributesandstructures

Similar Items