Cargando…

Estimating 3D tilt from local image cues in natural scenes

Estimating three-dimensional (3D) surface orientation (slant and tilt) is an important first step toward estimating 3D shape. Here, we examine how three local image cues from the same location (disparity gradient, luminance gradient, and dominant texture orientation) should be combined to estimate 3...

Descripción completa

Detalles Bibliográficos
Autores principales: Burge, Johannes, McCann, Brian C., Geisler, Wilson S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066913/
https://www.ncbi.nlm.nih.gov/pubmed/27738702
http://dx.doi.org/10.1167/16.13.2
_version_ 1782460564257112064
author Burge, Johannes
McCann, Brian C.
Geisler, Wilson S.
author_facet Burge, Johannes
McCann, Brian C.
Geisler, Wilson S.
author_sort Burge, Johannes
collection PubMed
description Estimating three-dimensional (3D) surface orientation (slant and tilt) is an important first step toward estimating 3D shape. Here, we examine how three local image cues from the same location (disparity gradient, luminance gradient, and dominant texture orientation) should be combined to estimate 3D tilt in natural scenes. We collected a database of natural stereoscopic images with precisely co-registered range images that provide the ground-truth distance at each pixel location. We then analyzed the relationship between ground-truth tilt and image cue values. Our analysis is free of assumptions about the joint probability distributions and yields the Bayes optimal estimates of tilt, given the cue values. Rich results emerge: (a) typical tilt estimates are only moderately accurate and strongly influenced by the cardinal bias in the prior probability distribution; (b) when cue values are similar, or when slant is greater than 40°, estimates are substantially more accurate; (c) when luminance and texture cues agree, they often veto the disparity cue, and when they disagree, they have little effect; and (d) simplifying assumptions common in the cue combination literature is often justified for estimating tilt in natural scenes. The fact that tilt estimates are typically not very accurate is consistent with subjective impressions from viewing small patches of natural scene. The fact that estimates are substantially more accurate for a subset of image locations is also consistent with subjective impressions and with the hypothesis that perceived surface orientation, at more global scales, is achieved by interpolation or extrapolation from estimates at key locations.
format Online
Article
Text
id pubmed-5066913
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-50669132016-10-18 Estimating 3D tilt from local image cues in natural scenes Burge, Johannes McCann, Brian C. Geisler, Wilson S. J Vis Article Estimating three-dimensional (3D) surface orientation (slant and tilt) is an important first step toward estimating 3D shape. Here, we examine how three local image cues from the same location (disparity gradient, luminance gradient, and dominant texture orientation) should be combined to estimate 3D tilt in natural scenes. We collected a database of natural stereoscopic images with precisely co-registered range images that provide the ground-truth distance at each pixel location. We then analyzed the relationship between ground-truth tilt and image cue values. Our analysis is free of assumptions about the joint probability distributions and yields the Bayes optimal estimates of tilt, given the cue values. Rich results emerge: (a) typical tilt estimates are only moderately accurate and strongly influenced by the cardinal bias in the prior probability distribution; (b) when cue values are similar, or when slant is greater than 40°, estimates are substantially more accurate; (c) when luminance and texture cues agree, they often veto the disparity cue, and when they disagree, they have little effect; and (d) simplifying assumptions common in the cue combination literature is often justified for estimating tilt in natural scenes. The fact that tilt estimates are typically not very accurate is consistent with subjective impressions from viewing small patches of natural scene. The fact that estimates are substantially more accurate for a subset of image locations is also consistent with subjective impressions and with the hypothesis that perceived surface orientation, at more global scales, is achieved by interpolation or extrapolation from estimates at key locations. The Association for Research in Vision and Ophthalmology 2016-10-13 /pmc/articles/PMC5066913/ /pubmed/27738702 http://dx.doi.org/10.1167/16.13.2 Text en http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle Article
Burge, Johannes
McCann, Brian C.
Geisler, Wilson S.
Estimating 3D tilt from local image cues in natural scenes
title Estimating 3D tilt from local image cues in natural scenes
title_full Estimating 3D tilt from local image cues in natural scenes
title_fullStr Estimating 3D tilt from local image cues in natural scenes
title_full_unstemmed Estimating 3D tilt from local image cues in natural scenes
title_short Estimating 3D tilt from local image cues in natural scenes
title_sort estimating 3d tilt from local image cues in natural scenes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066913/
https://www.ncbi.nlm.nih.gov/pubmed/27738702
http://dx.doi.org/10.1167/16.13.2
work_keys_str_mv AT burgejohannes estimating3dtiltfromlocalimagecuesinnaturalscenes
AT mccannbrianc estimating3dtiltfromlocalimagecuesinnaturalscenes
AT geislerwilsons estimating3dtiltfromlocalimagecuesinnaturalscenes