Cargando…

Mental geometry of perceiving 3D size in pictures

We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but...

Descripción completa

Detalles Bibliográficos
Autores principales: Maruya, Akihito, Zaidi, Qasim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581493/
https://www.ncbi.nlm.nih.gov/pubmed/33007082
http://dx.doi.org/10.1167/jov.20.10.4
_version_ 1783598989736673280
author Maruya, Akihito
Zaidi, Qasim
author_facet Maruya, Akihito
Zaidi, Qasim
author_sort Maruya, Akihito
collection PubMed
description We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but retinal projections of sizes and shapes are distorted in oblique viewing. For real scenes, we previously showed that size and shape inconstancy result despite observers using the correct geometric back-transform, because some retinal images evoke misestimates of object slant or viewing elevation. Now, we examine how observers estimate 3D sizes in oblique views of pictures of objects lying on the ground in different poses. Compared to estimates for real scenes, in oblique views of pictures, sizes were seriously underestimated for objects at frontoparallel poses, but there was almost no change for objects perceived as pointing toward the viewer. The inverse of the function relating projected length to pose, camera elevation and viewing azimuth, gives the optimal correction factor for inferring correct 3D lengths if the elevation and azimuth are estimated accurately. Empirical correction functions had similar shapes to optimal, but lower amplitude. Measurements revealed that observers systematically underestimated viewing azimuth, similar to the frontoparallel bias for object pose perception. A model that adds underestimation of viewing azimuth to the geometrical back-transform, provided good fits to estimated 3D lengths from oblique views. These results add to accumulating evidence that observers use internalized projective geometry to perceive sizes, shapes, and poses in 3D scenes and their pictures.
format Online
Article
Text
id pubmed-7581493
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-75814932020-10-30 Mental geometry of perceiving 3D size in pictures Maruya, Akihito Zaidi, Qasim J Vis Article We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but retinal projections of sizes and shapes are distorted in oblique viewing. For real scenes, we previously showed that size and shape inconstancy result despite observers using the correct geometric back-transform, because some retinal images evoke misestimates of object slant or viewing elevation. Now, we examine how observers estimate 3D sizes in oblique views of pictures of objects lying on the ground in different poses. Compared to estimates for real scenes, in oblique views of pictures, sizes were seriously underestimated for objects at frontoparallel poses, but there was almost no change for objects perceived as pointing toward the viewer. The inverse of the function relating projected length to pose, camera elevation and viewing azimuth, gives the optimal correction factor for inferring correct 3D lengths if the elevation and azimuth are estimated accurately. Empirical correction functions had similar shapes to optimal, but lower amplitude. Measurements revealed that observers systematically underestimated viewing azimuth, similar to the frontoparallel bias for object pose perception. A model that adds underestimation of viewing azimuth to the geometrical back-transform, provided good fits to estimated 3D lengths from oblique views. These results add to accumulating evidence that observers use internalized projective geometry to perceive sizes, shapes, and poses in 3D scenes and their pictures. The Association for Research in Vision and Ophthalmology 2020-10-02 /pmc/articles/PMC7581493/ /pubmed/33007082 http://dx.doi.org/10.1167/jov.20.10.4 Text en Copyright 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle Article
Maruya, Akihito
Zaidi, Qasim
Mental geometry of perceiving 3D size in pictures
title Mental geometry of perceiving 3D size in pictures
title_full Mental geometry of perceiving 3D size in pictures
title_fullStr Mental geometry of perceiving 3D size in pictures
title_full_unstemmed Mental geometry of perceiving 3D size in pictures
title_short Mental geometry of perceiving 3D size in pictures
title_sort mental geometry of perceiving 3d size in pictures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581493/
https://www.ncbi.nlm.nih.gov/pubmed/33007082
http://dx.doi.org/10.1167/jov.20.10.4
work_keys_str_mv AT maruyaakihito mentalgeometryofperceiving3dsizeinpictures
AT zaidiqasim mentalgeometryofperceiving3dsizeinpictures