Cargando…
Mental geometry of perceiving 3D size in pictures
We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Association for Research in Vision and Ophthalmology
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581493/ https://www.ncbi.nlm.nih.gov/pubmed/33007082 http://dx.doi.org/10.1167/jov.20.10.4 |
_version_ | 1783598989736673280 |
---|---|
author | Maruya, Akihito Zaidi, Qasim |
author_facet | Maruya, Akihito Zaidi, Qasim |
author_sort | Maruya, Akihito |
collection | PubMed |
description | We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but retinal projections of sizes and shapes are distorted in oblique viewing. For real scenes, we previously showed that size and shape inconstancy result despite observers using the correct geometric back-transform, because some retinal images evoke misestimates of object slant or viewing elevation. Now, we examine how observers estimate 3D sizes in oblique views of pictures of objects lying on the ground in different poses. Compared to estimates for real scenes, in oblique views of pictures, sizes were seriously underestimated for objects at frontoparallel poses, but there was almost no change for objects perceived as pointing toward the viewer. The inverse of the function relating projected length to pose, camera elevation and viewing azimuth, gives the optimal correction factor for inferring correct 3D lengths if the elevation and azimuth are estimated accurately. Empirical correction functions had similar shapes to optimal, but lower amplitude. Measurements revealed that observers systematically underestimated viewing azimuth, similar to the frontoparallel bias for object pose perception. A model that adds underestimation of viewing azimuth to the geometrical back-transform, provided good fits to estimated 3D lengths from oblique views. These results add to accumulating evidence that observers use internalized projective geometry to perceive sizes, shapes, and poses in 3D scenes and their pictures. |
format | Online Article Text |
id | pubmed-7581493 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Association for Research in Vision and Ophthalmology |
record_format | MEDLINE/PubMed |
spelling | pubmed-75814932020-10-30 Mental geometry of perceiving 3D size in pictures Maruya, Akihito Zaidi, Qasim J Vis Article We show that the classical problem of three-dimensional (3D) size perception in obliquely viewed pictures can be understood by comparing human performance to the optimal geometric solution. A photograph seen from the camera position, can form the same retinal projection as the physical 3D scene, but retinal projections of sizes and shapes are distorted in oblique viewing. For real scenes, we previously showed that size and shape inconstancy result despite observers using the correct geometric back-transform, because some retinal images evoke misestimates of object slant or viewing elevation. Now, we examine how observers estimate 3D sizes in oblique views of pictures of objects lying on the ground in different poses. Compared to estimates for real scenes, in oblique views of pictures, sizes were seriously underestimated for objects at frontoparallel poses, but there was almost no change for objects perceived as pointing toward the viewer. The inverse of the function relating projected length to pose, camera elevation and viewing azimuth, gives the optimal correction factor for inferring correct 3D lengths if the elevation and azimuth are estimated accurately. Empirical correction functions had similar shapes to optimal, but lower amplitude. Measurements revealed that observers systematically underestimated viewing azimuth, similar to the frontoparallel bias for object pose perception. A model that adds underestimation of viewing azimuth to the geometrical back-transform, provided good fits to estimated 3D lengths from oblique views. These results add to accumulating evidence that observers use internalized projective geometry to perceive sizes, shapes, and poses in 3D scenes and their pictures. The Association for Research in Vision and Ophthalmology 2020-10-02 /pmc/articles/PMC7581493/ /pubmed/33007082 http://dx.doi.org/10.1167/jov.20.10.4 Text en Copyright 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. |
spellingShingle | Article Maruya, Akihito Zaidi, Qasim Mental geometry of perceiving 3D size in pictures |
title | Mental geometry of perceiving 3D size in pictures |
title_full | Mental geometry of perceiving 3D size in pictures |
title_fullStr | Mental geometry of perceiving 3D size in pictures |
title_full_unstemmed | Mental geometry of perceiving 3D size in pictures |
title_short | Mental geometry of perceiving 3D size in pictures |
title_sort | mental geometry of perceiving 3d size in pictures |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7581493/ https://www.ncbi.nlm.nih.gov/pubmed/33007082 http://dx.doi.org/10.1167/jov.20.10.4 |
work_keys_str_mv | AT maruyaakihito mentalgeometryofperceiving3dsizeinpictures AT zaidiqasim mentalgeometryofperceiving3dsizeinpictures |