Cargando…

What is “Where”: Physical Reasoning Informs Object Location

A central puzzle the visual system tries to solve is: “what is where?” While a great deal of research attempts to model object recognition (“what”), a comparatively smaller body of work seeks to model object location (“where”), especially in perceiving everyday objects. How do people locate an objec...

Descripción completa

Detalles Bibliográficos
Autores principales: Boger, Tal, Ullman, Tomer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MIT Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320814/
https://www.ncbi.nlm.nih.gov/pubmed/37416073
http://dx.doi.org/10.1162/opmi_a_00075
_version_ 1785068506457309184
author Boger, Tal
Ullman, Tomer
author_facet Boger, Tal
Ullman, Tomer
author_sort Boger, Tal
collection PubMed
description A central puzzle the visual system tries to solve is: “what is where?” While a great deal of research attempts to model object recognition (“what”), a comparatively smaller body of work seeks to model object location (“where”), especially in perceiving everyday objects. How do people locate an object, right now, in front of them? In three experiments collecting over 35,000 judgements on stimuli spanning different levels of realism (line drawings, real images, and crude forms), participants clicked “where” an object is, as if pointing to it. We modeled their responses with eight different methods, including both human response-based models (judgements of physical reasoning, spatial memory, free-response “click anywhere” judgements, and judgements of where people would grab the object), and image-based models (uniform distributions over the image, convex hull, saliency map, and medial axis). Physical reasoning was the best predictor of “where,” performing significantly better than even spatial memory and free-response judgements. Our results offer insight into the perception of object locations while also raising interesting questions about the relationship between physical reasoning and visual perception.
format Online
Article
Text
id pubmed-10320814
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MIT Press
record_format MEDLINE/PubMed
spelling pubmed-103208142023-07-06 What is “Where”: Physical Reasoning Informs Object Location Boger, Tal Ullman, Tomer Open Mind (Camb) Research Article A central puzzle the visual system tries to solve is: “what is where?” While a great deal of research attempts to model object recognition (“what”), a comparatively smaller body of work seeks to model object location (“where”), especially in perceiving everyday objects. How do people locate an object, right now, in front of them? In three experiments collecting over 35,000 judgements on stimuli spanning different levels of realism (line drawings, real images, and crude forms), participants clicked “where” an object is, as if pointing to it. We modeled their responses with eight different methods, including both human response-based models (judgements of physical reasoning, spatial memory, free-response “click anywhere” judgements, and judgements of where people would grab the object), and image-based models (uniform distributions over the image, convex hull, saliency map, and medial axis). Physical reasoning was the best predictor of “where,” performing significantly better than even spatial memory and free-response judgements. Our results offer insight into the perception of object locations while also raising interesting questions about the relationship between physical reasoning and visual perception. MIT Press 2023-05-01 /pmc/articles/PMC10320814/ /pubmed/37416073 http://dx.doi.org/10.1162/opmi_a_00075 Text en © 2023 Massachusetts Institute of Technology https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/.
spellingShingle Research Article
Boger, Tal
Ullman, Tomer
What is “Where”: Physical Reasoning Informs Object Location
title What is “Where”: Physical Reasoning Informs Object Location
title_full What is “Where”: Physical Reasoning Informs Object Location
title_fullStr What is “Where”: Physical Reasoning Informs Object Location
title_full_unstemmed What is “Where”: Physical Reasoning Informs Object Location
title_short What is “Where”: Physical Reasoning Informs Object Location
title_sort what is “where”: physical reasoning informs object location
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10320814/
https://www.ncbi.nlm.nih.gov/pubmed/37416073
http://dx.doi.org/10.1162/opmi_a_00075
work_keys_str_mv AT bogertal whatiswherephysicalreasoninginformsobjectlocation
AT ullmantomer whatiswherephysicalreasoninginformsobjectlocation