Cargando…

Distinguishing mirror from glass: A “big data” approach to material perception

Distinguishing mirror from glass is a challenging visual inference, because both materials derive their appearance from their surroundings, yet we rarely experience difficulties in telling them apart. Very few studies have investigated how the visual system distinguishes reflections from refractions...

Descripción completa

Detalles Bibliográficos
Autores principales: Tamura, Hideki, Prokott, Konrad Eugen, Fleming, Roland W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8934559/
https://www.ncbi.nlm.nih.gov/pubmed/35266961
http://dx.doi.org/10.1167/jov.22.4.4
_version_ 1784671874090795008
author Tamura, Hideki
Prokott, Konrad Eugen
Fleming, Roland W.
author_facet Tamura, Hideki
Prokott, Konrad Eugen
Fleming, Roland W.
author_sort Tamura, Hideki
collection PubMed
description Distinguishing mirror from glass is a challenging visual inference, because both materials derive their appearance from their surroundings, yet we rarely experience difficulties in telling them apart. Very few studies have investigated how the visual system distinguishes reflections from refractions and to date, there is no image-computable model that emulates human judgments. Here we sought to develop a deep neural network that reproduces the patterns of visual judgments human observers make. To do this, we trained thousands of convolutional neural networks on more than 750,000 simulated mirror and glass objects, and compared their performance with human judgments, as well as alternative classifiers based on “hand-engineered” image features. For randomly chosen images, all classifiers and humans performed with high accuracy, and therefore correlated highly with one another. However, to assess how similar models are to humans, it is not sufficient to compare accuracy or correlation on random images. A good model should also predict the characteristic errors that humans make. We, therefore, painstakingly assembled a diagnostic image set for which humans make systematic errors, allowing us to isolate signatures of human-like performance. A large-scale, systematic search through feedforward neural architectures revealed that relatively shallow (three-layer) networks predicted human judgments better than any other models we tested. This is the first image-computable model that emulates human errors and succeeds in distinguishing mirror from glass, and hints that mid-level visual processing might be particularly important for the task.
format Online
Article
Text
id pubmed-8934559
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-89345592022-03-21 Distinguishing mirror from glass: A “big data” approach to material perception Tamura, Hideki Prokott, Konrad Eugen Fleming, Roland W. J Vis Article Distinguishing mirror from glass is a challenging visual inference, because both materials derive their appearance from their surroundings, yet we rarely experience difficulties in telling them apart. Very few studies have investigated how the visual system distinguishes reflections from refractions and to date, there is no image-computable model that emulates human judgments. Here we sought to develop a deep neural network that reproduces the patterns of visual judgments human observers make. To do this, we trained thousands of convolutional neural networks on more than 750,000 simulated mirror and glass objects, and compared their performance with human judgments, as well as alternative classifiers based on “hand-engineered” image features. For randomly chosen images, all classifiers and humans performed with high accuracy, and therefore correlated highly with one another. However, to assess how similar models are to humans, it is not sufficient to compare accuracy or correlation on random images. A good model should also predict the characteristic errors that humans make. We, therefore, painstakingly assembled a diagnostic image set for which humans make systematic errors, allowing us to isolate signatures of human-like performance. A large-scale, systematic search through feedforward neural architectures revealed that relatively shallow (three-layer) networks predicted human judgments better than any other models we tested. This is the first image-computable model that emulates human errors and succeeds in distinguishing mirror from glass, and hints that mid-level visual processing might be particularly important for the task. The Association for Research in Vision and Ophthalmology 2022-03-10 /pmc/articles/PMC8934559/ /pubmed/35266961 http://dx.doi.org/10.1167/jov.22.4.4 Text en Copyright 2022 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle Article
Tamura, Hideki
Prokott, Konrad Eugen
Fleming, Roland W.
Distinguishing mirror from glass: A “big data” approach to material perception
title Distinguishing mirror from glass: A “big data” approach to material perception
title_full Distinguishing mirror from glass: A “big data” approach to material perception
title_fullStr Distinguishing mirror from glass: A “big data” approach to material perception
title_full_unstemmed Distinguishing mirror from glass: A “big data” approach to material perception
title_short Distinguishing mirror from glass: A “big data” approach to material perception
title_sort distinguishing mirror from glass: a “big data” approach to material perception
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8934559/
https://www.ncbi.nlm.nih.gov/pubmed/35266961
http://dx.doi.org/10.1167/jov.22.4.4
work_keys_str_mv AT tamurahideki distinguishingmirrorfromglassabigdataapproachtomaterialperception
AT prokottkonradeugen distinguishingmirrorfromglassabigdataapproachtomaterialperception
AT flemingrolandw distinguishingmirrorfromglassabigdataapproachtomaterialperception