Cargando…

Identifying specular highlights: Insights from deep learning

Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highli...

Descripción completa

Detalles Bibliográficos
Autores principales: Prokott, Eugen, Fleming, Roland W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Association for Research in Vision and Ophthalmology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206496/
https://www.ncbi.nlm.nih.gov/pubmed/35713928
http://dx.doi.org/10.1167/jov.22.7.6
_version_ 1784729342985633792
author Prokott, Eugen
Fleming, Roland W.
author_facet Prokott, Eugen
Fleming, Roland W.
author_sort Prokott, Eugen
collection PubMed
description Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highlights. There is currently no image-computable model that emulates human highlight identification, so here we sought to develop a neural network that reproduces observers’ characteristic successes and failures. We rendered 179,085 images of glossy, undulating, textured surfaces. Given such images as input, a feedforward convolutional neural network was trained to output an image containing only the specular reflectance component. Participants viewed such images and reported whether or not specific pixels were highlights. The queried pixels were carefully selected to distinguish between ground truth and a simple thresholding of image intensity. The neural network outperformed the simple thresholding model—and ground truth—at predicting human responses. We then used a genetic algorithm to selectively delete connections within the neural network to identify variants of the network that approximated human judgments even more closely. The best resulting network shared 68% of the variance with human judgments—more than the unpruned network. As a first step toward interpreting the network, we then used representational similarity analysis to compare its inner representations to a wide variety of hand-engineered image features. We find that the network learns representations that are similar not only to directly image-computable predictors but also to more complex predictors such as intrinsic or geometric factors, as well as some indications of photo-geometrical constraints learned by the network. However, our network fails to replicate human response patterns to violations of photo-geometric constraints (rotated highlights) as described by other authors.
format Online
Article
Text
id pubmed-9206496
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Association for Research in Vision and Ophthalmology
record_format MEDLINE/PubMed
spelling pubmed-92064962022-06-19 Identifying specular highlights: Insights from deep learning Prokott, Eugen Fleming, Roland W. J Vis Article Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highlights. There is currently no image-computable model that emulates human highlight identification, so here we sought to develop a neural network that reproduces observers’ characteristic successes and failures. We rendered 179,085 images of glossy, undulating, textured surfaces. Given such images as input, a feedforward convolutional neural network was trained to output an image containing only the specular reflectance component. Participants viewed such images and reported whether or not specific pixels were highlights. The queried pixels were carefully selected to distinguish between ground truth and a simple thresholding of image intensity. The neural network outperformed the simple thresholding model—and ground truth—at predicting human responses. We then used a genetic algorithm to selectively delete connections within the neural network to identify variants of the network that approximated human judgments even more closely. The best resulting network shared 68% of the variance with human judgments—more than the unpruned network. As a first step toward interpreting the network, we then used representational similarity analysis to compare its inner representations to a wide variety of hand-engineered image features. We find that the network learns representations that are similar not only to directly image-computable predictors but also to more complex predictors such as intrinsic or geometric factors, as well as some indications of photo-geometrical constraints learned by the network. However, our network fails to replicate human response patterns to violations of photo-geometric constraints (rotated highlights) as described by other authors. The Association for Research in Vision and Ophthalmology 2022-06-17 /pmc/articles/PMC9206496/ /pubmed/35713928 http://dx.doi.org/10.1167/jov.22.7.6 Text en Copyright 2022 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle Article
Prokott, Eugen
Fleming, Roland W.
Identifying specular highlights: Insights from deep learning
title Identifying specular highlights: Insights from deep learning
title_full Identifying specular highlights: Insights from deep learning
title_fullStr Identifying specular highlights: Insights from deep learning
title_full_unstemmed Identifying specular highlights: Insights from deep learning
title_short Identifying specular highlights: Insights from deep learning
title_sort identifying specular highlights: insights from deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206496/
https://www.ncbi.nlm.nih.gov/pubmed/35713928
http://dx.doi.org/10.1167/jov.22.7.6
work_keys_str_mv AT prokotteugen identifyingspecularhighlightsinsightsfromdeeplearning
AT flemingrolandw identifyingspecularhighlightsinsightsfromdeeplearning