Cargando…

Identifying specular highlights: Insights from deep learning

Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highli...

Descripción completa

Detalles Bibliográficos
Autores principales:	Prokott, Eugen, Fleming, Roland W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Association for Research in Vision and Ophthalmology 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206496/ https://www.ncbi.nlm.nih.gov/pubmed/35713928 http://dx.doi.org/10.1167/jov.22.7.6

_version_	1784729342985633792
author	Prokott, Eugen Fleming, Roland W.
author_facet	Prokott, Eugen Fleming, Roland W.
author_sort	Prokott, Eugen
collection	PubMed
description	Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highlights. There is currently no image-computable model that emulates human highlight identification, so here we sought to develop a neural network that reproduces observers’ characteristic successes and failures. We rendered 179,085 images of glossy, undulating, textured surfaces. Given such images as input, a feedforward convolutional neural network was trained to output an image containing only the specular reflectance component. Participants viewed such images and reported whether or not specific pixels were highlights. The queried pixels were carefully selected to distinguish between ground truth and a simple thresholding of image intensity. The neural network outperformed the simple thresholding model—and ground truth—at predicting human responses. We then used a genetic algorithm to selectively delete connections within the neural network to identify variants of the network that approximated human judgments even more closely. The best resulting network shared 68% of the variance with human judgments—more than the unpruned network. As a first step toward interpreting the network, we then used representational similarity analysis to compare its inner representations to a wide variety of hand-engineered image features. We find that the network learns representations that are similar not only to directly image-computable predictors but also to more complex predictors such as intrinsic or geometric factors, as well as some indications of photo-geometrical constraints learned by the network. However, our network fails to replicate human response patterns to violations of photo-geometric constraints (rotated highlights) as described by other authors.
format	Online Article Text
id	pubmed-9206496
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	The Association for Research in Vision and Ophthalmology
record_format	MEDLINE/PubMed
spelling	pubmed-92064962022-06-19 Identifying specular highlights: Insights from deep learning Prokott, Eugen Fleming, Roland W. J Vis Article Specular highlights are the most important image feature for surface gloss perception. Yet, recognizing whether a bright patch in an image is due to specular reflection or some other cause (e.g., texture marking) is challenging, and it remains unclear how the visual system reliably identifies highlights. There is currently no image-computable model that emulates human highlight identification, so here we sought to develop a neural network that reproduces observers’ characteristic successes and failures. We rendered 179,085 images of glossy, undulating, textured surfaces. Given such images as input, a feedforward convolutional neural network was trained to output an image containing only the specular reflectance component. Participants viewed such images and reported whether or not specific pixels were highlights. The queried pixels were carefully selected to distinguish between ground truth and a simple thresholding of image intensity. The neural network outperformed the simple thresholding model—and ground truth—at predicting human responses. We then used a genetic algorithm to selectively delete connections within the neural network to identify variants of the network that approximated human judgments even more closely. The best resulting network shared 68% of the variance with human judgments—more than the unpruned network. As a first step toward interpreting the network, we then used representational similarity analysis to compare its inner representations to a wide variety of hand-engineered image features. We find that the network learns representations that are similar not only to directly image-computable predictors but also to more complex predictors such as intrinsic or geometric factors, as well as some indications of photo-geometrical constraints learned by the network. However, our network fails to replicate human response patterns to violations of photo-geometric constraints (rotated highlights) as described by other authors. The Association for Research in Vision and Ophthalmology 2022-06-17 /pmc/articles/PMC9206496/ /pubmed/35713928 http://dx.doi.org/10.1167/jov.22.7.6 Text en Copyright 2022 The Authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License.
spellingShingle	Article Prokott, Eugen Fleming, Roland W. Identifying specular highlights: Insights from deep learning
title	Identifying specular highlights: Insights from deep learning
title_full	Identifying specular highlights: Insights from deep learning
title_fullStr	Identifying specular highlights: Insights from deep learning
title_full_unstemmed	Identifying specular highlights: Insights from deep learning
title_short	Identifying specular highlights: Insights from deep learning
title_sort	identifying specular highlights: insights from deep learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206496/ https://www.ncbi.nlm.nih.gov/pubmed/35713928 http://dx.doi.org/10.1167/jov.22.7.6
work_keys_str_mv	AT prokotteugen identifyingspecularhighlightsinsightsfromdeeplearning AT flemingrolandw identifyingspecularhighlightsinsightsfromdeeplearning

Identifying specular highlights: Insights from deep learning

Ejemplares similares