Cargando…

Unsupervised learning predicts human perception and misperception of gloss

Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis...

Descripción completa

Detalles Bibliográficos
Autores principales: Storrs, Katherine R., Anderson, Barton L., Fleming, Roland W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526360/
https://www.ncbi.nlm.nih.gov/pubmed/33958744
http://dx.doi.org/10.1038/s41562-021-01097-6
_version_ 1784585858804875264
author Storrs, Katherine R.
Anderson, Barton L.
Fleming, Roland W.
author_facet Storrs, Katherine R.
Anderson, Barton L.
Fleming, Roland W.
author_sort Storrs, Katherine R.
collection PubMed
description Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of ‘successes’ and ‘errors’ in human perception. Linearly decoding specular reflectance from the model’s internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond.
format Online
Article
Text
id pubmed-8526360
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85263602021-11-04 Unsupervised learning predicts human perception and misperception of gloss Storrs, Katherine R. Anderson, Barton L. Fleming, Roland W. Nat Hum Behav Article Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of ‘successes’ and ‘errors’ in human perception. Linearly decoding specular reflectance from the model’s internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond. Nature Publishing Group UK 2021-05-06 2021 /pmc/articles/PMC8526360/ /pubmed/33958744 http://dx.doi.org/10.1038/s41562-021-01097-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Storrs, Katherine R.
Anderson, Barton L.
Fleming, Roland W.
Unsupervised learning predicts human perception and misperception of gloss
title Unsupervised learning predicts human perception and misperception of gloss
title_full Unsupervised learning predicts human perception and misperception of gloss
title_fullStr Unsupervised learning predicts human perception and misperception of gloss
title_full_unstemmed Unsupervised learning predicts human perception and misperception of gloss
title_short Unsupervised learning predicts human perception and misperception of gloss
title_sort unsupervised learning predicts human perception and misperception of gloss
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526360/
https://www.ncbi.nlm.nih.gov/pubmed/33958744
http://dx.doi.org/10.1038/s41562-021-01097-6
work_keys_str_mv AT storrskatheriner unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss
AT andersonbartonl unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss
AT flemingrolandw unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss