Cargando…
Unsupervised learning predicts human perception and misperception of gloss
Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526360/ https://www.ncbi.nlm.nih.gov/pubmed/33958744 http://dx.doi.org/10.1038/s41562-021-01097-6 |
_version_ | 1784585858804875264 |
---|---|
author | Storrs, Katherine R. Anderson, Barton L. Fleming, Roland W. |
author_facet | Storrs, Katherine R. Anderson, Barton L. Fleming, Roland W. |
author_sort | Storrs, Katherine R. |
collection | PubMed |
description | Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of ‘successes’ and ‘errors’ in human perception. Linearly decoding specular reflectance from the model’s internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond. |
format | Online Article Text |
id | pubmed-8526360 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-85263602021-11-04 Unsupervised learning predicts human perception and misperception of gloss Storrs, Katherine R. Anderson, Barton L. Fleming, Roland W. Nat Hum Behav Article Reflectance, lighting and geometry combine in complex ways to create images. How do we disentangle these to perceive individual properties, such as surface glossiness? We suggest that brains disentangle properties by learning to model statistical structure in proximal images. To test this hypothesis, we trained unsupervised generative neural networks on renderings of glossy surfaces and compared their representations with human gloss judgements. The networks spontaneously cluster images according to distal properties such as reflectance and illumination, despite receiving no explicit information about these properties. Intriguingly, the resulting representations also predict the specific patterns of ‘successes’ and ‘errors’ in human perception. Linearly decoding specular reflectance from the model’s internal code predicts human gloss perception better than ground truth, supervised networks or control models, and it predicts, on an image-by-image basis, illusions of gloss perception caused by interactions between material, shape and lighting. Unsupervised learning may underlie many perceptual dimensions in vision and beyond. Nature Publishing Group UK 2021-05-06 2021 /pmc/articles/PMC8526360/ /pubmed/33958744 http://dx.doi.org/10.1038/s41562-021-01097-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Storrs, Katherine R. Anderson, Barton L. Fleming, Roland W. Unsupervised learning predicts human perception and misperception of gloss |
title | Unsupervised learning predicts human perception and misperception of gloss |
title_full | Unsupervised learning predicts human perception and misperception of gloss |
title_fullStr | Unsupervised learning predicts human perception and misperception of gloss |
title_full_unstemmed | Unsupervised learning predicts human perception and misperception of gloss |
title_short | Unsupervised learning predicts human perception and misperception of gloss |
title_sort | unsupervised learning predicts human perception and misperception of gloss |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8526360/ https://www.ncbi.nlm.nih.gov/pubmed/33958744 http://dx.doi.org/10.1038/s41562-021-01097-6 |
work_keys_str_mv | AT storrskatheriner unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss AT andersonbartonl unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss AT flemingrolandw unsupervisedlearningpredictshumanperceptionandmisperceptionofgloss |