Cargando…

Predicting human complexity perception of real-world scenes

Perceptual load is a well-established determinant of attentional engagement in a task. So far, perceptual load has typically been manipulated by increasing either the number of task-relevant items or the perceptual processing demand (e.g. conjunction versus feature tasks). The tasks used often invol...

Descripción completa

Detalles Bibliográficos
Autores principales:	Nagle, Fintan, Lavie, Nilli
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Royal Society 2020
Materias:	Psychology and Cognitive Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7277246/ https://www.ncbi.nlm.nih.gov/pubmed/32537189 http://dx.doi.org/10.1098/rsos.191487

_version_	1783543079659110400
author	Nagle, Fintan Lavie, Nilli
author_facet	Nagle, Fintan Lavie, Nilli
author_sort	Nagle, Fintan
collection	PubMed
description	Perceptual load is a well-established determinant of attentional engagement in a task. So far, perceptual load has typically been manipulated by increasing either the number of task-relevant items or the perceptual processing demand (e.g. conjunction versus feature tasks). The tasks used often involved rather simple visual displays (e.g. letters or single objects). How can perceptual load be operationalized for richer, real-world images? A promising proxy is the visual complexity of an image. However, current predictive models for visual complexity have limited applicability to diverse real-world images. Here we modelled visual complexity using a deep convolutional neural network (CNN) trained to learn perceived ratings of visual complexity. We presented 53 observers with 4000 images from the PASCAL VOC dataset, obtaining 75 020 2-alternative forced choice paired comparisons across observers. Image visual complexity scores were obtained using the TrueSkill algorithm. A CNN with weights pre-trained on an object recognition task predicted complexity ratings with r = 0.83. By contrast, feature-based models used in the literature, working on image statistics such as entropy, edge density and JPEG compression ratio, only achieved r = 0.70. Thus, our model offers a promising method to quantify the perceptual load of real-world scenes through visual complexity.
format	Online Article Text
id	pubmed-7277246
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	The Royal Society
record_format	MEDLINE/PubMed
spelling	pubmed-72772462020-06-11 Predicting human complexity perception of real-world scenes Nagle, Fintan Lavie, Nilli R Soc Open Sci Psychology and Cognitive Neuroscience Perceptual load is a well-established determinant of attentional engagement in a task. So far, perceptual load has typically been manipulated by increasing either the number of task-relevant items or the perceptual processing demand (e.g. conjunction versus feature tasks). The tasks used often involved rather simple visual displays (e.g. letters or single objects). How can perceptual load be operationalized for richer, real-world images? A promising proxy is the visual complexity of an image. However, current predictive models for visual complexity have limited applicability to diverse real-world images. Here we modelled visual complexity using a deep convolutional neural network (CNN) trained to learn perceived ratings of visual complexity. We presented 53 observers with 4000 images from the PASCAL VOC dataset, obtaining 75 020 2-alternative forced choice paired comparisons across observers. Image visual complexity scores were obtained using the TrueSkill algorithm. A CNN with weights pre-trained on an object recognition task predicted complexity ratings with r = 0.83. By contrast, feature-based models used in the literature, working on image statistics such as entropy, edge density and JPEG compression ratio, only achieved r = 0.70. Thus, our model offers a promising method to quantify the perceptual load of real-world scenes through visual complexity. The Royal Society 2020-05-13 /pmc/articles/PMC7277246/ /pubmed/32537189 http://dx.doi.org/10.1098/rsos.191487 Text en © 2020 The Authors. http://creativecommons.org/licenses/by/4.0/ http://creativecommons.org/licenses/by/4.0/http://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle	Psychology and Cognitive Neuroscience Nagle, Fintan Lavie, Nilli Predicting human complexity perception of real-world scenes
title	Predicting human complexity perception of real-world scenes
title_full	Predicting human complexity perception of real-world scenes
title_fullStr	Predicting human complexity perception of real-world scenes
title_full_unstemmed	Predicting human complexity perception of real-world scenes
title_short	Predicting human complexity perception of real-world scenes
title_sort	predicting human complexity perception of real-world scenes
topic	Psychology and Cognitive Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7277246/ https://www.ncbi.nlm.nih.gov/pubmed/32537189 http://dx.doi.org/10.1098/rsos.191487
work_keys_str_mv	AT naglefintan predictinghumancomplexityperceptionofrealworldscenes AT lavienilli predictinghumancomplexityperceptionofrealworldscenes

Predicting human complexity perception of real-world scenes

Ejemplares similares