Cargando…

An ecologically motivated image dataset for deep learning yields better models of human vision

Deep neural networks provide the current best models of visual information processing in the primate brain. Drawing on work from computer vision, the most commonly used networks are pretrained on data from the ImageNet Large Scale Visual Recognition Challenge. This dataset comprises images from 1,00...

Descripción completa

Detalles Bibliográficos
Autores principales: Mehrer, Johannes, Spoerer, Courtney J., Jones, Emer C., Kriegeskorte, Nikolaus, Kietzmann, Tim C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7923360/
https://www.ncbi.nlm.nih.gov/pubmed/33593900
http://dx.doi.org/10.1073/pnas.2011417118
_version_ 1783658891537547264
author Mehrer, Johannes
Spoerer, Courtney J.
Jones, Emer C.
Kriegeskorte, Nikolaus
Kietzmann, Tim C.
author_facet Mehrer, Johannes
Spoerer, Courtney J.
Jones, Emer C.
Kriegeskorte, Nikolaus
Kietzmann, Tim C.
author_sort Mehrer, Johannes
collection PubMed
description Deep neural networks provide the current best models of visual information processing in the primate brain. Drawing on work from computer vision, the most commonly used networks are pretrained on data from the ImageNet Large Scale Visual Recognition Challenge. This dataset comprises images from 1,000 categories, selected to provide a challenging testbed for automated visual object recognition systems. Moving beyond this common practice, we here introduce ecoset, a collection of >1.5 million images from 565 basic-level categories selected to better capture the distribution of objects relevant to humans. Ecoset categories were chosen to be both frequent in linguistic usage and concrete, thereby mirroring important physical objects in the world. We test the effects of training on this ecologically more valid dataset using multiple instances of two neural network architectures: AlexNet and vNet, a novel architecture designed to mimic the progressive increase in receptive field sizes along the human ventral stream. We show that training on ecoset leads to significant improvements in predicting representations in human higher-level visual cortex and perceptual judgments, surpassing the previous state of the art. Significant and highly consistent benefits are demonstrated for both architectures on two separate functional magnetic resonance imaging (fMRI) datasets and behavioral data, jointly covering responses to 1,292 visual stimuli from a wide variety of object categories. These results suggest that computational visual neuroscience may take better advantage of the deep learning framework by using image sets that reflect the human perceptual and cognitive experience. Ecoset and trained network models are openly available to the research community.
format Online
Article
Text
id pubmed-7923360
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-79233602021-03-10 An ecologically motivated image dataset for deep learning yields better models of human vision Mehrer, Johannes Spoerer, Courtney J. Jones, Emer C. Kriegeskorte, Nikolaus Kietzmann, Tim C. Proc Natl Acad Sci U S A Biological Sciences Deep neural networks provide the current best models of visual information processing in the primate brain. Drawing on work from computer vision, the most commonly used networks are pretrained on data from the ImageNet Large Scale Visual Recognition Challenge. This dataset comprises images from 1,000 categories, selected to provide a challenging testbed for automated visual object recognition systems. Moving beyond this common practice, we here introduce ecoset, a collection of >1.5 million images from 565 basic-level categories selected to better capture the distribution of objects relevant to humans. Ecoset categories were chosen to be both frequent in linguistic usage and concrete, thereby mirroring important physical objects in the world. We test the effects of training on this ecologically more valid dataset using multiple instances of two neural network architectures: AlexNet and vNet, a novel architecture designed to mimic the progressive increase in receptive field sizes along the human ventral stream. We show that training on ecoset leads to significant improvements in predicting representations in human higher-level visual cortex and perceptual judgments, surpassing the previous state of the art. Significant and highly consistent benefits are demonstrated for both architectures on two separate functional magnetic resonance imaging (fMRI) datasets and behavioral data, jointly covering responses to 1,292 visual stimuli from a wide variety of object categories. These results suggest that computational visual neuroscience may take better advantage of the deep learning framework by using image sets that reflect the human perceptual and cognitive experience. Ecoset and trained network models are openly available to the research community. National Academy of Sciences 2021-02-23 2021-02-15 /pmc/articles/PMC7923360/ /pubmed/33593900 http://dx.doi.org/10.1073/pnas.2011417118 Text en Copyright © 2021 the Author(s). Published by PNAS. http://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Mehrer, Johannes
Spoerer, Courtney J.
Jones, Emer C.
Kriegeskorte, Nikolaus
Kietzmann, Tim C.
An ecologically motivated image dataset for deep learning yields better models of human vision
title An ecologically motivated image dataset for deep learning yields better models of human vision
title_full An ecologically motivated image dataset for deep learning yields better models of human vision
title_fullStr An ecologically motivated image dataset for deep learning yields better models of human vision
title_full_unstemmed An ecologically motivated image dataset for deep learning yields better models of human vision
title_short An ecologically motivated image dataset for deep learning yields better models of human vision
title_sort ecologically motivated image dataset for deep learning yields better models of human vision
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7923360/
https://www.ncbi.nlm.nih.gov/pubmed/33593900
http://dx.doi.org/10.1073/pnas.2011417118
work_keys_str_mv AT mehrerjohannes anecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT spoerercourtneyj anecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT jonesemerc anecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT kriegeskortenikolaus anecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT kietzmanntimc anecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT mehrerjohannes ecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT spoerercourtneyj ecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT jonesemerc ecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT kriegeskortenikolaus ecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision
AT kietzmanntimc ecologicallymotivatedimagedatasetfordeeplearningyieldsbettermodelsofhumanvision