Cargando…

Machine vision benefits from human contextual expectations

Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically...

Descripción completa

Detalles Bibliográficos
Autores principales: Katti, Harish, Peelen, Marius V., Arun, S. P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6375915/
https://www.ncbi.nlm.nih.gov/pubmed/30765753
http://dx.doi.org/10.1038/s41598-018-38427-0
_version_ 1783395445349810176
author Katti, Harish
Peelen, Marius V.
Arun, S. P.
author_facet Katti, Harish
Peelen, Marius V.
Arun, S. P.
author_sort Katti, Harish
collection PubMed
description Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically trained on images containing objects and their context, whereas humans frequently experience scenes without objects (such as highways without cars). If these context representations are indeed different, machine vision algorithms will be improved on augmenting them with human context representations, provided these expectations can be measured and are systematic. Here, we developed a paradigm to measure human contextual expectations. We asked human subjects to indicate the scale, location and likelihood at which cars or people might occur in scenes without these objects. This yielded highly systematic expectations that we could then accurately predict using scene features. This allowed us to predict human expectations on novel scenes without requiring explicit measurements. Next we augmented decisions made by deep neural networks with these predicted human expectations and obtained substantial gains in accuracy for detecting cars and people (1–3%) as well as on detecting associated objects (3–20%). In contrast, augmenting deep network decisions with other conventional computer vision features yielded far smaller gains. Taken together, our results show that augmenting deep neural networks with human-derived contextual expectations improves their performance, suggesting that contextual representations are qualitatively different in humans and deep neural networks.
format Online
Article
Text
id pubmed-6375915
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-63759152019-02-19 Machine vision benefits from human contextual expectations Katti, Harish Peelen, Marius V. Arun, S. P. Sci Rep Article Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically trained on images containing objects and their context, whereas humans frequently experience scenes without objects (such as highways without cars). If these context representations are indeed different, machine vision algorithms will be improved on augmenting them with human context representations, provided these expectations can be measured and are systematic. Here, we developed a paradigm to measure human contextual expectations. We asked human subjects to indicate the scale, location and likelihood at which cars or people might occur in scenes without these objects. This yielded highly systematic expectations that we could then accurately predict using scene features. This allowed us to predict human expectations on novel scenes without requiring explicit measurements. Next we augmented decisions made by deep neural networks with these predicted human expectations and obtained substantial gains in accuracy for detecting cars and people (1–3%) as well as on detecting associated objects (3–20%). In contrast, augmenting deep network decisions with other conventional computer vision features yielded far smaller gains. Taken together, our results show that augmenting deep neural networks with human-derived contextual expectations improves their performance, suggesting that contextual representations are qualitatively different in humans and deep neural networks. Nature Publishing Group UK 2019-02-14 /pmc/articles/PMC6375915/ /pubmed/30765753 http://dx.doi.org/10.1038/s41598-018-38427-0 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Katti, Harish
Peelen, Marius V.
Arun, S. P.
Machine vision benefits from human contextual expectations
title Machine vision benefits from human contextual expectations
title_full Machine vision benefits from human contextual expectations
title_fullStr Machine vision benefits from human contextual expectations
title_full_unstemmed Machine vision benefits from human contextual expectations
title_short Machine vision benefits from human contextual expectations
title_sort machine vision benefits from human contextual expectations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6375915/
https://www.ncbi.nlm.nih.gov/pubmed/30765753
http://dx.doi.org/10.1038/s41598-018-38427-0
work_keys_str_mv AT kattiharish machinevisionbenefitsfromhumancontextualexpectations
AT peelenmariusv machinevisionbenefitsfromhumancontextualexpectations
AT arunsp machinevisionbenefitsfromhumancontextualexpectations