Cargando…
Machine vision benefits from human contextual expectations
Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6375915/ https://www.ncbi.nlm.nih.gov/pubmed/30765753 http://dx.doi.org/10.1038/s41598-018-38427-0 |
_version_ | 1783395445349810176 |
---|---|
author | Katti, Harish Peelen, Marius V. Arun, S. P. |
author_facet | Katti, Harish Peelen, Marius V. Arun, S. P. |
author_sort | Katti, Harish |
collection | PubMed |
description | Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically trained on images containing objects and their context, whereas humans frequently experience scenes without objects (such as highways without cars). If these context representations are indeed different, machine vision algorithms will be improved on augmenting them with human context representations, provided these expectations can be measured and are systematic. Here, we developed a paradigm to measure human contextual expectations. We asked human subjects to indicate the scale, location and likelihood at which cars or people might occur in scenes without these objects. This yielded highly systematic expectations that we could then accurately predict using scene features. This allowed us to predict human expectations on novel scenes without requiring explicit measurements. Next we augmented decisions made by deep neural networks with these predicted human expectations and obtained substantial gains in accuracy for detecting cars and people (1–3%) as well as on detecting associated objects (3–20%). In contrast, augmenting deep network decisions with other conventional computer vision features yielded far smaller gains. Taken together, our results show that augmenting deep neural networks with human-derived contextual expectations improves their performance, suggesting that contextual representations are qualitatively different in humans and deep neural networks. |
format | Online Article Text |
id | pubmed-6375915 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-63759152019-02-19 Machine vision benefits from human contextual expectations Katti, Harish Peelen, Marius V. Arun, S. P. Sci Rep Article Scene context is known to facilitate object recognition in both machines and humans, suggesting that the underlying representations may be similar. Alternatively, they may be qualitatively different since the training experience of machines and humans are strikingly different. Machines are typically trained on images containing objects and their context, whereas humans frequently experience scenes without objects (such as highways without cars). If these context representations are indeed different, machine vision algorithms will be improved on augmenting them with human context representations, provided these expectations can be measured and are systematic. Here, we developed a paradigm to measure human contextual expectations. We asked human subjects to indicate the scale, location and likelihood at which cars or people might occur in scenes without these objects. This yielded highly systematic expectations that we could then accurately predict using scene features. This allowed us to predict human expectations on novel scenes without requiring explicit measurements. Next we augmented decisions made by deep neural networks with these predicted human expectations and obtained substantial gains in accuracy for detecting cars and people (1–3%) as well as on detecting associated objects (3–20%). In contrast, augmenting deep network decisions with other conventional computer vision features yielded far smaller gains. Taken together, our results show that augmenting deep neural networks with human-derived contextual expectations improves their performance, suggesting that contextual representations are qualitatively different in humans and deep neural networks. Nature Publishing Group UK 2019-02-14 /pmc/articles/PMC6375915/ /pubmed/30765753 http://dx.doi.org/10.1038/s41598-018-38427-0 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Katti, Harish Peelen, Marius V. Arun, S. P. Machine vision benefits from human contextual expectations |
title | Machine vision benefits from human contextual expectations |
title_full | Machine vision benefits from human contextual expectations |
title_fullStr | Machine vision benefits from human contextual expectations |
title_full_unstemmed | Machine vision benefits from human contextual expectations |
title_short | Machine vision benefits from human contextual expectations |
title_sort | machine vision benefits from human contextual expectations |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6375915/ https://www.ncbi.nlm.nih.gov/pubmed/30765753 http://dx.doi.org/10.1038/s41598-018-38427-0 |
work_keys_str_mv | AT kattiharish machinevisionbenefitsfromhumancontextualexpectations AT peelenmariusv machinevisionbenefitsfromhumancontextualexpectations AT arunsp machinevisionbenefitsfromhumancontextualexpectations |