Cargando…
Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification
Achieving human-like visual abilities is a holy grail for machine vision, yet precisely how insights from human vision can improve machines has remained unclear. Here, we demonstrate two key conceptual advances: First, we show that most machine vision models are systematically different from human o...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611439/ https://www.ncbi.nlm.nih.gov/pubmed/32750809 http://dx.doi.org/10.1109/TPAMI.2020.3008107 |
_version_ | 1783605276979494912 |
---|---|
author | Pramod, RT Arun, SP |
author_facet | Pramod, RT Arun, SP |
author_sort | Pramod, RT |
collection | PubMed |
description | Achieving human-like visual abilities is a holy grail for machine vision, yet precisely how insights from human vision can improve machines has remained unclear. Here, we demonstrate two key conceptual advances: First, we show that most machine vision models are systematically different from human object perception. To do so, we collected a large dataset of perceptual distances between isolated objects in humans and asked whether these perceptual data can be predicted by many common machine vision algorithms. We found that while the best algorithms explain ∼70% of the variance in the perceptual data, all the algorithms we tested make systematic errors on several types of objects. In particular, machine algorithms underestimated distances between symmetric objects compared to human perception. Second, we show that fixing these systematic biases can lead to substantial gains in classification performance. In particular, augmenting a state-of-the-art convolutional neural network with planar/reflection symmetry scores along multiple axes produced significant improvements in classification accuracy (1-10%) across categories. These results show that machine vision can be improved by discovering and fixing systematic differences from human vision. |
format | Online Article Text |
id | pubmed-7611439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-76114392021-07-31 Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification Pramod, RT Arun, SP IEEE Trans Pattern Anal Mach Intell Article Achieving human-like visual abilities is a holy grail for machine vision, yet precisely how insights from human vision can improve machines has remained unclear. Here, we demonstrate two key conceptual advances: First, we show that most machine vision models are systematically different from human object perception. To do so, we collected a large dataset of perceptual distances between isolated objects in humans and asked whether these perceptual data can be predicted by many common machine vision algorithms. We found that while the best algorithms explain ∼70% of the variance in the perceptual data, all the algorithms we tested make systematic errors on several types of objects. In particular, machine algorithms underestimated distances between symmetric objects compared to human perception. Second, we show that fixing these systematic biases can lead to substantial gains in classification performance. In particular, augmenting a state-of-the-art convolutional neural network with planar/reflection symmetry scores along multiple axes produced significant improvements in classification accuracy (1-10%) across categories. These results show that machine vision can be improved by discovering and fixing systematic differences from human vision. 2020-07-09 /pmc/articles/PMC7611439/ /pubmed/32750809 http://dx.doi.org/10.1109/TPAMI.2020.3008107 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Pramod, RT Arun, SP Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title | Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title_full | Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title_fullStr | Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title_full_unstemmed | Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title_short | Improving Machine Vision Using Human Perceptual Representations: The Case of Planar Reflection Symmetry for Object Classification |
title_sort | improving machine vision using human perceptual representations: the case of planar reflection symmetry for object classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611439/ https://www.ncbi.nlm.nih.gov/pubmed/32750809 http://dx.doi.org/10.1109/TPAMI.2020.3008107 |
work_keys_str_mv | AT pramodrt improvingmachinevisionusinghumanperceptualrepresentationsthecaseofplanarreflectionsymmetryforobjectclassification AT arunsp improvingmachinevisionusinghumanperceptualrepresentationsthecaseofplanarreflectionsymmetryforobjectclassification |