Cargando…

Understanding Image Representations by Measuring Their Equivariance and Equivalence

Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aimed at filling this gap, we investigate two key mathematical properties of representations: equivariance and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lenc, Karel, Vedaldi, Andrea
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6510825/ https://www.ncbi.nlm.nih.gov/pubmed/31148885 http://dx.doi.org/10.1007/s11263-018-1098-y

_version_	1783417475976658944
author	Lenc, Karel Vedaldi, Andrea
author_facet	Lenc, Karel Vedaldi, Andrea
author_sort	Lenc, Karel
collection	PubMed
description	Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aimed at filling this gap, we investigate two key mathematical properties of representations: equivariance and equivalence. Equivariance studies how transformations of the input image are encoded by the representation, invariance being a special case where a transformation has no effect. Equivalence studies whether two representations, for example two different parameterizations of a CNN, two different layers, or two different CNN architectures, share the same visual information or not. A number of methods to establish these properties empirically are proposed, including introducing transformation and stitching layers in CNNs. These methods are then applied to popular representations to reveal insightful aspects of their structure, including clarifying at which layers in a CNN certain geometric invariances are achieved and how various CNN architectures differ. We identify several predictors of geometric and architectural compatibility, including the spatial resolution of the representation and the complexity and depth of the models. While the focus of the paper is theoretical, direct applications to structured-output regression are demonstrated too.
format	Online Article Text
id	pubmed-6510825
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-65108252019-05-28 Understanding Image Representations by Measuring Their Equivariance and Equivalence Lenc, Karel Vedaldi, Andrea Int J Comput Vis Article Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aimed at filling this gap, we investigate two key mathematical properties of representations: equivariance and equivalence. Equivariance studies how transformations of the input image are encoded by the representation, invariance being a special case where a transformation has no effect. Equivalence studies whether two representations, for example two different parameterizations of a CNN, two different layers, or two different CNN architectures, share the same visual information or not. A number of methods to establish these properties empirically are proposed, including introducing transformation and stitching layers in CNNs. These methods are then applied to popular representations to reveal insightful aspects of their structure, including clarifying at which layers in a CNN certain geometric invariances are achieved and how various CNN architectures differ. We identify several predictors of geometric and architectural compatibility, including the spatial resolution of the representation and the complexity and depth of the models. While the focus of the paper is theoretical, direct applications to structured-output regression are demonstrated too. Springer US 2018-05-18 2019 /pmc/articles/PMC6510825/ /pubmed/31148885 http://dx.doi.org/10.1007/s11263-018-1098-y Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Article Lenc, Karel Vedaldi, Andrea Understanding Image Representations by Measuring Their Equivariance and Equivalence
title	Understanding Image Representations by Measuring Their Equivariance and Equivalence
title_full	Understanding Image Representations by Measuring Their Equivariance and Equivalence
title_fullStr	Understanding Image Representations by Measuring Their Equivariance and Equivalence
title_full_unstemmed	Understanding Image Representations by Measuring Their Equivariance and Equivalence
title_short	Understanding Image Representations by Measuring Their Equivariance and Equivalence
title_sort	understanding image representations by measuring their equivariance and equivalence
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6510825/ https://www.ncbi.nlm.nih.gov/pubmed/31148885 http://dx.doi.org/10.1007/s11263-018-1098-y
work_keys_str_mv	AT lenckarel understandingimagerepresentationsbymeasuringtheirequivarianceandequivalence AT vedaldiandrea understandingimagerepresentationsbymeasuringtheirequivarianceandequivalence

Understanding Image Representations by Measuring Their Equivariance and Equivalence

Ejemplares similares