Cargando…

Visual Features and Their Own Optical Flow

Symmetries, invariances and conservation equations have always been an invaluable guide in Science to model natural phenomena through simple yet effective relations. For instance, in computer vision, translation equivariance is typically a built-in property of neural architectures that are used to s...

Descripción completa

Detalles Bibliográficos
Autores principales: Betti, Alessandro, Boccignone, Giuseppe, Faggi, Lapo, Gori, Marco, Melacci, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672218/
https://www.ncbi.nlm.nih.gov/pubmed/34927064
http://dx.doi.org/10.3389/frai.2021.768516
_version_ 1784615314526306304
author Betti, Alessandro
Boccignone, Giuseppe
Faggi, Lapo
Gori, Marco
Melacci, Stefano
author_facet Betti, Alessandro
Boccignone, Giuseppe
Faggi, Lapo
Gori, Marco
Melacci, Stefano
author_sort Betti, Alessandro
collection PubMed
description Symmetries, invariances and conservation equations have always been an invaluable guide in Science to model natural phenomena through simple yet effective relations. For instance, in computer vision, translation equivariance is typically a built-in property of neural architectures that are used to solve visual tasks; networks with computational layers implementing such a property are known as Convolutional Neural Networks (CNNs). This kind of mathematical symmetry, as well as many others that have been recently studied, are typically generated by some underlying group of transformations (translations in the case of CNNs, rotations, etc.) and are particularly suitable to process highly structured data such as molecules or chemical compounds which are known to possess those specific symmetries. When dealing with video streams, common built-in equivariances are able to handle only a small fraction of the broad spectrum of transformations encoded in the visual stimulus and, therefore, the corresponding neural architectures have to resort to a huge amount of supervision in order to achieve good generalization capabilities. In the paper we formulate a theory on the development of visual features that is based on the idea that movement itself provides trajectories on which to impose consistency. We introduce the principle of Material Point Invariance which states that each visual feature is invariant with respect to the associated optical flow, so that features and corresponding velocities are an indissoluble pair. Then, we discuss the interaction of features and velocities and show that certain motion invariance traits could be regarded as a generalization of the classical concept of affordance. These analyses of feature-velocity interactions and their invariance properties leads to a visual field theory which expresses the dynamical constraints of motion coherence and might lead to discover the joint evolution of the visual features along with the associated optical flows.
format Online
Article
Text
id pubmed-8672218
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-86722182021-12-16 Visual Features and Their Own Optical Flow Betti, Alessandro Boccignone, Giuseppe Faggi, Lapo Gori, Marco Melacci, Stefano Front Artif Intell Artificial Intelligence Symmetries, invariances and conservation equations have always been an invaluable guide in Science to model natural phenomena through simple yet effective relations. For instance, in computer vision, translation equivariance is typically a built-in property of neural architectures that are used to solve visual tasks; networks with computational layers implementing such a property are known as Convolutional Neural Networks (CNNs). This kind of mathematical symmetry, as well as many others that have been recently studied, are typically generated by some underlying group of transformations (translations in the case of CNNs, rotations, etc.) and are particularly suitable to process highly structured data such as molecules or chemical compounds which are known to possess those specific symmetries. When dealing with video streams, common built-in equivariances are able to handle only a small fraction of the broad spectrum of transformations encoded in the visual stimulus and, therefore, the corresponding neural architectures have to resort to a huge amount of supervision in order to achieve good generalization capabilities. In the paper we formulate a theory on the development of visual features that is based on the idea that movement itself provides trajectories on which to impose consistency. We introduce the principle of Material Point Invariance which states that each visual feature is invariant with respect to the associated optical flow, so that features and corresponding velocities are an indissoluble pair. Then, we discuss the interaction of features and velocities and show that certain motion invariance traits could be regarded as a generalization of the classical concept of affordance. These analyses of feature-velocity interactions and their invariance properties leads to a visual field theory which expresses the dynamical constraints of motion coherence and might lead to discover the joint evolution of the visual features along with the associated optical flows. Frontiers Media S.A. 2021-12-01 /pmc/articles/PMC8672218/ /pubmed/34927064 http://dx.doi.org/10.3389/frai.2021.768516 Text en Copyright © 2021 Betti, Boccignone, Faggi, Gori and Melacci. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Betti, Alessandro
Boccignone, Giuseppe
Faggi, Lapo
Gori, Marco
Melacci, Stefano
Visual Features and Their Own Optical Flow
title Visual Features and Their Own Optical Flow
title_full Visual Features and Their Own Optical Flow
title_fullStr Visual Features and Their Own Optical Flow
title_full_unstemmed Visual Features and Their Own Optical Flow
title_short Visual Features and Their Own Optical Flow
title_sort visual features and their own optical flow
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672218/
https://www.ncbi.nlm.nih.gov/pubmed/34927064
http://dx.doi.org/10.3389/frai.2021.768516
work_keys_str_mv AT bettialessandro visualfeaturesandtheirownopticalflow
AT boccignonegiuseppe visualfeaturesandtheirownopticalflow
AT faggilapo visualfeaturesandtheirownopticalflow
AT gorimarco visualfeaturesandtheirownopticalflow
AT melaccistefano visualfeaturesandtheirownopticalflow