Cargando…

Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation

Geometric deep learning (GDL) generalizes convolutional neural networks (CNNs) to non-Euclidean domains. In this work, a GDL technique, allowing the application of CNN on graphs, is examined. It defines convolutional filters with the use of the Gaussian mixture model (GMM). As those filters are defi...

Descripción completa

Detalles Bibliográficos
Autores principales: Tomczyk, Arkadiusz, Szczepaniak, Piotr S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6961017/
https://www.ncbi.nlm.nih.gov/pubmed/31847162
http://dx.doi.org/10.3390/s19245510
_version_ 1783487903087722496
author Tomczyk, Arkadiusz
Szczepaniak, Piotr S.
author_facet Tomczyk, Arkadiusz
Szczepaniak, Piotr S.
author_sort Tomczyk, Arkadiusz
collection PubMed
description Geometric deep learning (GDL) generalizes convolutional neural networks (CNNs) to non-Euclidean domains. In this work, a GDL technique, allowing the application of CNN on graphs, is examined. It defines convolutional filters with the use of the Gaussian mixture model (GMM). As those filters are defined in continuous space, they can be easily rotated without the need for some additional interpolation. This, in turn, allows constructing systems having rotation equivariance property. The characteristic of the proposed approach is illustrated with the problem of ear detection, which is of great importance in biometric systems enabling image based, discrete human identification. The analyzed graphs were constructed taking into account superpixels representing image content. This kind of representation has several advantages. On the one hand, it significantly reduces the amount of processed data, allowing building simpler and more effective models. On the other hand, it seems to be closer to the conscious process of human image understanding as it does not operate on millions of pixels. The contributions of the paper lie both in GDL application area extension (semantic segmentation of the images) and in the novel concept of trained filter transformations. We show that even significantly reduced information about image content and a relatively simple, in comparison with classic CNN, model (smaller number of parameters and significantly faster processing) allows obtaining detection results on the quality level similar to those reported in the literature on the UBEAR dataset. Moreover, we show experimentally that the proposed approach possesses in fact the rotation equivariance property allowing detecting rotated structures without the need for labor consuming training on all rotated and non-rotated images.
format Online
Article
Text
id pubmed-6961017
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-69610172020-01-24 Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation Tomczyk, Arkadiusz Szczepaniak, Piotr S. Sensors (Basel) Article Geometric deep learning (GDL) generalizes convolutional neural networks (CNNs) to non-Euclidean domains. In this work, a GDL technique, allowing the application of CNN on graphs, is examined. It defines convolutional filters with the use of the Gaussian mixture model (GMM). As those filters are defined in continuous space, they can be easily rotated without the need for some additional interpolation. This, in turn, allows constructing systems having rotation equivariance property. The characteristic of the proposed approach is illustrated with the problem of ear detection, which is of great importance in biometric systems enabling image based, discrete human identification. The analyzed graphs were constructed taking into account superpixels representing image content. This kind of representation has several advantages. On the one hand, it significantly reduces the amount of processed data, allowing building simpler and more effective models. On the other hand, it seems to be closer to the conscious process of human image understanding as it does not operate on millions of pixels. The contributions of the paper lie both in GDL application area extension (semantic segmentation of the images) and in the novel concept of trained filter transformations. We show that even significantly reduced information about image content and a relatively simple, in comparison with classic CNN, model (smaller number of parameters and significantly faster processing) allows obtaining detection results on the quality level similar to those reported in the literature on the UBEAR dataset. Moreover, we show experimentally that the proposed approach possesses in fact the rotation equivariance property allowing detecting rotated structures without the need for labor consuming training on all rotated and non-rotated images. MDPI 2019-12-13 /pmc/articles/PMC6961017/ /pubmed/31847162 http://dx.doi.org/10.3390/s19245510 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tomczyk, Arkadiusz
Szczepaniak, Piotr S.
Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title_full Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title_fullStr Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title_full_unstemmed Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title_short Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation
title_sort ear detection using convolutional neural network on graphs with filter rotation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6961017/
https://www.ncbi.nlm.nih.gov/pubmed/31847162
http://dx.doi.org/10.3390/s19245510
work_keys_str_mv AT tomczykarkadiusz eardetectionusingconvolutionalneuralnetworkongraphswithfilterrotation
AT szczepaniakpiotrs eardetectionusingconvolutionalneuralnetworkongraphswithfilterrotation