Cargando…

Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN

Recently, both single modality and cross modality near-duplicate image detection tasks have received wide attention in the community of pattern recognition and computer vision. Existing deep neural networks-based methods have achieved remarkable performance in this task. However, most of the methods...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Yi, Zhang, Shizhou, Li, Ying, Zhang, Yanning
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7794762/ https://www.ncbi.nlm.nih.gov/pubmed/33401740 http://dx.doi.org/10.3390/s21010255

_version_	1783634285300809728
author	Zhang, Yi Zhang, Shizhou Li, Ying Zhang, Yanning
author_facet	Zhang, Yi Zhang, Shizhou Li, Ying Zhang, Yanning
author_sort	Zhang, Yi
collection	PubMed
description	Recently, both single modality and cross modality near-duplicate image detection tasks have received wide attention in the community of pattern recognition and computer vision. Existing deep neural networks-based methods have achieved remarkable performance in this task. However, most of the methods mainly focus on the learning of each image from the image pair, thus leading to less use of the information between the near duplicate image pairs to some extent. In this paper, to make more use of the correlations between image pairs, we propose a spatial transformer comparing convolutional neural network (CNN) model to compare near-duplicate image pairs. Specifically, we firstly propose a comparing CNN framework, which is equipped with a cross-stream to fully learn the correlation information between image pairs, while considering the features of each image. Furthermore, to deal with the local deformations led by cropping, translation, scaling, and non-rigid transformations, we additionally introduce a spatial transformer comparing CNN model by incorporating a spatial transformer module to the comparing CNN architecture. To demonstrate the effectiveness of the proposed method on both the single-modality and cross-modality (Optical-InfraRed) near-duplicate image pair detection tasks, we conduct extensive experiments on three popular benchmark datasets, namely CaliforniaND (ND means near duplicate), Mir-Flickr Near Duplicate, and TNO Multi-band Image Data Collection. The experimental results show that the proposed method can achieve superior performance compared with many state-of-the-art methods on both tasks.
format	Online Article Text
id	pubmed-7794762
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77947622021-01-10 Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN Zhang, Yi Zhang, Shizhou Li, Ying Zhang, Yanning Sensors (Basel) Article Recently, both single modality and cross modality near-duplicate image detection tasks have received wide attention in the community of pattern recognition and computer vision. Existing deep neural networks-based methods have achieved remarkable performance in this task. However, most of the methods mainly focus on the learning of each image from the image pair, thus leading to less use of the information between the near duplicate image pairs to some extent. In this paper, to make more use of the correlations between image pairs, we propose a spatial transformer comparing convolutional neural network (CNN) model to compare near-duplicate image pairs. Specifically, we firstly propose a comparing CNN framework, which is equipped with a cross-stream to fully learn the correlation information between image pairs, while considering the features of each image. Furthermore, to deal with the local deformations led by cropping, translation, scaling, and non-rigid transformations, we additionally introduce a spatial transformer comparing CNN model by incorporating a spatial transformer module to the comparing CNN architecture. To demonstrate the effectiveness of the proposed method on both the single-modality and cross-modality (Optical-InfraRed) near-duplicate image pair detection tasks, we conduct extensive experiments on three popular benchmark datasets, namely CaliforniaND (ND means near duplicate), Mir-Flickr Near Duplicate, and TNO Multi-band Image Data Collection. The experimental results show that the proposed method can achieve superior performance compared with many state-of-the-art methods on both tasks. MDPI 2021-01-02 /pmc/articles/PMC7794762/ /pubmed/33401740 http://dx.doi.org/10.3390/s21010255 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Yi Zhang, Shizhou Li, Ying Zhang, Yanning Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title	Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title_full	Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title_fullStr	Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title_full_unstemmed	Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title_short	Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN
title_sort	single- and cross-modality near duplicate image pairs detection via spatial transformer comparing cnn
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7794762/ https://www.ncbi.nlm.nih.gov/pubmed/33401740 http://dx.doi.org/10.3390/s21010255
work_keys_str_mv	AT zhangyi singleandcrossmodalitynearduplicateimagepairsdetectionviaspatialtransformercomparingcnn AT zhangshizhou singleandcrossmodalitynearduplicateimagepairsdetectionviaspatialtransformercomparingcnn AT liying singleandcrossmodalitynearduplicateimagepairsdetectionviaspatialtransformercomparingcnn AT zhangyanning singleandcrossmodalitynearduplicateimagepairsdetectionviaspatialtransformercomparingcnn

Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN

Ejemplares similares