Cargando…

On the Use of Normalized Compression Distances for Image Similarity Detection

This paper investigates the usefulness of the normalized compression distance (NCD) for image similarity detection. Instead of the direct NCD between images, the paper considers the correlation between NCD based feature vectors extracted for each image. The vectors are derived by computing the NCD b...

Descripción completa

Detalles Bibliográficos
Autores principales: Coltuc, Dinu, Datcu, Mihai, Coltuc, Daniela
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512663/
https://www.ncbi.nlm.nih.gov/pubmed/33265190
http://dx.doi.org/10.3390/e20020099
_version_ 1783586210012200960
author Coltuc, Dinu
Datcu, Mihai
Coltuc, Daniela
author_facet Coltuc, Dinu
Datcu, Mihai
Coltuc, Daniela
author_sort Coltuc, Dinu
collection PubMed
description This paper investigates the usefulness of the normalized compression distance (NCD) for image similarity detection. Instead of the direct NCD between images, the paper considers the correlation between NCD based feature vectors extracted for each image. The vectors are derived by computing the NCD between the original image and sequences of translated (rotated) versions. Feature vectors for simple transforms (circular translations on horizontal, vertical, diagonal directions and rotations around image center) and several standard compressors are generated and tested in a very simple experiment of similarity detection between the original image and two filtered versions (median and moving average). The promising vector configurations (geometric transform, lossless compressor) are further tested for similarity detection on the 24 images of the Kodak set subject to some common image processing. While the direct computation of NCD fails to detect image similarity even in the case of simple median and moving average filtering in 3 × 3 windows, for certain transforms and compressors, the proposed approach appears to provide robustness at similarity detection against smoothing, lossy compression, contrast enhancement, noise addition and some robustness against geometrical transforms (scaling, cropping and rotation).
format Online
Article
Text
id pubmed-7512663
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75126632020-11-09 On the Use of Normalized Compression Distances for Image Similarity Detection Coltuc, Dinu Datcu, Mihai Coltuc, Daniela Entropy (Basel) Article This paper investigates the usefulness of the normalized compression distance (NCD) for image similarity detection. Instead of the direct NCD between images, the paper considers the correlation between NCD based feature vectors extracted for each image. The vectors are derived by computing the NCD between the original image and sequences of translated (rotated) versions. Feature vectors for simple transforms (circular translations on horizontal, vertical, diagonal directions and rotations around image center) and several standard compressors are generated and tested in a very simple experiment of similarity detection between the original image and two filtered versions (median and moving average). The promising vector configurations (geometric transform, lossless compressor) are further tested for similarity detection on the 24 images of the Kodak set subject to some common image processing. While the direct computation of NCD fails to detect image similarity even in the case of simple median and moving average filtering in 3 × 3 windows, for certain transforms and compressors, the proposed approach appears to provide robustness at similarity detection against smoothing, lossy compression, contrast enhancement, noise addition and some robustness against geometrical transforms (scaling, cropping and rotation). MDPI 2018-01-31 /pmc/articles/PMC7512663/ /pubmed/33265190 http://dx.doi.org/10.3390/e20020099 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Coltuc, Dinu
Datcu, Mihai
Coltuc, Daniela
On the Use of Normalized Compression Distances for Image Similarity Detection
title On the Use of Normalized Compression Distances for Image Similarity Detection
title_full On the Use of Normalized Compression Distances for Image Similarity Detection
title_fullStr On the Use of Normalized Compression Distances for Image Similarity Detection
title_full_unstemmed On the Use of Normalized Compression Distances for Image Similarity Detection
title_short On the Use of Normalized Compression Distances for Image Similarity Detection
title_sort on the use of normalized compression distances for image similarity detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512663/
https://www.ncbi.nlm.nih.gov/pubmed/33265190
http://dx.doi.org/10.3390/e20020099
work_keys_str_mv AT coltucdinu ontheuseofnormalizedcompressiondistancesforimagesimilaritydetection
AT datcumihai ontheuseofnormalizedcompressiondistancesforimagesimilaritydetection
AT coltucdaniela ontheuseofnormalizedcompressiondistancesforimagesimilaritydetection