Cargando…

Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE

For a suspected forgery that involves the falsification of a document or its contents, the investigator will primarily analyze the document’s paper and ink in order to establish the authenticity of the subject under investigation. As a non-destructive and contactless technique, Hyperspectral Imaging...

Descripción completa

Detalles Bibliográficos
Autores principales: Melit Devassy, Binu, George, Sony, Nussbaum, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321027/
https://www.ncbi.nlm.nih.gov/pubmed/34460731
http://dx.doi.org/10.3390/jimaging6050029
_version_ 1783730753833533440
author Melit Devassy, Binu
George, Sony
Nussbaum, Peter
author_facet Melit Devassy, Binu
George, Sony
Nussbaum, Peter
author_sort Melit Devassy, Binu
collection PubMed
description For a suspected forgery that involves the falsification of a document or its contents, the investigator will primarily analyze the document’s paper and ink in order to establish the authenticity of the subject under investigation. As a non-destructive and contactless technique, Hyperspectral Imaging (HSI) is gaining popularity in the field of forensic document analysis. HSI returns more information compared to conventional three channel imaging systems due to the vast number of narrowband images recorded across the electromagnetic spectrum. As a result, HSI can provide better classification results. In this publication, we present results of an approach known as the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm, which we have applied to HSI paper data analysis. Even though t-SNE has been widely accepted as a method for dimensionality reduction and visualization of high dimensional data, its usefulness has not yet been evaluated for the classification of paper data. In this research, we present a hyperspectral dataset of paper samples, and evaluate the clustering quality of the proposed method both visually and quantitatively. The t-SNE algorithm shows exceptional discrimination power when compared to traditional PCA with k-means clustering, in both visual and quantitative evaluations.
format Online
Article
Text
id pubmed-8321027
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83210272021-08-26 Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE Melit Devassy, Binu George, Sony Nussbaum, Peter J Imaging Article For a suspected forgery that involves the falsification of a document or its contents, the investigator will primarily analyze the document’s paper and ink in order to establish the authenticity of the subject under investigation. As a non-destructive and contactless technique, Hyperspectral Imaging (HSI) is gaining popularity in the field of forensic document analysis. HSI returns more information compared to conventional three channel imaging systems due to the vast number of narrowband images recorded across the electromagnetic spectrum. As a result, HSI can provide better classification results. In this publication, we present results of an approach known as the t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm, which we have applied to HSI paper data analysis. Even though t-SNE has been widely accepted as a method for dimensionality reduction and visualization of high dimensional data, its usefulness has not yet been evaluated for the classification of paper data. In this research, we present a hyperspectral dataset of paper samples, and evaluate the clustering quality of the proposed method both visually and quantitatively. The t-SNE algorithm shows exceptional discrimination power when compared to traditional PCA with k-means clustering, in both visual and quantitative evaluations. MDPI 2020-05-05 /pmc/articles/PMC8321027/ /pubmed/34460731 http://dx.doi.org/10.3390/jimaging6050029 Text en © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Melit Devassy, Binu
George, Sony
Nussbaum, Peter
Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title_full Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title_fullStr Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title_full_unstemmed Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title_short Unsupervised Clustering of Hyperspectral Paper Data Using t-SNE
title_sort unsupervised clustering of hyperspectral paper data using t-sne
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321027/
https://www.ncbi.nlm.nih.gov/pubmed/34460731
http://dx.doi.org/10.3390/jimaging6050029
work_keys_str_mv AT melitdevassybinu unsupervisedclusteringofhyperspectralpaperdatausingtsne
AT georgesony unsupervisedclusteringofhyperspectralpaperdatausingtsne
AT nussbaumpeter unsupervisedclusteringofhyperspectralpaperdatausingtsne