Cargando…

Deep neural network and model-based clustering technique for forensic electronic mail author attribution

Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural n...

Descripción completa

Detalles Bibliográficos
Autores principales: Apoorva, K. A., Sangeetha, S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890392/
https://www.ncbi.nlm.nih.gov/pubmed/33619463
http://dx.doi.org/10.1007/s42452-020-04127-6
_version_ 1783652506828537856
author Apoorva, K. A.
Sangeetha, S.
author_facet Apoorva, K. A.
Sangeetha, S.
author_sort Apoorva, K. A.
collection PubMed
description Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural networks and model-based clustering techniques. It is perceived that stylometry features in the authorship identification have gained a lot of importance as it enhances the author attribution task's accuracy. The experiments were performed on a publicly available benchmark Enron dataset, considering many authors. The proposed model achieves an accuracy of 94% on five authors, 90% on ten authors, 86% on 25 authors and 75% on the entire dataset for the Deep Neural Network technique, which is a good measure of accuracy on a highly imbalanced data. The second cluster-based technique yielded an excellent 86% accuracy on the entire dataset, considering the authors' number based on their contribution to the aggregate data.
format Online
Article
Text
id pubmed-7890392
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-78903922021-02-18 Deep neural network and model-based clustering technique for forensic electronic mail author attribution Apoorva, K. A. Sangeetha, S. SN Appl Sci Research Article Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural networks and model-based clustering techniques. It is perceived that stylometry features in the authorship identification have gained a lot of importance as it enhances the author attribution task's accuracy. The experiments were performed on a publicly available benchmark Enron dataset, considering many authors. The proposed model achieves an accuracy of 94% on five authors, 90% on ten authors, 86% on 25 authors and 75% on the entire dataset for the Deep Neural Network technique, which is a good measure of accuracy on a highly imbalanced data. The second cluster-based technique yielded an excellent 86% accuracy on the entire dataset, considering the authors' number based on their contribution to the aggregate data. Springer International Publishing 2021-02-18 2021 /pmc/articles/PMC7890392/ /pubmed/33619463 http://dx.doi.org/10.1007/s42452-020-04127-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Apoorva, K. A.
Sangeetha, S.
Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title_full Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title_fullStr Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title_full_unstemmed Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title_short Deep neural network and model-based clustering technique for forensic electronic mail author attribution
title_sort deep neural network and model-based clustering technique for forensic electronic mail author attribution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890392/
https://www.ncbi.nlm.nih.gov/pubmed/33619463
http://dx.doi.org/10.1007/s42452-020-04127-6
work_keys_str_mv AT apoorvaka deepneuralnetworkandmodelbasedclusteringtechniqueforforensicelectronicmailauthorattribution
AT sangeethas deepneuralnetworkandmodelbasedclusteringtechniqueforforensicelectronicmailauthorattribution