Cargando…
Deep neural network and model-based clustering technique for forensic electronic mail author attribution
Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural n...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890392/ https://www.ncbi.nlm.nih.gov/pubmed/33619463 http://dx.doi.org/10.1007/s42452-020-04127-6 |
_version_ | 1783652506828537856 |
---|---|
author | Apoorva, K. A. Sangeetha, S. |
author_facet | Apoorva, K. A. Sangeetha, S. |
author_sort | Apoorva, K. A. |
collection | PubMed |
description | Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural networks and model-based clustering techniques. It is perceived that stylometry features in the authorship identification have gained a lot of importance as it enhances the author attribution task's accuracy. The experiments were performed on a publicly available benchmark Enron dataset, considering many authors. The proposed model achieves an accuracy of 94% on five authors, 90% on ten authors, 86% on 25 authors and 75% on the entire dataset for the Deep Neural Network technique, which is a good measure of accuracy on a highly imbalanced data. The second cluster-based technique yielded an excellent 86% accuracy on the entire dataset, considering the authors' number based on their contribution to the aggregate data. |
format | Online Article Text |
id | pubmed-7890392 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-78903922021-02-18 Deep neural network and model-based clustering technique for forensic electronic mail author attribution Apoorva, K. A. Sangeetha, S. SN Appl Sci Research Article Electronic mail is the primary source of different cyber scams. Identifying the author of electronic mail is essential. It forms significant documentary evidence in the field of digital forensics. This paper presents a model for email author identification (or) attribution by utilizing deep neural networks and model-based clustering techniques. It is perceived that stylometry features in the authorship identification have gained a lot of importance as it enhances the author attribution task's accuracy. The experiments were performed on a publicly available benchmark Enron dataset, considering many authors. The proposed model achieves an accuracy of 94% on five authors, 90% on ten authors, 86% on 25 authors and 75% on the entire dataset for the Deep Neural Network technique, which is a good measure of accuracy on a highly imbalanced data. The second cluster-based technique yielded an excellent 86% accuracy on the entire dataset, considering the authors' number based on their contribution to the aggregate data. Springer International Publishing 2021-02-18 2021 /pmc/articles/PMC7890392/ /pubmed/33619463 http://dx.doi.org/10.1007/s42452-020-04127-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Apoorva, K. A. Sangeetha, S. Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title | Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title_full | Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title_fullStr | Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title_full_unstemmed | Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title_short | Deep neural network and model-based clustering technique for forensic electronic mail author attribution |
title_sort | deep neural network and model-based clustering technique for forensic electronic mail author attribution |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890392/ https://www.ncbi.nlm.nih.gov/pubmed/33619463 http://dx.doi.org/10.1007/s42452-020-04127-6 |
work_keys_str_mv | AT apoorvaka deepneuralnetworkandmodelbasedclusteringtechniqueforforensicelectronicmailauthorattribution AT sangeethas deepneuralnetworkandmodelbasedclusteringtechniqueforforensicelectronicmailauthorattribution |