Cargando…

A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection

With the recent influx of technology, Unsolicited Bulk Emails (UBEs) have become a potential problem, leaving computer users and organizations at the risk of brand, data, and financial loss. In this paper, we present a novel bio-inspired hybrid parallel optimization algorithm (Cuckoo-Firefly-GR), wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Gangavarapu, Tushaar, Jaidhar, C. D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304023/
http://dx.doi.org/10.1007/978-3-030-50420-5_18
_version_ 1783548182234398720
author Gangavarapu, Tushaar
Jaidhar, C. D.
author_facet Gangavarapu, Tushaar
Jaidhar, C. D.
author_sort Gangavarapu, Tushaar
collection PubMed
description With the recent influx of technology, Unsolicited Bulk Emails (UBEs) have become a potential problem, leaving computer users and organizations at the risk of brand, data, and financial loss. In this paper, we present a novel bio-inspired hybrid parallel optimization algorithm (Cuckoo-Firefly-GR), which combines Genetic Replacement (GR) of low fitness individuals with a hybrid of Cuckoo Search (CS) and Firefly (FA) optimizations. Cuckoo-Firefly-GR not only employs the random walk in CS, but also uses mechanisms in FA to generate and select fitter individuals. The content- and behavior-based features of emails used in the existing works, along with Doc2Vec features of the email body are employed to extract the syntactic and semantic information in the emails. By establishing an optimal balance between intensification and diversification, and reaching global optimization using two metaheuristics, we argue that the proposed algorithm significantly improves the performance of UBE detection, by selecting the most discriminative feature subspace. This study presents significant observations from the extensive evaluations on UBE corpora of 3, 844 emails, that underline the efficiency and superiority of our proposed Cuckoo-Firefly-GR over the base optimizations (Cuckoo-GR and Firefly-GR), dense autoencoders, recurrent neural autoencoders, and several state-of-the-art methods. Furthermore, the instructive feature subset obtained using the proposed Cuckoo-Firefly-GR, when classified using a dense neural model, achieved an accuracy of [Formula: see text].
format Online
Article
Text
id pubmed-7304023
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73040232020-06-19 A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection Gangavarapu, Tushaar Jaidhar, C. D. Computational Science – ICCS 2020 Article With the recent influx of technology, Unsolicited Bulk Emails (UBEs) have become a potential problem, leaving computer users and organizations at the risk of brand, data, and financial loss. In this paper, we present a novel bio-inspired hybrid parallel optimization algorithm (Cuckoo-Firefly-GR), which combines Genetic Replacement (GR) of low fitness individuals with a hybrid of Cuckoo Search (CS) and Firefly (FA) optimizations. Cuckoo-Firefly-GR not only employs the random walk in CS, but also uses mechanisms in FA to generate and select fitter individuals. The content- and behavior-based features of emails used in the existing works, along with Doc2Vec features of the email body are employed to extract the syntactic and semantic information in the emails. By establishing an optimal balance between intensification and diversification, and reaching global optimization using two metaheuristics, we argue that the proposed algorithm significantly improves the performance of UBE detection, by selecting the most discriminative feature subspace. This study presents significant observations from the extensive evaluations on UBE corpora of 3, 844 emails, that underline the efficiency and superiority of our proposed Cuckoo-Firefly-GR over the base optimizations (Cuckoo-GR and Firefly-GR), dense autoencoders, recurrent neural autoencoders, and several state-of-the-art methods. Furthermore, the instructive feature subset obtained using the proposed Cuckoo-Firefly-GR, when classified using a dense neural model, achieved an accuracy of [Formula: see text]. 2020-05-22 /pmc/articles/PMC7304023/ http://dx.doi.org/10.1007/978-3-030-50420-5_18 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Gangavarapu, Tushaar
Jaidhar, C. D.
A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title_full A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title_fullStr A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title_full_unstemmed A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title_short A Novel Bio-inspired Hybrid Metaheuristic for Unsolicited Bulk Email Detection
title_sort novel bio-inspired hybrid metaheuristic for unsolicited bulk email detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304023/
http://dx.doi.org/10.1007/978-3-030-50420-5_18
work_keys_str_mv AT gangavaraputushaar anovelbioinspiredhybridmetaheuristicforunsolicitedbulkemaildetection
AT jaidharcd anovelbioinspiredhybridmetaheuristicforunsolicitedbulkemaildetection
AT gangavaraputushaar novelbioinspiredhybridmetaheuristicforunsolicitedbulkemaildetection
AT jaidharcd novelbioinspiredhybridmetaheuristicforunsolicitedbulkemaildetection