Cargando…

An efficient incremental learning mechanism for tracking concept drift in spam filtering

This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheu, Jyh-Jian, Chu, Ko-Tsung, Li, Nien-Feng, Lee, Cheng-Chi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/
https://www.ncbi.nlm.nih.gov/pubmed/28182691
http://dx.doi.org/10.1371/journal.pone.0171518
_version_ 1782506133602172928
author Sheu, Jyh-Jian
Chu, Ko-Tsung
Li, Nien-Feng
Lee, Cheng-Chi
author_facet Sheu, Jyh-Jian
Chu, Ko-Tsung
Li, Nien-Feng
Lee, Cheng-Chi
author_sort Sheu, Jyh-Jian
collection PubMed
description This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment.
format Online
Article
Text
id pubmed-5300145
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53001452017-02-28 An efficient incremental learning mechanism for tracking concept drift in spam filtering Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi PLoS One Research Article This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment. Public Library of Science 2017-02-09 /pmc/articles/PMC5300145/ /pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518 Text en © 2017 Sheu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sheu, Jyh-Jian
Chu, Ko-Tsung
Li, Nien-Feng
Lee, Cheng-Chi
An efficient incremental learning mechanism for tracking concept drift in spam filtering
title An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_full An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_fullStr An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_full_unstemmed An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_short An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_sort efficient incremental learning mechanism for tracking concept drift in spam filtering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/
https://www.ncbi.nlm.nih.gov/pubmed/28182691
http://dx.doi.org/10.1371/journal.pone.0171518
work_keys_str_mv AT sheujyhjian anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT chukotsung anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT linienfeng anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT leechengchi anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT sheujyhjian efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT chukotsung efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT linienfeng efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering
AT leechengchi efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering