Cargando…
An efficient incremental learning mechanism for tracking concept drift in spam filtering
This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/ https://www.ncbi.nlm.nih.gov/pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518 |
_version_ | 1782506133602172928 |
---|---|
author | Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi |
author_facet | Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi |
author_sort | Sheu, Jyh-Jian |
collection | PubMed |
description | This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment. |
format | Online Article Text |
id | pubmed-5300145 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53001452017-02-28 An efficient incremental learning mechanism for tracking concept drift in spam filtering Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi PLoS One Research Article This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment. Public Library of Science 2017-02-09 /pmc/articles/PMC5300145/ /pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518 Text en © 2017 Sheu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title | An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title_full | An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title_fullStr | An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title_full_unstemmed | An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title_short | An efficient incremental learning mechanism for tracking concept drift in spam filtering |
title_sort | efficient incremental learning mechanism for tracking concept drift in spam filtering |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/ https://www.ncbi.nlm.nih.gov/pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518 |
work_keys_str_mv | AT sheujyhjian anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT chukotsung anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT linienfeng anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT leechengchi anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT sheujyhjian efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT chukotsung efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT linienfeng efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT leechengchi efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering |