Cargando…

An efficient incremental learning mechanism for tracking concept drift in spam filtering

This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sheu, Jyh-Jian, Chu, Ko-Tsung, Li, Nien-Feng, Lee, Cheng-Chi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/ https://www.ncbi.nlm.nih.gov/pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518

_version_	1782506133602172928
author	Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi
author_facet	Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi
author_sort	Sheu, Jyh-Jian
collection	PubMed
description	This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment.
format	Online Article Text
id	pubmed-5300145
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-53001452017-02-28 An efficient incremental learning mechanism for tracking concept drift in spam filtering Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi PLoS One Research Article This research manages in-depth analysis on the knowledge about spams and expects to propose an efficient spam filtering method with the ability of adapting to the dynamic environment. We focus on the analysis of email’s header and apply decision tree data mining technique to look for the association rules about spams. Then, we propose an efficient systematic filtering method based on these association rules. Our systematic method has the following major advantages: (1) Checking only the header sections of emails, which is different from those spam filtering methods at present that have to analyze fully the email’s content. Meanwhile, the email filtering accuracy is expected to be enhanced. (2) Regarding the solution to the problem of concept drift, we propose a window-based technique to estimate for the condition of concept drift for each unknown email, which will help our filtering method in recognizing the occurrence of spam. (3) We propose an incremental learning mechanism for our filtering method to strengthen the ability of adapting to the dynamic environment. Public Library of Science 2017-02-09 /pmc/articles/PMC5300145/ /pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518 Text en © 2017 Sheu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Sheu, Jyh-Jian Chu, Ko-Tsung Li, Nien-Feng Lee, Cheng-Chi An efficient incremental learning mechanism for tracking concept drift in spam filtering
title	An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_full	An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_fullStr	An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_full_unstemmed	An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_short	An efficient incremental learning mechanism for tracking concept drift in spam filtering
title_sort	efficient incremental learning mechanism for tracking concept drift in spam filtering
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300145/ https://www.ncbi.nlm.nih.gov/pubmed/28182691 http://dx.doi.org/10.1371/journal.pone.0171518
work_keys_str_mv	AT sheujyhjian anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT chukotsung anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT linienfeng anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT leechengchi anefficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT sheujyhjian efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT chukotsung efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT linienfeng efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering AT leechengchi efficientincrementallearningmechanismfortrackingconceptdriftinspamfiltering

An efficient incremental learning mechanism for tracking concept drift in spam filtering

Ejemplares similares