Cargando…

Building an associative classifier with multiple minimum supports

Classification is one of the most important technologies used in data mining. Researchers have recently proposed several classification techniques based on the concept of association rules (also known as CBA-based methods). Experimental evaluations on these studies show that in average the CBA-based...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Li-Yu, Hu, Ya-Han, Tsai, Chih-Fong, Wang, Jian-Shian, Huang, Min-Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4844591/
https://www.ncbi.nlm.nih.gov/pubmed/27186492
http://dx.doi.org/10.1186/s40064-016-2153-1
_version_ 1782428798677942272
author Hu, Li-Yu
Hu, Ya-Han
Tsai, Chih-Fong
Wang, Jian-Shian
Huang, Min-Wei
author_facet Hu, Li-Yu
Hu, Ya-Han
Tsai, Chih-Fong
Wang, Jian-Shian
Huang, Min-Wei
author_sort Hu, Li-Yu
collection PubMed
description Classification is one of the most important technologies used in data mining. Researchers have recently proposed several classification techniques based on the concept of association rules (also known as CBA-based methods). Experimental evaluations on these studies show that in average the CBA-based approaches can yield higher accuracy than some of conventional classification methods. However, conventional CBA-based methods adopt a single threshold of minimum support for all items, resulting in the rare item problem. In other words, the classification rules will only contain frequent items if minimum support (minsup) is set as high or any combinations of items are discovered as frequent if minsup is set as low. To solve this problem, this paper proposes a novel CBA-based method called MMSCBA, which considers the concept of multiple minimum supports (MMSs). Based on MMSs, different classification rules appear in the corresponding minsups. Several experiments were conducted with six real-world datasets selected from the UCI Machine Learning Repository. The results show that MMSCBA achieves higher accuracy than conventional CBA methods, especially when the dataset contains rare items.
format Online
Article
Text
id pubmed-4844591
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-48445912016-05-16 Building an associative classifier with multiple minimum supports Hu, Li-Yu Hu, Ya-Han Tsai, Chih-Fong Wang, Jian-Shian Huang, Min-Wei Springerplus Research Classification is one of the most important technologies used in data mining. Researchers have recently proposed several classification techniques based on the concept of association rules (also known as CBA-based methods). Experimental evaluations on these studies show that in average the CBA-based approaches can yield higher accuracy than some of conventional classification methods. However, conventional CBA-based methods adopt a single threshold of minimum support for all items, resulting in the rare item problem. In other words, the classification rules will only contain frequent items if minimum support (minsup) is set as high or any combinations of items are discovered as frequent if minsup is set as low. To solve this problem, this paper proposes a novel CBA-based method called MMSCBA, which considers the concept of multiple minimum supports (MMSs). Based on MMSs, different classification rules appear in the corresponding minsups. Several experiments were conducted with six real-world datasets selected from the UCI Machine Learning Repository. The results show that MMSCBA achieves higher accuracy than conventional CBA methods, especially when the dataset contains rare items. Springer International Publishing 2016-04-26 /pmc/articles/PMC4844591/ /pubmed/27186492 http://dx.doi.org/10.1186/s40064-016-2153-1 Text en © Hu et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Hu, Li-Yu
Hu, Ya-Han
Tsai, Chih-Fong
Wang, Jian-Shian
Huang, Min-Wei
Building an associative classifier with multiple minimum supports
title Building an associative classifier with multiple minimum supports
title_full Building an associative classifier with multiple minimum supports
title_fullStr Building an associative classifier with multiple minimum supports
title_full_unstemmed Building an associative classifier with multiple minimum supports
title_short Building an associative classifier with multiple minimum supports
title_sort building an associative classifier with multiple minimum supports
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4844591/
https://www.ncbi.nlm.nih.gov/pubmed/27186492
http://dx.doi.org/10.1186/s40064-016-2153-1
work_keys_str_mv AT huliyu buildinganassociativeclassifierwithmultipleminimumsupports
AT huyahan buildinganassociativeclassifierwithmultipleminimumsupports
AT tsaichihfong buildinganassociativeclassifierwithmultipleminimumsupports
AT wangjianshian buildinganassociativeclassifierwithmultipleminimumsupports
AT huangminwei buildinganassociativeclassifierwithmultipleminimumsupports