Cargando…

Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism

Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to refle...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Ya-Han, Chen, Yen-Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier B.V. 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127670/
https://www.ncbi.nlm.nih.gov/pubmed/32287563
http://dx.doi.org/10.1016/j.dss.2004.09.007
_version_ 1783516411176419328
author Hu, Ya-Han
Chen, Yen-Liang
author_facet Hu, Ya-Han
Chen, Yen-Liang
author_sort Hu, Ya-Han
collection PubMed
description Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to mine all frequent itemsets. In this paper, we study the same problem but with two additional improvements. First, we propose a FP-tree-like structure, MIS-tree, to store the crucial information about frequent patterns. Accordingly, an efficient MIS-tree-based algorithm, called the CFP-growth algorithm, is developed for mining all frequent itemsets. Second, since each item can have its own minimum support, it is very difficult for users to set the appropriate thresholds for all items at a time. In practice, users need to tune items' supports and run the mining algorithm repeatedly until a satisfactory end is reached. To speed up this time-consuming tuning process, an efficient algorithm which can maintain the MIS-tree structure without rescanning database is proposed. Experiments on both synthetic and real-life datasets show that our algorithms are much more efficient and scalable than the previous algorithm.
format Online
Article
Text
id pubmed-7127670
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-71276702020-04-08 Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism Hu, Ya-Han Chen, Yen-Liang Decis Support Syst Article Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to mine all frequent itemsets. In this paper, we study the same problem but with two additional improvements. First, we propose a FP-tree-like structure, MIS-tree, to store the crucial information about frequent patterns. Accordingly, an efficient MIS-tree-based algorithm, called the CFP-growth algorithm, is developed for mining all frequent itemsets. Second, since each item can have its own minimum support, it is very difficult for users to set the appropriate thresholds for all items at a time. In practice, users need to tune items' supports and run the mining algorithm repeatedly until a satisfactory end is reached. To speed up this time-consuming tuning process, an efficient algorithm which can maintain the MIS-tree structure without rescanning database is proposed. Experiments on both synthetic and real-life datasets show that our algorithms are much more efficient and scalable than the previous algorithm. Elsevier B.V. 2006-10 2004-11-30 /pmc/articles/PMC7127670/ /pubmed/32287563 http://dx.doi.org/10.1016/j.dss.2004.09.007 Text en Copyright © 2004 Elsevier B.V. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Hu, Ya-Han
Chen, Yen-Liang
Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title_full Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title_fullStr Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title_full_unstemmed Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title_short Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
title_sort mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7127670/
https://www.ncbi.nlm.nih.gov/pubmed/32287563
http://dx.doi.org/10.1016/j.dss.2004.09.007
work_keys_str_mv AT huyahan miningassociationruleswithmultipleminimumsupportsanewminingalgorithmandasupporttuningmechanism
AT chenyenliang miningassociationruleswithmultipleminimumsupportsanewminingalgorithmandasupporttuningmechanism