Cargando…
A novel association rule mining approach using TID intermediate itemset
Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5774682/ https://www.ncbi.nlm.nih.gov/pubmed/29351287 http://dx.doi.org/10.1371/journal.pone.0179703 |
_version_ | 1783293786090110976 |
---|---|
author | Aqra, Iyad Herawan, Tutut Abdul Ghani, Norjihan Akhunzada, Adnan Ali, Akhtar Bin Razali, Ramdan Ilahi, Manzoor Raymond Choo, Kim-Kwang |
author_facet | Aqra, Iyad Herawan, Tutut Abdul Ghani, Norjihan Akhunzada, Adnan Ali, Akhtar Bin Razali, Ramdan Ilahi, Manzoor Raymond Choo, Kim-Kwang |
author_sort | Aqra, Iyad |
collection | PubMed |
description | Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets. |
format | Online Article Text |
id | pubmed-5774682 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-57746822018-01-26 A novel association rule mining approach using TID intermediate itemset Aqra, Iyad Herawan, Tutut Abdul Ghani, Norjihan Akhunzada, Adnan Ali, Akhtar Bin Razali, Ramdan Ilahi, Manzoor Raymond Choo, Kim-Kwang PLoS One Research Article Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets. Public Library of Science 2018-01-19 /pmc/articles/PMC5774682/ /pubmed/29351287 http://dx.doi.org/10.1371/journal.pone.0179703 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Aqra, Iyad Herawan, Tutut Abdul Ghani, Norjihan Akhunzada, Adnan Ali, Akhtar Bin Razali, Ramdan Ilahi, Manzoor Raymond Choo, Kim-Kwang A novel association rule mining approach using TID intermediate itemset |
title | A novel association rule mining approach using TID intermediate itemset |
title_full | A novel association rule mining approach using TID intermediate itemset |
title_fullStr | A novel association rule mining approach using TID intermediate itemset |
title_full_unstemmed | A novel association rule mining approach using TID intermediate itemset |
title_short | A novel association rule mining approach using TID intermediate itemset |
title_sort | novel association rule mining approach using tid intermediate itemset |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5774682/ https://www.ncbi.nlm.nih.gov/pubmed/29351287 http://dx.doi.org/10.1371/journal.pone.0179703 |
work_keys_str_mv | AT aqraiyad anovelassociationruleminingapproachusingtidintermediateitemset AT herawantutut anovelassociationruleminingapproachusingtidintermediateitemset AT abdulghaninorjihan anovelassociationruleminingapproachusingtidintermediateitemset AT akhunzadaadnan anovelassociationruleminingapproachusingtidintermediateitemset AT aliakhtar anovelassociationruleminingapproachusingtidintermediateitemset AT binrazaliramdan anovelassociationruleminingapproachusingtidintermediateitemset AT ilahimanzoor anovelassociationruleminingapproachusingtidintermediateitemset AT raymondchookimkwang anovelassociationruleminingapproachusingtidintermediateitemset AT aqraiyad novelassociationruleminingapproachusingtidintermediateitemset AT herawantutut novelassociationruleminingapproachusingtidintermediateitemset AT abdulghaninorjihan novelassociationruleminingapproachusingtidintermediateitemset AT akhunzadaadnan novelassociationruleminingapproachusingtidintermediateitemset AT aliakhtar novelassociationruleminingapproachusingtidintermediateitemset AT binrazaliramdan novelassociationruleminingapproachusingtidintermediateitemset AT ilahimanzoor novelassociationruleminingapproachusingtidintermediateitemset AT raymondchookimkwang novelassociationruleminingapproachusingtidintermediateitemset |