Cargando…

A novel association rule mining approach using TID intermediate itemset

Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output...

Descripción completa

Detalles Bibliográficos
Autores principales: Aqra, Iyad, Herawan, Tutut, Abdul Ghani, Norjihan, Akhunzada, Adnan, Ali, Akhtar, Bin Razali, Ramdan, Ilahi, Manzoor, Raymond Choo, Kim-Kwang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5774682/
https://www.ncbi.nlm.nih.gov/pubmed/29351287
http://dx.doi.org/10.1371/journal.pone.0179703
_version_ 1783293786090110976
author Aqra, Iyad
Herawan, Tutut
Abdul Ghani, Norjihan
Akhunzada, Adnan
Ali, Akhtar
Bin Razali, Ramdan
Ilahi, Manzoor
Raymond Choo, Kim-Kwang
author_facet Aqra, Iyad
Herawan, Tutut
Abdul Ghani, Norjihan
Akhunzada, Adnan
Ali, Akhtar
Bin Razali, Ramdan
Ilahi, Manzoor
Raymond Choo, Kim-Kwang
author_sort Aqra, Iyad
collection PubMed
description Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets.
format Online
Article
Text
id pubmed-5774682
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57746822018-01-26 A novel association rule mining approach using TID intermediate itemset Aqra, Iyad Herawan, Tutut Abdul Ghani, Norjihan Akhunzada, Adnan Ali, Akhtar Bin Razali, Ramdan Ilahi, Manzoor Raymond Choo, Kim-Kwang PLoS One Research Article Designing an efficient association rule mining (ARM) algorithm for multilevel knowledge-based transactional databases that is appropriate for real-world deployments is of paramount concern. However, dynamic decision making that needs to modify the threshold either to minimize or maximize the output knowledge certainly necessitates the extant state-of-the-art algorithms to rescan the entire database. Subsequently, the process incurs heavy computation cost and is not feasible for real-time applications. The paper addresses efficiently the problem of threshold dynamic updation for a given purpose. The paper contributes by presenting a novel ARM approach that creates an intermediate itemset and applies a threshold to extract categorical frequent itemsets with diverse threshold values. Thus, improving the overall efficiency as we no longer needs to scan the whole database. After the entire itemset is built, we are able to obtain real support without the need of rebuilding the itemset (e.g. Itemset list is intersected to obtain the actual support). Moreover, the algorithm supports to extract many frequent itemsets according to a pre-determined minimum support with an independent purpose. Additionally, the experimental results of our proposed approach demonstrate the capability to be deployed in any mining system in a fully parallel mode; consequently, increasing the efficiency of the real-time association rules discovery process. The proposed approach outperforms the extant state-of-the-art and shows promising results that reduce computation cost, increase accuracy, and produce all possible itemsets. Public Library of Science 2018-01-19 /pmc/articles/PMC5774682/ /pubmed/29351287 http://dx.doi.org/10.1371/journal.pone.0179703 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Aqra, Iyad
Herawan, Tutut
Abdul Ghani, Norjihan
Akhunzada, Adnan
Ali, Akhtar
Bin Razali, Ramdan
Ilahi, Manzoor
Raymond Choo, Kim-Kwang
A novel association rule mining approach using TID intermediate itemset
title A novel association rule mining approach using TID intermediate itemset
title_full A novel association rule mining approach using TID intermediate itemset
title_fullStr A novel association rule mining approach using TID intermediate itemset
title_full_unstemmed A novel association rule mining approach using TID intermediate itemset
title_short A novel association rule mining approach using TID intermediate itemset
title_sort novel association rule mining approach using tid intermediate itemset
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5774682/
https://www.ncbi.nlm.nih.gov/pubmed/29351287
http://dx.doi.org/10.1371/journal.pone.0179703
work_keys_str_mv AT aqraiyad anovelassociationruleminingapproachusingtidintermediateitemset
AT herawantutut anovelassociationruleminingapproachusingtidintermediateitemset
AT abdulghaninorjihan anovelassociationruleminingapproachusingtidintermediateitemset
AT akhunzadaadnan anovelassociationruleminingapproachusingtidintermediateitemset
AT aliakhtar anovelassociationruleminingapproachusingtidintermediateitemset
AT binrazaliramdan anovelassociationruleminingapproachusingtidintermediateitemset
AT ilahimanzoor anovelassociationruleminingapproachusingtidintermediateitemset
AT raymondchookimkwang anovelassociationruleminingapproachusingtidintermediateitemset
AT aqraiyad novelassociationruleminingapproachusingtidintermediateitemset
AT herawantutut novelassociationruleminingapproachusingtidintermediateitemset
AT abdulghaninorjihan novelassociationruleminingapproachusingtidintermediateitemset
AT akhunzadaadnan novelassociationruleminingapproachusingtidintermediateitemset
AT aliakhtar novelassociationruleminingapproachusingtidintermediateitemset
AT binrazaliramdan novelassociationruleminingapproachusingtidintermediateitemset
AT ilahimanzoor novelassociationruleminingapproachusingtidintermediateitemset
AT raymondchookimkwang novelassociationruleminingapproachusingtidintermediateitemset