Cargando…

An efficient pattern growth approach for mining fault tolerant frequent itemsets

Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets...

Descripción completa

Detalles Bibliográficos
Autor principal: Bashir, Shariq
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7126664/
https://www.ncbi.nlm.nih.gov/pubmed/32288329
http://dx.doi.org/10.1016/j.eswa.2019.113046
_version_ 1783516195405692928
author Bashir, Shariq
author_facet Bashir, Shariq
author_sort Bashir, Shariq
collection PubMed
description Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets including the itemsets that do not exist in the database. These algorithms require multiple scans of database for counting the support of candidate FT itemsets. In this paper we present a novel algorithm, which mines FT frequent itemsets using frequent pattern growth approach (FT-PatternGrowth). FT-PatternGrowth adopts a divide-and-conquer technique and recursively projects transactional database into a set of smaller projected transactional databases and mines FT frequent itemsets in each projected database by exploring only locally frequent items. This mines the complete set of FT frequent itemsets and substantially reduces those candidate itemsets that do not exist in the database. FT-PatternGrowth stores the transactional database in a highly condensed much smaller data structure called frequent pattern tree (FP-tree). The support of candidate itemsets are counted directly from the FP-tree without scanning the original database multiple times. This improves the processing speed of algorithm. Our experiments on benchmark databases indicates mining FT frequent itemsets using FT-PatternGrowth is highly efficient than Apriori-like algorithms.
format Online
Article
Text
id pubmed-7126664
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-71266642020-04-08 An efficient pattern growth approach for mining fault tolerant frequent itemsets Bashir, Shariq Expert Syst Appl Article Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets including the itemsets that do not exist in the database. These algorithms require multiple scans of database for counting the support of candidate FT itemsets. In this paper we present a novel algorithm, which mines FT frequent itemsets using frequent pattern growth approach (FT-PatternGrowth). FT-PatternGrowth adopts a divide-and-conquer technique and recursively projects transactional database into a set of smaller projected transactional databases and mines FT frequent itemsets in each projected database by exploring only locally frequent items. This mines the complete set of FT frequent itemsets and substantially reduces those candidate itemsets that do not exist in the database. FT-PatternGrowth stores the transactional database in a highly condensed much smaller data structure called frequent pattern tree (FP-tree). The support of candidate itemsets are counted directly from the FP-tree without scanning the original database multiple times. This improves the processing speed of algorithm. Our experiments on benchmark databases indicates mining FT frequent itemsets using FT-PatternGrowth is highly efficient than Apriori-like algorithms. Elsevier Ltd. 2020-04-01 2019-10-21 /pmc/articles/PMC7126664/ /pubmed/32288329 http://dx.doi.org/10.1016/j.eswa.2019.113046 Text en © 2019 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Bashir, Shariq
An efficient pattern growth approach for mining fault tolerant frequent itemsets
title An efficient pattern growth approach for mining fault tolerant frequent itemsets
title_full An efficient pattern growth approach for mining fault tolerant frequent itemsets
title_fullStr An efficient pattern growth approach for mining fault tolerant frequent itemsets
title_full_unstemmed An efficient pattern growth approach for mining fault tolerant frequent itemsets
title_short An efficient pattern growth approach for mining fault tolerant frequent itemsets
title_sort efficient pattern growth approach for mining fault tolerant frequent itemsets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7126664/
https://www.ncbi.nlm.nih.gov/pubmed/32288329
http://dx.doi.org/10.1016/j.eswa.2019.113046
work_keys_str_mv AT bashirshariq anefficientpatterngrowthapproachforminingfaulttolerantfrequentitemsets
AT bashirshariq efficientpatterngrowthapproachforminingfaulttolerantfrequentitemsets