Cargando…
An efficient pattern growth approach for mining fault tolerant frequent itemsets
Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7126664/ https://www.ncbi.nlm.nih.gov/pubmed/32288329 http://dx.doi.org/10.1016/j.eswa.2019.113046 |
_version_ | 1783516195405692928 |
---|---|
author | Bashir, Shariq |
author_facet | Bashir, Shariq |
author_sort | Bashir, Shariq |
collection | PubMed |
description | Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets including the itemsets that do not exist in the database. These algorithms require multiple scans of database for counting the support of candidate FT itemsets. In this paper we present a novel algorithm, which mines FT frequent itemsets using frequent pattern growth approach (FT-PatternGrowth). FT-PatternGrowth adopts a divide-and-conquer technique and recursively projects transactional database into a set of smaller projected transactional databases and mines FT frequent itemsets in each projected database by exploring only locally frequent items. This mines the complete set of FT frequent itemsets and substantially reduces those candidate itemsets that do not exist in the database. FT-PatternGrowth stores the transactional database in a highly condensed much smaller data structure called frequent pattern tree (FP-tree). The support of candidate itemsets are counted directly from the FP-tree without scanning the original database multiple times. This improves the processing speed of algorithm. Our experiments on benchmark databases indicates mining FT frequent itemsets using FT-PatternGrowth is highly efficient than Apriori-like algorithms. |
format | Online Article Text |
id | pubmed-7126664 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-71266642020-04-08 An efficient pattern growth approach for mining fault tolerant frequent itemsets Bashir, Shariq Expert Syst Appl Article Mining fault tolerant (FT) frequent itemsets from transactional databases are computationally more expensive than mining exact matching frequent itemsets. Previous algorithms mine FT frequent itemsets using Apriori heuristic. Apriori-like algorithms generate exponential number of candidate itemsets including the itemsets that do not exist in the database. These algorithms require multiple scans of database for counting the support of candidate FT itemsets. In this paper we present a novel algorithm, which mines FT frequent itemsets using frequent pattern growth approach (FT-PatternGrowth). FT-PatternGrowth adopts a divide-and-conquer technique and recursively projects transactional database into a set of smaller projected transactional databases and mines FT frequent itemsets in each projected database by exploring only locally frequent items. This mines the complete set of FT frequent itemsets and substantially reduces those candidate itemsets that do not exist in the database. FT-PatternGrowth stores the transactional database in a highly condensed much smaller data structure called frequent pattern tree (FP-tree). The support of candidate itemsets are counted directly from the FP-tree without scanning the original database multiple times. This improves the processing speed of algorithm. Our experiments on benchmark databases indicates mining FT frequent itemsets using FT-PatternGrowth is highly efficient than Apriori-like algorithms. Elsevier Ltd. 2020-04-01 2019-10-21 /pmc/articles/PMC7126664/ /pubmed/32288329 http://dx.doi.org/10.1016/j.eswa.2019.113046 Text en © 2019 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Bashir, Shariq An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title | An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title_full | An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title_fullStr | An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title_full_unstemmed | An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title_short | An efficient pattern growth approach for mining fault tolerant frequent itemsets |
title_sort | efficient pattern growth approach for mining fault tolerant frequent itemsets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7126664/ https://www.ncbi.nlm.nih.gov/pubmed/32288329 http://dx.doi.org/10.1016/j.eswa.2019.113046 |
work_keys_str_mv | AT bashirshariq anefficientpatterngrowthapproachforminingfaulttolerantfrequentitemsets AT bashirshariq efficientpatterngrowthapproachforminingfaulttolerantfrequentitemsets |