RPP Algorithm: A Method for Discovering Interesting Rare Itemsets
The importance of rare itemset mining stems from its ability to discover unseen knowledge from datasets in real-life domains, such as identifying network failures, or suspicious behavior. There are significant efforts proposed to extract rare itemsets. The RP-growth algorithm outperforms previous me...
Main Authors: | Darrab, Sadeq; Broneske, David; Saake, Gunter |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | 2020 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351680/ http://dx.doi.org/10.1007/978-981-15-7205-0_2 |
_version_ | 1783557486518730752 |
---|---|
author | Darrab, Sadeq Broneske, David Saake, Gunter |
author_facet | Darrab, Sadeq Broneske, David Saake, Gunter |
author_sort | Darrab, Sadeq |
collection | PubMed |
description | The importance of rare itemset mining stems from its ability to discover unseen knowledge from datasets in real-life domains, such as identifying network failures, or suspicious behavior. There are significant efforts proposed to extract rare itemsets. The RP-growth algorithm outperforms previous methods proposed for generating rare itemsets. However, the performance of the RP-growth degrades on sparse datasets, and it is costly in terms of time and memory consumption. Hence, in this paper, we propose the RPP algorithm to extract rare itemsets. The advantage of the RPP algorithm is that it avoids time for generating useless candidate itemsets by omitting conditional trees as RP-growth does. Furthermore, our RPP algorithm uses a novel data structure, RN-list, for creating rare itemsets. To evaluate the performance of the proposed method, we conduct extensive experiments on sparse and dense datasets. The results show that the RPP algorithm is around an order of magnitude better than the RP-growth algorithm. |
format | Online Article Text |
id | pubmed-7351680 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73516802020-07-13 RPP Algorithm: A Method for Discovering Interesting Rare Itemsets Darrab, Sadeq Broneske, David Saake, Gunter Data Mining and Big Data Article The importance of rare itemset mining stems from its ability to discover unseen knowledge from datasets in real-life domains, such as identifying network failures, or suspicious behavior. There are significant efforts proposed to extract rare itemsets. The RP-growth algorithm outperforms previous methods proposed for generating rare itemsets. However, the performance of the RP-growth degrades on sparse datasets, and it is costly in terms of time and memory consumption. Hence, in this paper, we propose the RPP algorithm to extract rare itemsets. The advantage of the RPP algorithm is that it avoids time for generating useless candidate itemsets by omitting conditional trees as RP-growth does. Furthermore, our RPP algorithm uses a novel data structure, RN-list, for creating rare itemsets. To evaluate the performance of the proposed method, we conduct extensive experiments on sparse and dense datasets. The results show that the RPP algorithm is around an order of magnitude better than the RP-growth algorithm. 2020-07-11 /pmc/articles/PMC7351680/ http://dx.doi.org/10.1007/978-981-15-7205-0_2 Text en © Springer Nature Singapore Pte Ltd. 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Darrab, Sadeq Broneske, David Saake, Gunter RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title | RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title_full | RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title_fullStr | RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title_full_unstemmed | RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title_short | RPP Algorithm: A Method for Discovering Interesting Rare Itemsets |
title_sort | rpp algorithm: a method for discovering interesting rare itemsets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351680/ http://dx.doi.org/10.1007/978-981-15-7205-0_2 |