RPP Algorithm: A Method for Discovering Interesting Rare Itemsets

The importance of rare itemset mining stems from its ability to discover unseen knowledge from datasets in real-life domains, such as identifying network failures, or suspicious behavior. There are significant efforts proposed to extract rare itemsets. The RP-growth algorithm outperforms previous me...

Bibliographic Details
Main Authors: Darrab, Sadeq; Broneske, David; Saake, Gunter
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351680/
http://dx.doi.org/10.1007/978-981-15-7205-0_2
_version_ 1783557486518730752
author Darrab, Sadeq
Broneske, David
Saake, Gunter
author_facet Darrab, Sadeq
Broneske, David
Saake, Gunter
author_sort Darrab, Sadeq
collection PubMed
description The importance of rare itemset mining stems from its ability to discover unseen knowledge in datasets from real-life domains, for example by identifying network failures or suspicious behavior. Significant effort has gone into extracting rare itemsets, and the RP-growth algorithm outperforms previously proposed methods for generating them. However, the performance of RP-growth degrades on sparse datasets, and it is costly in terms of time and memory consumption. Hence, in this paper, we propose the RPP algorithm to extract rare itemsets. The advantage of the RPP algorithm is that, unlike RP-growth, it does not build conditional trees and so avoids the time spent generating useless candidate itemsets. Furthermore, our RPP algorithm uses a novel data structure, RN-list, for creating rare itemsets. To evaluate the performance of the proposed method, we conduct extensive experiments on sparse and dense datasets. The results show that the RPP algorithm is around an order of magnitude better than the RP-growth algorithm.
format Online
Article
Text
id pubmed-7351680
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73516802020-07-13 RPP Algorithm: A Method for Discovering Interesting Rare Itemsets Darrab, Sadeq Broneske, David Saake, Gunter Data Mining and Big Data Article The importance of rare itemset mining stems from its ability to discover unseen knowledge from datasets in real-life domains, such as identifying network failures, or suspicious behavior. There are significant efforts proposed to extract rare itemsets. The RP-growth algorithm outperforms previous methods proposed for generating rare itemsets. However, the performance of the RP-growth degrades on sparse datasets, and it is costly in terms of time and memory consumption. Hence, in this paper, we propose the RPP algorithm to extract rare itemsets. The advantage of the RPP algorithm is that it avoids time for generating useless candidate itemsets by omitting conditional trees as RP-growth does. Furthermore, our RPP algorithm uses a novel data structure, RN-list, for creating rare itemsets. To evaluate the performance of the proposed method, we conduct extensive experiments on sparse and dense datasets. The results show that the RPP algorithm is around an order of magnitude better than the RP-growth algorithm. 2020-07-11 /pmc/articles/PMC7351680/ http://dx.doi.org/10.1007/978-981-15-7205-0_2 Text en © Springer Nature Singapore Pte Ltd. 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
title RPP Algorithm: A Method for Discovering Interesting Rare Itemsets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7351680/
http://dx.doi.org/10.1007/978-981-15-7205-0_2