Cargando…

HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth

In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-N...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Le, Wang, Shui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954358/
https://www.ncbi.nlm.nih.gov/pubmed/33711048
http://dx.doi.org/10.1371/journal.pone.0248349
_version_ 1783664064427196416
author Wang, Le
Wang, Shui
author_facet Wang, Le
Wang, Shui
author_sort Wang, Le
collection PubMed
description In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors.
format Online
Article
Text
id pubmed-7954358
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-79543582021-03-22 HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth Wang, Le Wang, Shui PLoS One Research Article In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors. Public Library of Science 2021-03-12 /pmc/articles/PMC7954358/ /pubmed/33711048 http://dx.doi.org/10.1371/journal.pone.0248349 Text en © 2021 Wang, Wang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Le
Wang, Shui
HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title_full HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title_fullStr HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title_full_unstemmed HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title_short HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
title_sort huil-tn & hui-tn: mining high utility itemsets based on pattern-growth
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954358/
https://www.ncbi.nlm.nih.gov/pubmed/33711048
http://dx.doi.org/10.1371/journal.pone.0248349
work_keys_str_mv AT wangle huiltnhuitnmininghighutilityitemsetsbasedonpatterngrowth
AT wangshui huiltnhuitnmininghighutilityitemsetsbasedonpatterngrowth