Cargando…
HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth
In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-N...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954358/ https://www.ncbi.nlm.nih.gov/pubmed/33711048 http://dx.doi.org/10.1371/journal.pone.0248349 |
_version_ | 1783664064427196416 |
---|---|
author | Wang, Le Wang, Shui |
author_facet | Wang, Le Wang, Shui |
author_sort | Wang, Le |
collection | PubMed |
description | In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors. |
format | Online Article Text |
id | pubmed-7954358 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-79543582021-03-22 HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth Wang, Le Wang, Shui PLoS One Research Article In recent years, high utility itemsets (HUIs) mining has been an active research topic in data mining. In this study, we propose two efficient pattern-growth based HUI mining algorithms, called High Utility Itemset based on Length and Tail-Node tree (HUIL-TN) and High Utility Itemset based on Tail-Node tree (HUI-TN). These two algorithms avoid the time-consuming candidate generation stage and the need of scanning the original dataset multiple times for exact utility values. A novel tree structure, named tail-node tree (TN-tree) is proposed as a key element of our algorithms to maintain complete utililty-information of existing itemsets of a dataset. The performance of HUIL-TN and HUI-TN was evaluated against state-of-the-art reference methods on various datasets. Experimental results showed that our algorithms exceed or close to the best performance on all datasets in terms of running time, while other algorithms can only excel in certain types of dataset. Scalability tests were also performed and our algorithms obtained the flattest curves among all competitors. Public Library of Science 2021-03-12 /pmc/articles/PMC7954358/ /pubmed/33711048 http://dx.doi.org/10.1371/journal.pone.0248349 Text en © 2021 Wang, Wang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wang, Le Wang, Shui HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title | HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title_full | HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title_fullStr | HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title_full_unstemmed | HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title_short | HUIL-TN & HUI-TN: Mining high utility itemsets based on pattern-growth |
title_sort | huil-tn & hui-tn: mining high utility itemsets based on pattern-growth |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954358/ https://www.ncbi.nlm.nih.gov/pubmed/33711048 http://dx.doi.org/10.1371/journal.pone.0248349 |
work_keys_str_mv | AT wangle huiltnhuitnmininghighutilityitemsetsbasedonpatterngrowth AT wangshui huiltnhuitnmininghighutilityitemsetsbasedonpatterngrowth |