Cargando…
A Two-Parameter Fractional Tsallis Decision Tree
Decision trees are decision support data mining tools that create, as the name suggests, a tree-like model. The classical C4.5 decision tree, based on the Shannon entropy, is a simple algorithm to calculate the gain ratio and then split the attributes based on this entropy measure. Tsallis and Renyi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141694/ https://www.ncbi.nlm.nih.gov/pubmed/35626457 http://dx.doi.org/10.3390/e24050572 |
_version_ | 1784715405638500352 |
---|---|
author | De la Cruz-García, Jazmín S. Bory-Reyes, Juan Ramirez-Arellano, Aldo |
author_facet | De la Cruz-García, Jazmín S. Bory-Reyes, Juan Ramirez-Arellano, Aldo |
author_sort | De la Cruz-García, Jazmín S. |
collection | PubMed |
description | Decision trees are decision support data mining tools that create, as the name suggests, a tree-like model. The classical C4.5 decision tree, based on the Shannon entropy, is a simple algorithm to calculate the gain ratio and then split the attributes based on this entropy measure. Tsallis and Renyi entropies (instead of Shannon) can be employed to generate a decision tree with better results. In practice, the entropic index parameter of these entropies is tuned to outperform the classical decision trees. However, this process is carried out by testing a range of values for a given database, which is time-consuming and unfeasible for massive data. This paper introduces a decision tree based on a two-parameter fractional Tsallis entropy. We propose a constructionist approach to the representation of databases as complex networks that enable us an efficient computation of the parameters of this entropy using the box-covering algorithm and renormalization of the complex network. The experimental results support the conclusion that the two-parameter fractional Tsallis entropy is a more sensitive measure than parametric Renyi, Tsallis, and Gini index precedents for a decision tree classifier. |
format | Online Article Text |
id | pubmed-9141694 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91416942022-05-28 A Two-Parameter Fractional Tsallis Decision Tree De la Cruz-García, Jazmín S. Bory-Reyes, Juan Ramirez-Arellano, Aldo Entropy (Basel) Article Decision trees are decision support data mining tools that create, as the name suggests, a tree-like model. The classical C4.5 decision tree, based on the Shannon entropy, is a simple algorithm to calculate the gain ratio and then split the attributes based on this entropy measure. Tsallis and Renyi entropies (instead of Shannon) can be employed to generate a decision tree with better results. In practice, the entropic index parameter of these entropies is tuned to outperform the classical decision trees. However, this process is carried out by testing a range of values for a given database, which is time-consuming and unfeasible for massive data. This paper introduces a decision tree based on a two-parameter fractional Tsallis entropy. We propose a constructionist approach to the representation of databases as complex networks that enable us an efficient computation of the parameters of this entropy using the box-covering algorithm and renormalization of the complex network. The experimental results support the conclusion that the two-parameter fractional Tsallis entropy is a more sensitive measure than parametric Renyi, Tsallis, and Gini index precedents for a decision tree classifier. MDPI 2022-04-19 /pmc/articles/PMC9141694/ /pubmed/35626457 http://dx.doi.org/10.3390/e24050572 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article De la Cruz-García, Jazmín S. Bory-Reyes, Juan Ramirez-Arellano, Aldo A Two-Parameter Fractional Tsallis Decision Tree |
title | A Two-Parameter Fractional Tsallis Decision Tree |
title_full | A Two-Parameter Fractional Tsallis Decision Tree |
title_fullStr | A Two-Parameter Fractional Tsallis Decision Tree |
title_full_unstemmed | A Two-Parameter Fractional Tsallis Decision Tree |
title_short | A Two-Parameter Fractional Tsallis Decision Tree |
title_sort | two-parameter fractional tsallis decision tree |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9141694/ https://www.ncbi.nlm.nih.gov/pubmed/35626457 http://dx.doi.org/10.3390/e24050572 |
work_keys_str_mv | AT delacruzgarciajazmins atwoparameterfractionaltsallisdecisiontree AT boryreyesjuan atwoparameterfractionaltsallisdecisiontree AT ramirezarellanoaldo atwoparameterfractionaltsallisdecisiontree AT delacruzgarciajazmins twoparameterfractionaltsallisdecisiontree AT boryreyesjuan twoparameterfractionaltsallisdecisiontree AT ramirezarellanoaldo twoparameterfractionaltsallisdecisiontree |