Cargando…
Modeling Categorical Variables by Mutual Information Decomposition
This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimon...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217765/ https://www.ncbi.nlm.nih.gov/pubmed/37238505 http://dx.doi.org/10.3390/e25050750 |
_version_ | 1785048616596930560 |
---|---|
author | Liou, Jiun-Wei Liou, Michelle Cheng, Philip E. |
author_facet | Liou, Jiun-Wei Liou, Michelle Cheng, Philip E. |
author_sort | Liou, Jiun-Wei |
collection | PubMed |
description | This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimonious log-linear and logistic models. The proposed approach was assessed using two real-world datasets dealing with ischemic stroke (with 6 risk factors) and banking credit (with 21 discrete attributes in a sparse table). This paper also provided an empirical comparison of MI analysis versus two state-of-the-art methods in terms of variable and model selections. The proposed MI analysis scheme can be used in the construction of parsimonious log-linear and logistic models with a concise interpretation of discrete multivariate data. |
format | Online Article Text |
id | pubmed-10217765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102177652023-05-27 Modeling Categorical Variables by Mutual Information Decomposition Liou, Jiun-Wei Liou, Michelle Cheng, Philip E. Entropy (Basel) Article This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimonious log-linear and logistic models. The proposed approach was assessed using two real-world datasets dealing with ischemic stroke (with 6 risk factors) and banking credit (with 21 discrete attributes in a sparse table). This paper also provided an empirical comparison of MI analysis versus two state-of-the-art methods in terms of variable and model selections. The proposed MI analysis scheme can be used in the construction of parsimonious log-linear and logistic models with a concise interpretation of discrete multivariate data. MDPI 2023-05-04 /pmc/articles/PMC10217765/ /pubmed/37238505 http://dx.doi.org/10.3390/e25050750 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liou, Jiun-Wei Liou, Michelle Cheng, Philip E. Modeling Categorical Variables by Mutual Information Decomposition |
title | Modeling Categorical Variables by Mutual Information Decomposition |
title_full | Modeling Categorical Variables by Mutual Information Decomposition |
title_fullStr | Modeling Categorical Variables by Mutual Information Decomposition |
title_full_unstemmed | Modeling Categorical Variables by Mutual Information Decomposition |
title_short | Modeling Categorical Variables by Mutual Information Decomposition |
title_sort | modeling categorical variables by mutual information decomposition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217765/ https://www.ncbi.nlm.nih.gov/pubmed/37238505 http://dx.doi.org/10.3390/e25050750 |
work_keys_str_mv | AT lioujiunwei modelingcategoricalvariablesbymutualinformationdecomposition AT lioumichelle modelingcategoricalvariablesbymutualinformationdecomposition AT chengphilipe modelingcategoricalvariablesbymutualinformationdecomposition |