Cargando…

Modeling Categorical Variables by Mutual Information Decomposition

This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimon...

Descripción completa

Detalles Bibliográficos
Autores principales: Liou, Jiun-Wei, Liou, Michelle, Cheng, Philip E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217765/
https://www.ncbi.nlm.nih.gov/pubmed/37238505
http://dx.doi.org/10.3390/e25050750
_version_ 1785048616596930560
author Liou, Jiun-Wei
Liou, Michelle
Cheng, Philip E.
author_facet Liou, Jiun-Wei
Liou, Michelle
Cheng, Philip E.
author_sort Liou, Jiun-Wei
collection PubMed
description This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimonious log-linear and logistic models. The proposed approach was assessed using two real-world datasets dealing with ischemic stroke (with 6 risk factors) and banking credit (with 21 discrete attributes in a sparse table). This paper also provided an empirical comparison of MI analysis versus two state-of-the-art methods in terms of variable and model selections. The proposed MI analysis scheme can be used in the construction of parsimonious log-linear and logistic models with a concise interpretation of discrete multivariate data.
format Online
Article
Text
id pubmed-10217765
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102177652023-05-27 Modeling Categorical Variables by Mutual Information Decomposition Liou, Jiun-Wei Liou, Michelle Cheng, Philip E. Entropy (Basel) Article This paper proposed the use of mutual information (MI) decomposition as a novel approach to identifying indispensable variables and their interactions for contingency table analysis. The MI analysis identified subsets of associative variables based on multinomial distributions and validated parsimonious log-linear and logistic models. The proposed approach was assessed using two real-world datasets dealing with ischemic stroke (with 6 risk factors) and banking credit (with 21 discrete attributes in a sparse table). This paper also provided an empirical comparison of MI analysis versus two state-of-the-art methods in terms of variable and model selections. The proposed MI analysis scheme can be used in the construction of parsimonious log-linear and logistic models with a concise interpretation of discrete multivariate data. MDPI 2023-05-04 /pmc/articles/PMC10217765/ /pubmed/37238505 http://dx.doi.org/10.3390/e25050750 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liou, Jiun-Wei
Liou, Michelle
Cheng, Philip E.
Modeling Categorical Variables by Mutual Information Decomposition
title Modeling Categorical Variables by Mutual Information Decomposition
title_full Modeling Categorical Variables by Mutual Information Decomposition
title_fullStr Modeling Categorical Variables by Mutual Information Decomposition
title_full_unstemmed Modeling Categorical Variables by Mutual Information Decomposition
title_short Modeling Categorical Variables by Mutual Information Decomposition
title_sort modeling categorical variables by mutual information decomposition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10217765/
https://www.ncbi.nlm.nih.gov/pubmed/37238505
http://dx.doi.org/10.3390/e25050750
work_keys_str_mv AT lioujiunwei modelingcategoricalvariablesbymutualinformationdecomposition
AT lioumichelle modelingcategoricalvariablesbymutualinformationdecomposition
AT chengphilipe modelingcategoricalvariablesbymutualinformationdecomposition