Cargando…

Categorical Nature of Major Factor Selection via Information Theoretic Measurements

Without assuming any functional or distributional structure, we select collections of major factors embedded within response-versus-covariate (Re-Co) dynamics via selection criteria [C1: confirmable] and [C2: irrepaceable], which are based on information theoretic measurements. The two criteria are...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ting-Li, Chou, Elizabeth P., Fushing, Hsieh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700576/
https://www.ncbi.nlm.nih.gov/pubmed/34945990
http://dx.doi.org/10.3390/e23121684
_version_ 1784620790396747776
author Chen, Ting-Li
Chou, Elizabeth P.
Fushing, Hsieh
author_facet Chen, Ting-Li
Chou, Elizabeth P.
Fushing, Hsieh
author_sort Chen, Ting-Li
collection PubMed
description Without assuming any functional or distributional structure, we select collections of major factors embedded within response-versus-covariate (Re-Co) dynamics via selection criteria [C1: confirmable] and [C2: irrepaceable], which are based on information theoretic measurements. The two criteria are constructed based on the computing paradigm called Categorical Exploratory Data Analysis (CEDA) and linked to Wiener–Granger causality. All the information theoretical measurements, including conditional mutual information and entropy, are evaluated through the contingency table platform, which primarily rests on the categorical nature within all involved features of any data types: quantitative or qualitative. Our selection task identifies one chief collection, together with several secondary collections of major factors of various orders underlying the targeted Re-Co dynamics. Each selected collection is checked with algorithmically computed reliability against the finite sample phenomenon, and so is each member’s major factor individually. The developments of our selection protocol are illustrated in detail through two experimental examples: a simple one and a complex one. We then apply this protocol on two data sets pertaining to two somewhat related but distinct pitching dynamics of two pitch types: slider and fastball. In particular, we refer to a specific Major League Baseball (MLB) pitcher and we consider data of multiple seasons.
format Online
Article
Text
id pubmed-8700576
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87005762021-12-24 Categorical Nature of Major Factor Selection via Information Theoretic Measurements Chen, Ting-Li Chou, Elizabeth P. Fushing, Hsieh Entropy (Basel) Article Without assuming any functional or distributional structure, we select collections of major factors embedded within response-versus-covariate (Re-Co) dynamics via selection criteria [C1: confirmable] and [C2: irrepaceable], which are based on information theoretic measurements. The two criteria are constructed based on the computing paradigm called Categorical Exploratory Data Analysis (CEDA) and linked to Wiener–Granger causality. All the information theoretical measurements, including conditional mutual information and entropy, are evaluated through the contingency table platform, which primarily rests on the categorical nature within all involved features of any data types: quantitative or qualitative. Our selection task identifies one chief collection, together with several secondary collections of major factors of various orders underlying the targeted Re-Co dynamics. Each selected collection is checked with algorithmically computed reliability against the finite sample phenomenon, and so is each member’s major factor individually. The developments of our selection protocol are illustrated in detail through two experimental examples: a simple one and a complex one. We then apply this protocol on two data sets pertaining to two somewhat related but distinct pitching dynamics of two pitch types: slider and fastball. In particular, we refer to a specific Major League Baseball (MLB) pitcher and we consider data of multiple seasons. MDPI 2021-12-15 /pmc/articles/PMC8700576/ /pubmed/34945990 http://dx.doi.org/10.3390/e23121684 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chen, Ting-Li
Chou, Elizabeth P.
Fushing, Hsieh
Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title_full Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title_fullStr Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title_full_unstemmed Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title_short Categorical Nature of Major Factor Selection via Information Theoretic Measurements
title_sort categorical nature of major factor selection via information theoretic measurements
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8700576/
https://www.ncbi.nlm.nih.gov/pubmed/34945990
http://dx.doi.org/10.3390/e23121684
work_keys_str_mv AT chentingli categoricalnatureofmajorfactorselectionviainformationtheoreticmeasurements
AT chouelizabethp categoricalnatureofmajorfactorselectionviainformationtheoreticmeasurements
AT fushinghsieh categoricalnatureofmajorfactorselectionviainformationtheoreticmeasurements