Cargando…

From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching

Data generated from a system of interest typically consists of measurements on many covariate features and possibly multiple response features across all subjects in a designated ensemble. Such data is naturally represented by one response-matrix against one covariate-matrix. A matrix lattice is an...

Descripción completa

Detalles Bibliográficos
Autores principales: Fushing, Hsieh, Liu, Shan-Yu, Hsieh, Yin-Chen, McCowan, Brenda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006982/
https://www.ncbi.nlm.nih.gov/pubmed/29902187
http://dx.doi.org/10.1371/journal.pone.0198253
_version_ 1783332952936022016
author Fushing, Hsieh
Liu, Shan-Yu
Hsieh, Yin-Chen
McCowan, Brenda
author_facet Fushing, Hsieh
Liu, Shan-Yu
Hsieh, Yin-Chen
McCowan, Brenda
author_sort Fushing, Hsieh
collection PubMed
description Data generated from a system of interest typically consists of measurements on many covariate features and possibly multiple response features across all subjects in a designated ensemble. Such data is naturally represented by one response-matrix against one covariate-matrix. A matrix lattice is an advantageous platform for simultaneously accommodating heterogeneous data types: continuous, discrete and categorical, and exploring hidden dependency among/between features and subjects. After each feature being individually renormalized with respect to its own histogram, the categorical version of mutual conditional entropy is evaluated for all pairs of response and covariate features according to the combinatorial information theory. Then, by applying Data Could Geometry (DCG) algorithmic computations on such a mutual conditional entropy matrix, multiple synergistic feature-groups are partitioned. Distinct synergistic feature-groups embrace distinct structures of dependency. The explicit details of dependency among members of synergistic features are seen through mutliscale compositions of blocks computed by a computing paradigm called Data Mechanics. We then propose a categorical pattern matching approach to establish a directed associative linkage: from the patterned response dependency to serial structured covariate dependency. The graphic display of such a directed associative linkage is termed an information flow and the degrees of association are evaluated via tree-to-tree mutual conditional entropy. This new universal way of discovering system knowledge is illustrated through five data sets. In each case, the emergent visible heterogeneity is an organization of discovered knowledge.
format Online
Article
Text
id pubmed-6006982
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60069822018-06-25 From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching Fushing, Hsieh Liu, Shan-Yu Hsieh, Yin-Chen McCowan, Brenda PLoS One Research Article Data generated from a system of interest typically consists of measurements on many covariate features and possibly multiple response features across all subjects in a designated ensemble. Such data is naturally represented by one response-matrix against one covariate-matrix. A matrix lattice is an advantageous platform for simultaneously accommodating heterogeneous data types: continuous, discrete and categorical, and exploring hidden dependency among/between features and subjects. After each feature being individually renormalized with respect to its own histogram, the categorical version of mutual conditional entropy is evaluated for all pairs of response and covariate features according to the combinatorial information theory. Then, by applying Data Could Geometry (DCG) algorithmic computations on such a mutual conditional entropy matrix, multiple synergistic feature-groups are partitioned. Distinct synergistic feature-groups embrace distinct structures of dependency. The explicit details of dependency among members of synergistic features are seen through mutliscale compositions of blocks computed by a computing paradigm called Data Mechanics. We then propose a categorical pattern matching approach to establish a directed associative linkage: from the patterned response dependency to serial structured covariate dependency. The graphic display of such a directed associative linkage is termed an information flow and the degrees of association are evaluated via tree-to-tree mutual conditional entropy. This new universal way of discovering system knowledge is illustrated through five data sets. In each case, the emergent visible heterogeneity is an organization of discovered knowledge. Public Library of Science 2018-06-14 /pmc/articles/PMC6006982/ /pubmed/29902187 http://dx.doi.org/10.1371/journal.pone.0198253 Text en © 2018 Fushing et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Fushing, Hsieh
Liu, Shan-Yu
Hsieh, Yin-Chen
McCowan, Brenda
From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title_full From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title_fullStr From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title_full_unstemmed From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title_short From patterned response dependency to structured covariate dependency: Entropy based categorical-pattern-matching
title_sort from patterned response dependency to structured covariate dependency: entropy based categorical-pattern-matching
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6006982/
https://www.ncbi.nlm.nih.gov/pubmed/29902187
http://dx.doi.org/10.1371/journal.pone.0198253
work_keys_str_mv AT fushinghsieh frompatternedresponsedependencytostructuredcovariatedependencyentropybasedcategoricalpatternmatching
AT liushanyu frompatternedresponsedependencytostructuredcovariatedependencyentropybasedcategoricalpatternmatching
AT hsiehyinchen frompatternedresponsedependencytostructuredcovariatedependencyentropybasedcategoricalpatternmatching
AT mccowanbrenda frompatternedresponsedependencytostructuredcovariatedependencyentropybasedcategoricalpatternmatching