Cargando…

Heterogeneous Graphical Granger Causality by Minimum Message Length

The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Hlaváčková-Schindler, Kateřina, Plant, Claudia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763266/
https://www.ncbi.nlm.nih.gov/pubmed/33322439
http://dx.doi.org/10.3390/e22121400
_version_ 1783627977578250240
author Hlaváčková-Schindler, Kateřina
Plant, Claudia
author_facet Hlaváčková-Schindler, Kateřina
Plant, Claudia
author_sort Hlaváčková-Schindler, Kateřina
collection PubMed
description The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the case of “short” time series, the inference in HGGM often suffers from overestimation. To remedy this, we use the minimum message length principle (MML) to determinate the causal connections in the HGGM. The minimum message length as a Bayesian information-theoretic method for statistical model selection applies Occam’s razor in the following way: even when models are equal in their measure of fit-accuracy to the observed data, the one generating the most concise explanation of data is more likely to be correct. Based on the dispersion coefficient of the target time series and on the initial maximum likelihood estimates of the regression coefficients, we propose a minimum message length criterion to select the subset of causally connected time series with each target time series and derive its form for various exponential distributions. We propose two algorithms—the genetic-type algorithm (HMMLGA) and exHMML to find the subset. We demonstrated the superiority of both algorithms in synthetic experiments with respect to the comparison methods Lingam, HGGM and statistical framework Granger causality (SFGC). In the real data experiments, we used the methods to discriminate between pregnancy and labor phase using electrohysterogram data of Islandic mothers from Physionet databasis. We further analysed the Austrian climatological time measurements and their temporal interactions in rain and sunny days scenarios. In both experiments, the results of HMMLGA had the most realistic interpretation with respect to the comparison methods. We provide our code in Matlab. To our best knowledge, this is the first work using the MML principle for causal inference in HGGM.
format Online
Article
Text
id pubmed-7763266
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77632662021-02-24 Heterogeneous Graphical Granger Causality by Minimum Message Length Hlaváčková-Schindler, Kateřina Plant, Claudia Entropy (Basel) Article The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the case of “short” time series, the inference in HGGM often suffers from overestimation. To remedy this, we use the minimum message length principle (MML) to determinate the causal connections in the HGGM. The minimum message length as a Bayesian information-theoretic method for statistical model selection applies Occam’s razor in the following way: even when models are equal in their measure of fit-accuracy to the observed data, the one generating the most concise explanation of data is more likely to be correct. Based on the dispersion coefficient of the target time series and on the initial maximum likelihood estimates of the regression coefficients, we propose a minimum message length criterion to select the subset of causally connected time series with each target time series and derive its form for various exponential distributions. We propose two algorithms—the genetic-type algorithm (HMMLGA) and exHMML to find the subset. We demonstrated the superiority of both algorithms in synthetic experiments with respect to the comparison methods Lingam, HGGM and statistical framework Granger causality (SFGC). In the real data experiments, we used the methods to discriminate between pregnancy and labor phase using electrohysterogram data of Islandic mothers from Physionet databasis. We further analysed the Austrian climatological time measurements and their temporal interactions in rain and sunny days scenarios. In both experiments, the results of HMMLGA had the most realistic interpretation with respect to the comparison methods. We provide our code in Matlab. To our best knowledge, this is the first work using the MML principle for causal inference in HGGM. MDPI 2020-12-11 /pmc/articles/PMC7763266/ /pubmed/33322439 http://dx.doi.org/10.3390/e22121400 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hlaváčková-Schindler, Kateřina
Plant, Claudia
Heterogeneous Graphical Granger Causality by Minimum Message Length
title Heterogeneous Graphical Granger Causality by Minimum Message Length
title_full Heterogeneous Graphical Granger Causality by Minimum Message Length
title_fullStr Heterogeneous Graphical Granger Causality by Minimum Message Length
title_full_unstemmed Heterogeneous Graphical Granger Causality by Minimum Message Length
title_short Heterogeneous Graphical Granger Causality by Minimum Message Length
title_sort heterogeneous graphical granger causality by minimum message length
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763266/
https://www.ncbi.nlm.nih.gov/pubmed/33322439
http://dx.doi.org/10.3390/e22121400
work_keys_str_mv AT hlavackovaschindlerkaterina heterogeneousgraphicalgrangercausalitybyminimummessagelength
AT plantclaudia heterogeneousgraphicalgrangercausalitybyminimummessagelength