Cargando…
Heterogeneous Graphical Granger Causality by Minimum Message Length
The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763266/ https://www.ncbi.nlm.nih.gov/pubmed/33322439 http://dx.doi.org/10.3390/e22121400 |
_version_ | 1783627977578250240 |
---|---|
author | Hlaváčková-Schindler, Kateřina Plant, Claudia |
author_facet | Hlaváčková-Schindler, Kateřina Plant, Claudia |
author_sort | Hlaváčková-Schindler, Kateřina |
collection | PubMed |
description | The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the case of “short” time series, the inference in HGGM often suffers from overestimation. To remedy this, we use the minimum message length principle (MML) to determinate the causal connections in the HGGM. The minimum message length as a Bayesian information-theoretic method for statistical model selection applies Occam’s razor in the following way: even when models are equal in their measure of fit-accuracy to the observed data, the one generating the most concise explanation of data is more likely to be correct. Based on the dispersion coefficient of the target time series and on the initial maximum likelihood estimates of the regression coefficients, we propose a minimum message length criterion to select the subset of causally connected time series with each target time series and derive its form for various exponential distributions. We propose two algorithms—the genetic-type algorithm (HMMLGA) and exHMML to find the subset. We demonstrated the superiority of both algorithms in synthetic experiments with respect to the comparison methods Lingam, HGGM and statistical framework Granger causality (SFGC). In the real data experiments, we used the methods to discriminate between pregnancy and labor phase using electrohysterogram data of Islandic mothers from Physionet databasis. We further analysed the Austrian climatological time measurements and their temporal interactions in rain and sunny days scenarios. In both experiments, the results of HMMLGA had the most realistic interpretation with respect to the comparison methods. We provide our code in Matlab. To our best knowledge, this is the first work using the MML principle for causal inference in HGGM. |
format | Online Article Text |
id | pubmed-7763266 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-77632662021-02-24 Heterogeneous Graphical Granger Causality by Minimum Message Length Hlaváčková-Schindler, Kateřina Plant, Claudia Entropy (Basel) Article The heterogeneous graphical Granger model (HGGM) for causal inference among processes with distributions from an exponential family is efficient in scenarios when the number of time observations is much greater than the number of time series, normally by several orders of magnitude. However, in the case of “short” time series, the inference in HGGM often suffers from overestimation. To remedy this, we use the minimum message length principle (MML) to determinate the causal connections in the HGGM. The minimum message length as a Bayesian information-theoretic method for statistical model selection applies Occam’s razor in the following way: even when models are equal in their measure of fit-accuracy to the observed data, the one generating the most concise explanation of data is more likely to be correct. Based on the dispersion coefficient of the target time series and on the initial maximum likelihood estimates of the regression coefficients, we propose a minimum message length criterion to select the subset of causally connected time series with each target time series and derive its form for various exponential distributions. We propose two algorithms—the genetic-type algorithm (HMMLGA) and exHMML to find the subset. We demonstrated the superiority of both algorithms in synthetic experiments with respect to the comparison methods Lingam, HGGM and statistical framework Granger causality (SFGC). In the real data experiments, we used the methods to discriminate between pregnancy and labor phase using electrohysterogram data of Islandic mothers from Physionet databasis. We further analysed the Austrian climatological time measurements and their temporal interactions in rain and sunny days scenarios. In both experiments, the results of HMMLGA had the most realistic interpretation with respect to the comparison methods. We provide our code in Matlab. To our best knowledge, this is the first work using the MML principle for causal inference in HGGM. MDPI 2020-12-11 /pmc/articles/PMC7763266/ /pubmed/33322439 http://dx.doi.org/10.3390/e22121400 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Hlaváčková-Schindler, Kateřina Plant, Claudia Heterogeneous Graphical Granger Causality by Minimum Message Length |
title | Heterogeneous Graphical Granger Causality by Minimum Message Length |
title_full | Heterogeneous Graphical Granger Causality by Minimum Message Length |
title_fullStr | Heterogeneous Graphical Granger Causality by Minimum Message Length |
title_full_unstemmed | Heterogeneous Graphical Granger Causality by Minimum Message Length |
title_short | Heterogeneous Graphical Granger Causality by Minimum Message Length |
title_sort | heterogeneous graphical granger causality by minimum message length |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7763266/ https://www.ncbi.nlm.nih.gov/pubmed/33322439 http://dx.doi.org/10.3390/e22121400 |
work_keys_str_mv | AT hlavackovaschindlerkaterina heterogeneousgraphicalgrangercausalitybyminimummessagelength AT plantclaudia heterogeneousgraphicalgrangercausalitybyminimummessagelength |