Cargando…

A Bayesian Model for Bivariate Causal Inference

We address the problem of two-variable causal inference without intervention. This task is to infer an existing causal relation between two random variables, i.e., [Formula: see text] or [Formula: see text] , from purely observational data. As the option to modify a potential cause is not given in m...

Descripción completa

Detalles Bibliográficos
Autores principales: Kurthen, Maximilian, Enßlin, Torsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516476/
https://www.ncbi.nlm.nih.gov/pubmed/33285821
http://dx.doi.org/10.3390/e22010046
_version_ 1783587010306375680
author Kurthen, Maximilian
Enßlin, Torsten
author_facet Kurthen, Maximilian
Enßlin, Torsten
author_sort Kurthen, Maximilian
collection PubMed
description We address the problem of two-variable causal inference without intervention. This task is to infer an existing causal relation between two random variables, i.e., [Formula: see text] or [Formula: see text] , from purely observational data. As the option to modify a potential cause is not given in many situations, only structural properties of the data can be used to solve this ill-posed problem. We briefly review a number of state-of-the-art methods for this, including very recent ones. A novel inference method is introduced, Bayesian Causal Inference (BCI) which assumes a generative Bayesian hierarchical model to pursue the strategy of Bayesian model selection. In the adopted model, the distribution of the cause variable is given by a Poisson lognormal distribution, which allows to explicitly regard the discrete nature of datasets, correlations in the parameter spaces, as well as the variance of probability densities on logarithmic scales. We assume Fourier diagonal Field covariance operators. The model itself is restricted to use cases where a direct causal relation [Formula: see text] has to be decided against a relation [Formula: see text] , therefore we compare it other methods for this exact problem setting. The generative model assumed provides synthetic causal data for benchmarking our model in comparison to existing state-of-the-art models, namely LiNGAM, ANM-HSIC, ANM-MML, IGCI, and CGNN. We explore how well the above methods perform in case of high noise settings, strongly discretized data, and very sparse data. BCI performs generally reliably with synthetic data as well as with the real world TCEP benchmark set, with an accuracy comparable to state-of-the-art algorithms. We discuss directions for the future development of BCI.
format Online
Article
Text
id pubmed-7516476
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75164762020-11-09 A Bayesian Model for Bivariate Causal Inference Kurthen, Maximilian Enßlin, Torsten Entropy (Basel) Article We address the problem of two-variable causal inference without intervention. This task is to infer an existing causal relation between two random variables, i.e., [Formula: see text] or [Formula: see text] , from purely observational data. As the option to modify a potential cause is not given in many situations, only structural properties of the data can be used to solve this ill-posed problem. We briefly review a number of state-of-the-art methods for this, including very recent ones. A novel inference method is introduced, Bayesian Causal Inference (BCI) which assumes a generative Bayesian hierarchical model to pursue the strategy of Bayesian model selection. In the adopted model, the distribution of the cause variable is given by a Poisson lognormal distribution, which allows to explicitly regard the discrete nature of datasets, correlations in the parameter spaces, as well as the variance of probability densities on logarithmic scales. We assume Fourier diagonal Field covariance operators. The model itself is restricted to use cases where a direct causal relation [Formula: see text] has to be decided against a relation [Formula: see text] , therefore we compare it other methods for this exact problem setting. The generative model assumed provides synthetic causal data for benchmarking our model in comparison to existing state-of-the-art models, namely LiNGAM, ANM-HSIC, ANM-MML, IGCI, and CGNN. We explore how well the above methods perform in case of high noise settings, strongly discretized data, and very sparse data. BCI performs generally reliably with synthetic data as well as with the real world TCEP benchmark set, with an accuracy comparable to state-of-the-art algorithms. We discuss directions for the future development of BCI. MDPI 2019-12-29 /pmc/articles/PMC7516476/ /pubmed/33285821 http://dx.doi.org/10.3390/e22010046 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kurthen, Maximilian
Enßlin, Torsten
A Bayesian Model for Bivariate Causal Inference
title A Bayesian Model for Bivariate Causal Inference
title_full A Bayesian Model for Bivariate Causal Inference
title_fullStr A Bayesian Model for Bivariate Causal Inference
title_full_unstemmed A Bayesian Model for Bivariate Causal Inference
title_short A Bayesian Model for Bivariate Causal Inference
title_sort bayesian model for bivariate causal inference
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516476/
https://www.ncbi.nlm.nih.gov/pubmed/33285821
http://dx.doi.org/10.3390/e22010046
work_keys_str_mv AT kurthenmaximilian abayesianmodelforbivariatecausalinference
AT enßlintorsten abayesianmodelforbivariatecausalinference
AT kurthenmaximilian bayesianmodelforbivariatecausalinference
AT enßlintorsten bayesianmodelforbivariatecausalinference