Cargando…

Detecting and quantifying causal associations in large nonlinear time series datasets

Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often...

Descripción completa

Detalles Bibliográficos
Autores principales: Runge, Jakob, Nowack, Peer, Kretschmer, Marlene, Flaxman, Seth, Sejdinovic, Dino
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881151/
https://www.ncbi.nlm.nih.gov/pubmed/31807692
http://dx.doi.org/10.1126/sciadv.aau4996
_version_ 1783473888345194496
author Runge, Jakob
Nowack, Peer
Kretschmer, Marlene
Flaxman, Seth
Sejdinovic, Dino
author_facet Runge, Jakob
Nowack, Peer
Kretschmer, Marlene
Flaxman, Seth
Sejdinovic, Dino
author_sort Runge, Jakob
collection PubMed
description Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields.
format Online
Article
Text
id pubmed-6881151
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-68811512019-12-05 Detecting and quantifying causal associations in large nonlinear time series datasets Runge, Jakob Nowack, Peer Kretschmer, Marlene Flaxman, Seth Sejdinovic, Dino Sci Adv Research Articles Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields. American Association for the Advancement of Science 2019-11-27 /pmc/articles/PMC6881151/ /pubmed/31807692 http://dx.doi.org/10.1126/sciadv.aau4996 Text en Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY). http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Runge, Jakob
Nowack, Peer
Kretschmer, Marlene
Flaxman, Seth
Sejdinovic, Dino
Detecting and quantifying causal associations in large nonlinear time series datasets
title Detecting and quantifying causal associations in large nonlinear time series datasets
title_full Detecting and quantifying causal associations in large nonlinear time series datasets
title_fullStr Detecting and quantifying causal associations in large nonlinear time series datasets
title_full_unstemmed Detecting and quantifying causal associations in large nonlinear time series datasets
title_short Detecting and quantifying causal associations in large nonlinear time series datasets
title_sort detecting and quantifying causal associations in large nonlinear time series datasets
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881151/
https://www.ncbi.nlm.nih.gov/pubmed/31807692
http://dx.doi.org/10.1126/sciadv.aau4996
work_keys_str_mv AT rungejakob detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets
AT nowackpeer detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets
AT kretschmermarlene detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets
AT flaxmanseth detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets
AT sejdinovicdino detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets