Cargando…
Detecting and quantifying causal associations in large nonlinear time series datasets
Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Association for the Advancement of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881151/ https://www.ncbi.nlm.nih.gov/pubmed/31807692 http://dx.doi.org/10.1126/sciadv.aau4996 |
_version_ | 1783473888345194496 |
---|---|
author | Runge, Jakob Nowack, Peer Kretschmer, Marlene Flaxman, Seth Sejdinovic, Dino |
author_facet | Runge, Jakob Nowack, Peer Kretschmer, Marlene Flaxman, Seth Sejdinovic, Dino |
author_sort | Runge, Jakob |
collection | PubMed |
description | Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields. |
format | Online Article Text |
id | pubmed-6881151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | American Association for the Advancement of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-68811512019-12-05 Detecting and quantifying causal associations in large nonlinear time series datasets Runge, Jakob Nowack, Peer Kretschmer, Marlene Flaxman, Seth Sejdinovic, Dino Sci Adv Research Articles Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields. American Association for the Advancement of Science 2019-11-27 /pmc/articles/PMC6881151/ /pubmed/31807692 http://dx.doi.org/10.1126/sciadv.aau4996 Text en Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution License 4.0 (CC BY). http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Runge, Jakob Nowack, Peer Kretschmer, Marlene Flaxman, Seth Sejdinovic, Dino Detecting and quantifying causal associations in large nonlinear time series datasets |
title | Detecting and quantifying causal associations in large nonlinear time series datasets |
title_full | Detecting and quantifying causal associations in large nonlinear time series datasets |
title_fullStr | Detecting and quantifying causal associations in large nonlinear time series datasets |
title_full_unstemmed | Detecting and quantifying causal associations in large nonlinear time series datasets |
title_short | Detecting and quantifying causal associations in large nonlinear time series datasets |
title_sort | detecting and quantifying causal associations in large nonlinear time series datasets |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881151/ https://www.ncbi.nlm.nih.gov/pubmed/31807692 http://dx.doi.org/10.1126/sciadv.aau4996 |
work_keys_str_mv | AT rungejakob detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets AT nowackpeer detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets AT kretschmermarlene detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets AT flaxmanseth detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets AT sejdinovicdino detectingandquantifyingcausalassociationsinlargenonlineartimeseriesdatasets |