Cargando…

Directed acyclic graphs and causal thinking in clinical risk prediction modeling

BACKGROUND: In epidemiology, causal inference and prediction modeling methodologies have been historically distinct. Directed Acyclic Graphs (DAGs) are used to model a priori causal assumptions and inform variable selection strategies for causal questions. Although tools originally designed for pred...

Descripción completa

Detalles Bibliográficos
Autores principales: Piccininni, Marco, Konigorski, Stefan, Rohmann, Jessica L., Kurth, Tobias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331263/
https://www.ncbi.nlm.nih.gov/pubmed/32615926
http://dx.doi.org/10.1186/s12874-020-01058-z
_version_ 1783553290078781440
author Piccininni, Marco
Konigorski, Stefan
Rohmann, Jessica L.
Kurth, Tobias
author_facet Piccininni, Marco
Konigorski, Stefan
Rohmann, Jessica L.
Kurth, Tobias
author_sort Piccininni, Marco
collection PubMed
description BACKGROUND: In epidemiology, causal inference and prediction modeling methodologies have been historically distinct. Directed Acyclic Graphs (DAGs) are used to model a priori causal assumptions and inform variable selection strategies for causal questions. Although tools originally designed for prediction are finding applications in causal inference, the counterpart has remained largely unexplored. The aim of this theoretical and simulation-based study is to assess the potential benefit of using DAGs in clinical risk prediction modeling. METHODS: We explore how incorporating knowledge about the underlying causal structure can provide insights about the transportability of diagnostic clinical risk prediction models to different settings. We further probe whether causal knowledge can be used to improve predictor selection in clinical risk prediction models. RESULTS: A single-predictor model in the causal direction is likely to have better transportability than one in the anticausal direction in some scenarios. We empirically show that the Markov Blanket, the set of variables including the parents, children, and parents of the children of the outcome node in a DAG, is the optimal set of predictors for that outcome. CONCLUSIONS: Our findings provide a theoretical basis for the intuition that a diagnostic clinical risk prediction model including causes as predictors is likely to be more transportable. Furthermore, using DAGs to identify Markov Blanket variables may be a useful, efficient strategy to select predictors in clinical risk prediction models if strong knowledge of the underlying causal structure exists or can be learned.
format Online
Article
Text
id pubmed-7331263
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-73312632020-07-06 Directed acyclic graphs and causal thinking in clinical risk prediction modeling Piccininni, Marco Konigorski, Stefan Rohmann, Jessica L. Kurth, Tobias BMC Med Res Methodol Technical Advance BACKGROUND: In epidemiology, causal inference and prediction modeling methodologies have been historically distinct. Directed Acyclic Graphs (DAGs) are used to model a priori causal assumptions and inform variable selection strategies for causal questions. Although tools originally designed for prediction are finding applications in causal inference, the counterpart has remained largely unexplored. The aim of this theoretical and simulation-based study is to assess the potential benefit of using DAGs in clinical risk prediction modeling. METHODS: We explore how incorporating knowledge about the underlying causal structure can provide insights about the transportability of diagnostic clinical risk prediction models to different settings. We further probe whether causal knowledge can be used to improve predictor selection in clinical risk prediction models. RESULTS: A single-predictor model in the causal direction is likely to have better transportability than one in the anticausal direction in some scenarios. We empirically show that the Markov Blanket, the set of variables including the parents, children, and parents of the children of the outcome node in a DAG, is the optimal set of predictors for that outcome. CONCLUSIONS: Our findings provide a theoretical basis for the intuition that a diagnostic clinical risk prediction model including causes as predictors is likely to be more transportable. Furthermore, using DAGs to identify Markov Blanket variables may be a useful, efficient strategy to select predictors in clinical risk prediction models if strong knowledge of the underlying causal structure exists or can be learned. BioMed Central 2020-07-02 /pmc/articles/PMC7331263/ /pubmed/32615926 http://dx.doi.org/10.1186/s12874-020-01058-z Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Technical Advance
Piccininni, Marco
Konigorski, Stefan
Rohmann, Jessica L.
Kurth, Tobias
Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title_full Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title_fullStr Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title_full_unstemmed Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title_short Directed acyclic graphs and causal thinking in clinical risk prediction modeling
title_sort directed acyclic graphs and causal thinking in clinical risk prediction modeling
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331263/
https://www.ncbi.nlm.nih.gov/pubmed/32615926
http://dx.doi.org/10.1186/s12874-020-01058-z
work_keys_str_mv AT piccininnimarco directedacyclicgraphsandcausalthinkinginclinicalriskpredictionmodeling
AT konigorskistefan directedacyclicgraphsandcausalthinkinginclinicalriskpredictionmodeling
AT rohmannjessical directedacyclicgraphsandcausalthinkinginclinicalriskpredictionmodeling
AT kurthtobias directedacyclicgraphsandcausalthinkinginclinicalriskpredictionmodeling