Cargando…

Formulating causal questions and principled statistical answers

Although review papers on causal inference methods are now available, there is a lack of introductory overviews on what they can render and on the guiding criteria for choosing one particular method. This tutorial gives an overview in situations where an exposure of interest is set at a chosen basel...

Descripción completa

Detalles Bibliográficos
Autores principales: Goetghebeur, Els, le Cessie, Saskia, De Stavola, Bianca, Moodie, Erica EM, Waernbaum, Ingeborg
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756489/
https://www.ncbi.nlm.nih.gov/pubmed/32964526
http://dx.doi.org/10.1002/sim.8741
_version_ 1783626553362481152
author Goetghebeur, Els
le Cessie, Saskia
De Stavola, Bianca
Moodie, Erica EM
Waernbaum, Ingeborg
author_facet Goetghebeur, Els
le Cessie, Saskia
De Stavola, Bianca
Moodie, Erica EM
Waernbaum, Ingeborg
author_sort Goetghebeur, Els
collection PubMed
description Although review papers on causal inference methods are now available, there is a lack of introductory overviews on what they can render and on the guiding criteria for choosing one particular method. This tutorial gives an overview in situations where an exposure of interest is set at a chosen baseline (“point exposure”) and the target outcome arises at a later time point. We first phrase relevant causal questions and make a case for being specific about the possible exposure levels involved and the populations for which the question is relevant. Using the potential outcomes framework, we describe principled definitions of causal effects and of estimation approaches classified according to whether they invoke the no unmeasured confounding assumption (including outcome regression and propensity score‐based methods) or an instrumental variable with added assumptions. We mainly focus on continuous outcomes and causal average treatment effects. We discuss interpretation, challenges, and potential pitfalls and illustrate application using a “simulation learner,” that mimics the effect of various breastfeeding interventions on a child's later development. This involves a typical simulation component with generated exposure, covariate, and outcome data inspired by a randomized intervention study. The simulation learner further generates various (linked) exposure types with a set of possible values per observation unit, from which observed as well as potential outcome data are generated. It thus provides true values of several causal effects. R code for data generation and analysis is available on www.ofcaus.org, where SAS and Stata code for analysis is also provided.
format Online
Article
Text
id pubmed-7756489
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-77564892020-12-28 Formulating causal questions and principled statistical answers Goetghebeur, Els le Cessie, Saskia De Stavola, Bianca Moodie, Erica EM Waernbaum, Ingeborg Stat Med Tutorial in Biostatistics Although review papers on causal inference methods are now available, there is a lack of introductory overviews on what they can render and on the guiding criteria for choosing one particular method. This tutorial gives an overview in situations where an exposure of interest is set at a chosen baseline (“point exposure”) and the target outcome arises at a later time point. We first phrase relevant causal questions and make a case for being specific about the possible exposure levels involved and the populations for which the question is relevant. Using the potential outcomes framework, we describe principled definitions of causal effects and of estimation approaches classified according to whether they invoke the no unmeasured confounding assumption (including outcome regression and propensity score‐based methods) or an instrumental variable with added assumptions. We mainly focus on continuous outcomes and causal average treatment effects. We discuss interpretation, challenges, and potential pitfalls and illustrate application using a “simulation learner,” that mimics the effect of various breastfeeding interventions on a child's later development. This involves a typical simulation component with generated exposure, covariate, and outcome data inspired by a randomized intervention study. The simulation learner further generates various (linked) exposure types with a set of possible values per observation unit, from which observed as well as potential outcome data are generated. It thus provides true values of several causal effects. R code for data generation and analysis is available on www.ofcaus.org, where SAS and Stata code for analysis is also provided. John Wiley and Sons Inc. 2020-09-23 2020-12-30 /pmc/articles/PMC7756489/ /pubmed/32964526 http://dx.doi.org/10.1002/sim.8741 Text en © 2020 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Tutorial in Biostatistics
Goetghebeur, Els
le Cessie, Saskia
De Stavola, Bianca
Moodie, Erica EM
Waernbaum, Ingeborg
Formulating causal questions and principled statistical answers
title Formulating causal questions and principled statistical answers
title_full Formulating causal questions and principled statistical answers
title_fullStr Formulating causal questions and principled statistical answers
title_full_unstemmed Formulating causal questions and principled statistical answers
title_short Formulating causal questions and principled statistical answers
title_sort formulating causal questions and principled statistical answers
topic Tutorial in Biostatistics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756489/
https://www.ncbi.nlm.nih.gov/pubmed/32964526
http://dx.doi.org/10.1002/sim.8741
work_keys_str_mv AT goetghebeurels formulatingcausalquestionsandprincipledstatisticalanswers
AT lecessiesaskia formulatingcausalquestionsandprincipledstatisticalanswers
AT destavolabianca formulatingcausalquestionsandprincipledstatisticalanswers
AT moodieericaem formulatingcausalquestionsandprincipledstatisticalanswers
AT waernbaumingeborg formulatingcausalquestionsandprincipledstatisticalanswers
AT formulatingcausalquestionsandprincipledstatisticalanswers