Cargando…

Model misspecification and bias for inverse probability weighting estimators of average causal effects

Commonly used semiparametric estimators of causal effects specify parametric models for the propensity score (PS) and the conditional outcome. An example is an augmented inverse probability weighting (IPW) estimator, frequently referred to as a doubly robust estimator, because it is consistent if at...

Descripción completa

Detalles Bibliográficos
Autores principales:	Waernbaum, Ingeborg, Pazzagli, Laura
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2022
Materias:	Inference
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087564/ https://www.ncbi.nlm.nih.gov/pubmed/36045099 http://dx.doi.org/10.1002/bimj.202100118

_version_	1785022377102409728
author	Waernbaum, Ingeborg Pazzagli, Laura
author_facet	Waernbaum, Ingeborg Pazzagli, Laura
author_sort	Waernbaum, Ingeborg
collection	PubMed
description	Commonly used semiparametric estimators of causal effects specify parametric models for the propensity score (PS) and the conditional outcome. An example is an augmented inverse probability weighting (IPW) estimator, frequently referred to as a doubly robust estimator, because it is consistent if at least one of the two models is correctly specified. However, in many observational studies, the role of the parametric models is often not to provide a representation of the data‐generating process but rather to facilitate the adjustment for confounding, making the assumption of at least one true model unlikely to hold. In this paper, we propose a crude analytical approach to study the large‐sample bias of estimators when the models are assumed to be approximations of the data‐generating process, namely, when all models are misspecified. We apply our approach to three prototypical estimators of the average causal effect, two IPW estimators, using a misspecified PS model, and an augmented IPW (AIPW) estimator, using misspecified models for the outcome regression (OR) and the PS. For the two IPW estimators, we show that normalization, in addition to having a smaller variance, also offers some protection against bias due to model misspecification. To analyze the question of when the use of two misspecified models is better than one we derive necessary and sufficient conditions for when the AIPW estimator has a smaller bias than a simple IPW estimator and when it has a smaller bias than an IPW estimator with normalized weights. If the misspecification of the outcome model is moderate, the comparisons of the biases of the IPW and AIPW estimators show that the AIPW estimator has a smaller bias than the IPW estimators. However, all biases include a scaling with the PS‐model error and we suggest caution in modeling the PS whenever such a model is involved. For numerical and finite sample illustrations, we include three simulation studies and corresponding approximations of the large‐sample biases. In a dataset from the National Health and Nutrition Examination Survey, we estimate the effect of smoking on blood lead levels.
format	Online Article Text
id	pubmed-10087564
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-100875642023-04-12 Model misspecification and bias for inverse probability weighting estimators of average causal effects Waernbaum, Ingeborg Pazzagli, Laura Biom J Inference Commonly used semiparametric estimators of causal effects specify parametric models for the propensity score (PS) and the conditional outcome. An example is an augmented inverse probability weighting (IPW) estimator, frequently referred to as a doubly robust estimator, because it is consistent if at least one of the two models is correctly specified. However, in many observational studies, the role of the parametric models is often not to provide a representation of the data‐generating process but rather to facilitate the adjustment for confounding, making the assumption of at least one true model unlikely to hold. In this paper, we propose a crude analytical approach to study the large‐sample bias of estimators when the models are assumed to be approximations of the data‐generating process, namely, when all models are misspecified. We apply our approach to three prototypical estimators of the average causal effect, two IPW estimators, using a misspecified PS model, and an augmented IPW (AIPW) estimator, using misspecified models for the outcome regression (OR) and the PS. For the two IPW estimators, we show that normalization, in addition to having a smaller variance, also offers some protection against bias due to model misspecification. To analyze the question of when the use of two misspecified models is better than one we derive necessary and sufficient conditions for when the AIPW estimator has a smaller bias than a simple IPW estimator and when it has a smaller bias than an IPW estimator with normalized weights. If the misspecification of the outcome model is moderate, the comparisons of the biases of the IPW and AIPW estimators show that the AIPW estimator has a smaller bias than the IPW estimators. However, all biases include a scaling with the PS‐model error and we suggest caution in modeling the PS whenever such a model is involved. For numerical and finite sample illustrations, we include three simulation studies and corresponding approximations of the large‐sample biases. In a dataset from the National Health and Nutrition Examination Survey, we estimate the effect of smoking on blood lead levels. John Wiley and Sons Inc. 2022-08-31 2023-02 /pmc/articles/PMC10087564/ /pubmed/36045099 http://dx.doi.org/10.1002/bimj.202100118 Text en © 2022 The Authors. Biometrical Journal published by Wiley‐VCH GmbH. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Inference Waernbaum, Ingeborg Pazzagli, Laura Model misspecification and bias for inverse probability weighting estimators of average causal effects
title	Model misspecification and bias for inverse probability weighting estimators of average causal effects
title_full	Model misspecification and bias for inverse probability weighting estimators of average causal effects
title_fullStr	Model misspecification and bias for inverse probability weighting estimators of average causal effects
title_full_unstemmed	Model misspecification and bias for inverse probability weighting estimators of average causal effects
title_short	Model misspecification and bias for inverse probability weighting estimators of average causal effects
title_sort	model misspecification and bias for inverse probability weighting estimators of average causal effects
topic	Inference
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087564/ https://www.ncbi.nlm.nih.gov/pubmed/36045099 http://dx.doi.org/10.1002/bimj.202100118
work_keys_str_mv	AT waernbaumingeborg modelmisspecificationandbiasforinverseprobabilityweightingestimatorsofaveragecausaleffects AT pazzaglilaura modelmisspecificationandbiasforinverseprobabilityweightingestimatorsofaveragecausaleffects

Model misspecification and bias for inverse probability weighting estimators of average causal effects

Ejemplares similares