Cargando…

Improving upon the efficiency of complete case analysis when covariates are MNAR

Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate valu...

Descripción completa

Detalles Bibliográficos
Autores principales: Bartlett, Jonathan W., Carpenter, James R., Tilling, Kate, Vansteelandt, Stijn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173105/
https://www.ncbi.nlm.nih.gov/pubmed/24907708
http://dx.doi.org/10.1093/biostatistics/kxu023
_version_ 1782336134217465856
author Bartlett, Jonathan W.
Carpenter, James R.
Tilling, Kate
Vansteelandt, Stijn
author_facet Bartlett, Jonathan W.
Carpenter, James R.
Tilling, Kate
Vansteelandt, Stijn
author_sort Bartlett, Jonathan W.
collection PubMed
description Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate values, these methods (as typically implemented) are valid under different missingness assumptions. In particular, CCA is valid under missing not at random (MNAR) mechanisms in which missingness in a covariate depends on the value of that covariate, but is conditionally independent of outcome. In this paper, we argue that in some settings such an assumption is more plausible than the missing at random assumption underpinning most implementations of MI and IPW. When the former assumption holds, although CCA gives consistent estimates, it does not make use of all observed information. We therefore propose an augmented CCA approach which makes the same conditional independence assumption for missingness as CCA, but which improves efficiency through specification of an additional model for the probability of missingness, given the fully observed variables. The new method is evaluated using simulations and illustrated through application to data on reported alcohol consumption and blood pressure from the US National Health and Nutrition Examination Survey, in which data are likely MNAR independent of outcome.
format Online
Article
Text
id pubmed-4173105
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41731052014-09-25 Improving upon the efficiency of complete case analysis when covariates are MNAR Bartlett, Jonathan W. Carpenter, James R. Tilling, Kate Vansteelandt, Stijn Biostatistics Articles Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate values, these methods (as typically implemented) are valid under different missingness assumptions. In particular, CCA is valid under missing not at random (MNAR) mechanisms in which missingness in a covariate depends on the value of that covariate, but is conditionally independent of outcome. In this paper, we argue that in some settings such an assumption is more plausible than the missing at random assumption underpinning most implementations of MI and IPW. When the former assumption holds, although CCA gives consistent estimates, it does not make use of all observed information. We therefore propose an augmented CCA approach which makes the same conditional independence assumption for missingness as CCA, but which improves efficiency through specification of an additional model for the probability of missingness, given the fully observed variables. The new method is evaluated using simulations and illustrated through application to data on reported alcohol consumption and blood pressure from the US National Health and Nutrition Examination Survey, in which data are likely MNAR independent of outcome. Oxford University Press 2014-10 2014-06-06 /pmc/articles/PMC4173105/ /pubmed/24907708 http://dx.doi.org/10.1093/biostatistics/kxu023 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Bartlett, Jonathan W.
Carpenter, James R.
Tilling, Kate
Vansteelandt, Stijn
Improving upon the efficiency of complete case analysis when covariates are MNAR
title Improving upon the efficiency of complete case analysis when covariates are MNAR
title_full Improving upon the efficiency of complete case analysis when covariates are MNAR
title_fullStr Improving upon the efficiency of complete case analysis when covariates are MNAR
title_full_unstemmed Improving upon the efficiency of complete case analysis when covariates are MNAR
title_short Improving upon the efficiency of complete case analysis when covariates are MNAR
title_sort improving upon the efficiency of complete case analysis when covariates are mnar
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173105/
https://www.ncbi.nlm.nih.gov/pubmed/24907708
http://dx.doi.org/10.1093/biostatistics/kxu023
work_keys_str_mv AT bartlettjonathanw improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar
AT carpenterjamesr improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar
AT tillingkate improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar
AT vansteelandtstijn improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar