Cargando…
Improving upon the efficiency of complete case analysis when covariates are MNAR
Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate valu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173105/ https://www.ncbi.nlm.nih.gov/pubmed/24907708 http://dx.doi.org/10.1093/biostatistics/kxu023 |
_version_ | 1782336134217465856 |
---|---|
author | Bartlett, Jonathan W. Carpenter, James R. Tilling, Kate Vansteelandt, Stijn |
author_facet | Bartlett, Jonathan W. Carpenter, James R. Tilling, Kate Vansteelandt, Stijn |
author_sort | Bartlett, Jonathan W. |
collection | PubMed |
description | Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate values, these methods (as typically implemented) are valid under different missingness assumptions. In particular, CCA is valid under missing not at random (MNAR) mechanisms in which missingness in a covariate depends on the value of that covariate, but is conditionally independent of outcome. In this paper, we argue that in some settings such an assumption is more plausible than the missing at random assumption underpinning most implementations of MI and IPW. When the former assumption holds, although CCA gives consistent estimates, it does not make use of all observed information. We therefore propose an augmented CCA approach which makes the same conditional independence assumption for missingness as CCA, but which improves efficiency through specification of an additional model for the probability of missingness, given the fully observed variables. The new method is evaluated using simulations and illustrated through application to data on reported alcohol consumption and blood pressure from the US National Health and Nutrition Examination Survey, in which data are likely MNAR independent of outcome. |
format | Online Article Text |
id | pubmed-4173105 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41731052014-09-25 Improving upon the efficiency of complete case analysis when covariates are MNAR Bartlett, Jonathan W. Carpenter, James R. Tilling, Kate Vansteelandt, Stijn Biostatistics Articles Missing values in covariates of regression models are a pervasive problem in empirical research. Popular approaches for analyzing partially observed datasets include complete case analysis (CCA), multiple imputation (MI), and inverse probability weighting (IPW). In the case of missing covariate values, these methods (as typically implemented) are valid under different missingness assumptions. In particular, CCA is valid under missing not at random (MNAR) mechanisms in which missingness in a covariate depends on the value of that covariate, but is conditionally independent of outcome. In this paper, we argue that in some settings such an assumption is more plausible than the missing at random assumption underpinning most implementations of MI and IPW. When the former assumption holds, although CCA gives consistent estimates, it does not make use of all observed information. We therefore propose an augmented CCA approach which makes the same conditional independence assumption for missingness as CCA, but which improves efficiency through specification of an additional model for the probability of missingness, given the fully observed variables. The new method is evaluated using simulations and illustrated through application to data on reported alcohol consumption and blood pressure from the US National Health and Nutrition Examination Survey, in which data are likely MNAR independent of outcome. Oxford University Press 2014-10 2014-06-06 /pmc/articles/PMC4173105/ /pubmed/24907708 http://dx.doi.org/10.1093/biostatistics/kxu023 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Bartlett, Jonathan W. Carpenter, James R. Tilling, Kate Vansteelandt, Stijn Improving upon the efficiency of complete case analysis when covariates are MNAR |
title | Improving upon the efficiency of complete case analysis when
covariates are MNAR |
title_full | Improving upon the efficiency of complete case analysis when
covariates are MNAR |
title_fullStr | Improving upon the efficiency of complete case analysis when
covariates are MNAR |
title_full_unstemmed | Improving upon the efficiency of complete case analysis when
covariates are MNAR |
title_short | Improving upon the efficiency of complete case analysis when
covariates are MNAR |
title_sort | improving upon the efficiency of complete case analysis when
covariates are mnar |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173105/ https://www.ncbi.nlm.nih.gov/pubmed/24907708 http://dx.doi.org/10.1093/biostatistics/kxu023 |
work_keys_str_mv | AT bartlettjonathanw improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar AT carpenterjamesr improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar AT tillingkate improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar AT vansteelandtstijn improvingupontheefficiencyofcompletecaseanalysiswhencovariatesaremnar |