Cargando…
Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure
BACKGROUND: Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates. METHOD: Several approaches have been proposed...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7318364/ https://www.ncbi.nlm.nih.gov/pubmed/32586271 http://dx.doi.org/10.1186/s12874-020-01053-4 |
_version_ | 1783550831868510208 |
---|---|
author | Coffman, Donna L. Zhou, Jiangxiu Cai, Xizhen |
author_facet | Coffman, Donna L. Zhou, Jiangxiu Cai, Xizhen |
author_sort | Coffman, Donna L. |
collection | PubMed |
description | BACKGROUND: Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates. METHOD: Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, there are other potentially useful approaches that have not been evaluated, including single imputation (SI) + prediction error (PE), SI + PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), which is a nonparametric approach for estimating propensity scores in which missing values are automatically handled in the estimation using a surrogate split method. To evaluate the performance of these approaches, a simulation study was conducted. RESULTS: Results suggested that SI + PE, SI + PE + PU, MI, and MIMP perform almost equally well and better than treatment mean imputation and GBM in terms of bias; however, MI and MIMP account for the additional uncertainty of imputing the missingness. CONCLUSIONS: Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended. |
format | Online Article Text |
id | pubmed-7318364 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-73183642020-06-29 Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure Coffman, Donna L. Zhou, Jiangxiu Cai, Xizhen BMC Med Res Methodol Research Article BACKGROUND: Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates. METHOD: Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, there are other potentially useful approaches that have not been evaluated, including single imputation (SI) + prediction error (PE), SI + PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), which is a nonparametric approach for estimating propensity scores in which missing values are automatically handled in the estimation using a surrogate split method. To evaluate the performance of these approaches, a simulation study was conducted. RESULTS: Results suggested that SI + PE, SI + PE + PU, MI, and MIMP perform almost equally well and better than treatment mean imputation and GBM in terms of bias; however, MI and MIMP account for the additional uncertainty of imputing the missingness. CONCLUSIONS: Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended. BioMed Central 2020-06-26 /pmc/articles/PMC7318364/ /pubmed/32586271 http://dx.doi.org/10.1186/s12874-020-01053-4 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Coffman, Donna L. Zhou, Jiangxiu Cai, Xizhen Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title | Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title_full | Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title_fullStr | Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title_full_unstemmed | Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title_short | Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
title_sort | comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7318364/ https://www.ncbi.nlm.nih.gov/pubmed/32586271 http://dx.doi.org/10.1186/s12874-020-01053-4 |
work_keys_str_mv | AT coffmandonnal comparisonofmethodsforhandlingcovariatemissingnessinpropensityscoreestimationwithabinaryexposure AT zhoujiangxiu comparisonofmethodsforhandlingcovariatemissingnessinpropensityscoreestimationwithabinaryexposure AT caixizhen comparisonofmethodsforhandlingcovariatemissingnessinpropensityscoreestimationwithabinaryexposure |