Cargando…

A comparison of different methods to handle missing data in the context of propensity score analysis

Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity methods is missing values in confounders. Several strategies for handling missing values exist, but guidance in choosing the best method is needed. In this simulation study, w...

Descripción completa

Detalles Bibliográficos
Autores principales:	Choi, Jungyeon, Dekkers, Olaf M., le Cessie, Saskia
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Netherlands 2018
Materias:	Methods
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325992/ https://www.ncbi.nlm.nih.gov/pubmed/30341708 http://dx.doi.org/10.1007/s10654-018-0447-z

_version_	1783386223190999040
author	Choi, Jungyeon Dekkers, Olaf M. le Cessie, Saskia
author_facet	Choi, Jungyeon Dekkers, Olaf M. le Cessie, Saskia
author_sort	Choi, Jungyeon
collection	PubMed
description	Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity methods is missing values in confounders. Several strategies for handling missing values exist, but guidance in choosing the best method is needed. In this simulation study, we compared four strategies of handling missing covariate values in propensity matching and propensity weighting. These methods include: complete case analysis, missing indicator method, multiple imputation and combining multiple imputation and missing indicator method. Concurrently, we aimed to provide guidance in choosing the optimal strategy. Simulated scenarios varied regarding missing mechanism, presence of effect modification or unmeasured confounding. Additionally, we demonstrated how missingness graphs help clarifying the missing structure. When no effect modification existed, complete case analysis yielded valid causal treatment effects even when data were missing not at random. In some situations, complete case analysis was also able to partially correct for unmeasured confounding. Multiple imputation worked well if the data were missing (completely) at random, and if the imputation model was correctly specified. In the presence of effect modification, more complex imputation models than default options of commonly used statistical software were required. Multiple imputation may fail when data are missing not at random. Here, combining multiple imputation and the missing indicator method reduced the bias as the missing indicator variable can be a proxy for unobserved confounding. The optimal way to handle missing values in covariates of propensity score models depends on the missing data structure and the presence of effect modification. When effect modification is present, default settings of imputation methods may yield biased results even if data are missing at random.
format	Online Article Text
id	pubmed-6325992
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Springer Netherlands
record_format	MEDLINE/PubMed
spelling	pubmed-63259922019-01-23 A comparison of different methods to handle missing data in the context of propensity score analysis Choi, Jungyeon Dekkers, Olaf M. le Cessie, Saskia Eur J Epidemiol Methods Propensity score analysis is a popular method to control for confounding in observational studies. A challenge in propensity methods is missing values in confounders. Several strategies for handling missing values exist, but guidance in choosing the best method is needed. In this simulation study, we compared four strategies of handling missing covariate values in propensity matching and propensity weighting. These methods include: complete case analysis, missing indicator method, multiple imputation and combining multiple imputation and missing indicator method. Concurrently, we aimed to provide guidance in choosing the optimal strategy. Simulated scenarios varied regarding missing mechanism, presence of effect modification or unmeasured confounding. Additionally, we demonstrated how missingness graphs help clarifying the missing structure. When no effect modification existed, complete case analysis yielded valid causal treatment effects even when data were missing not at random. In some situations, complete case analysis was also able to partially correct for unmeasured confounding. Multiple imputation worked well if the data were missing (completely) at random, and if the imputation model was correctly specified. In the presence of effect modification, more complex imputation models than default options of commonly used statistical software were required. Multiple imputation may fail when data are missing not at random. Here, combining multiple imputation and the missing indicator method reduced the bias as the missing indicator variable can be a proxy for unobserved confounding. The optimal way to handle missing values in covariates of propensity score models depends on the missing data structure and the presence of effect modification. When effect modification is present, default settings of imputation methods may yield biased results even if data are missing at random. Springer Netherlands 2018-10-19 2019 /pmc/articles/PMC6325992/ /pubmed/30341708 http://dx.doi.org/10.1007/s10654-018-0447-z Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Methods Choi, Jungyeon Dekkers, Olaf M. le Cessie, Saskia A comparison of different methods to handle missing data in the context of propensity score analysis
title	A comparison of different methods to handle missing data in the context of propensity score analysis
title_full	A comparison of different methods to handle missing data in the context of propensity score analysis
title_fullStr	A comparison of different methods to handle missing data in the context of propensity score analysis
title_full_unstemmed	A comparison of different methods to handle missing data in the context of propensity score analysis
title_short	A comparison of different methods to handle missing data in the context of propensity score analysis
title_sort	comparison of different methods to handle missing data in the context of propensity score analysis
topic	Methods
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325992/ https://www.ncbi.nlm.nih.gov/pubmed/30341708 http://dx.doi.org/10.1007/s10654-018-0447-z
work_keys_str_mv	AT choijungyeon acomparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis AT dekkersolafm acomparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis AT lecessiesaskia acomparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis AT choijungyeon comparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis AT dekkersolafm comparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis AT lecessiesaskia comparisonofdifferentmethodstohandlemissingdatainthecontextofpropensityscoreanalysis

A comparison of different methods to handle missing data in the context of propensity score analysis

Ejemplares similares