Cargando…

Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data

(1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to d...

Descripción completa

Detalles Bibliográficos
Autores principales: Bottigliengo, Daniele, Lorenzoni, Giulia, Ocagli, Honoria, Martinato, Matteo, Berchialla, Paola, Gregori, Dario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8293809/
https://www.ncbi.nlm.nih.gov/pubmed/34206234
http://dx.doi.org/10.3390/ijerph18136694
_version_ 1783725121813348352
author Bottigliengo, Daniele
Lorenzoni, Giulia
Ocagli, Honoria
Martinato, Matteo
Berchialla, Paola
Gregori, Dario
author_facet Bottigliengo, Daniele
Lorenzoni, Giulia
Ocagli, Honoria
Martinato, Matteo
Berchialla, Paola
Gregori, Dario
author_sort Bottigliengo, Daniele
collection PubMed
description (1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to deal with missing data in propensity score analysis. (2) Methods: Methods that account for missing data during the estimation process and methods based on the imputation of missing values, such as multiple imputations, were considered. The methods were applied on the dataset of an ongoing prospective registry for the treatment of unprotected left main coronary artery disease. The performances were assessed in terms of the overall balance of baseline covariates. (3) Results: Methods that explicitly deal with missing data were superior to classical complete case analysis. The best balance was observed when propensity scores were estimated with a method that accounts for missing data using a stochastic approximation of the expectation-maximization algorithm. (4) Conclusions: If missing at random mechanism is plausible, methods that use missing data to estimate propensity score or impute them should be preferred. Sensitivity analyses are encouraged to evaluate the implications methods used to handle missing data and estimate propensity score.
format Online
Article
Text
id pubmed-8293809
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82938092021-07-22 Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data Bottigliengo, Daniele Lorenzoni, Giulia Ocagli, Honoria Martinato, Matteo Berchialla, Paola Gregori, Dario Int J Environ Res Public Health Article (1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to deal with missing data in propensity score analysis. (2) Methods: Methods that account for missing data during the estimation process and methods based on the imputation of missing values, such as multiple imputations, were considered. The methods were applied on the dataset of an ongoing prospective registry for the treatment of unprotected left main coronary artery disease. The performances were assessed in terms of the overall balance of baseline covariates. (3) Results: Methods that explicitly deal with missing data were superior to classical complete case analysis. The best balance was observed when propensity scores were estimated with a method that accounts for missing data using a stochastic approximation of the expectation-maximization algorithm. (4) Conclusions: If missing at random mechanism is plausible, methods that use missing data to estimate propensity score or impute them should be preferred. Sensitivity analyses are encouraged to evaluate the implications methods used to handle missing data and estimate propensity score. MDPI 2021-06-22 /pmc/articles/PMC8293809/ /pubmed/34206234 http://dx.doi.org/10.3390/ijerph18136694 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bottigliengo, Daniele
Lorenzoni, Giulia
Ocagli, Honoria
Martinato, Matteo
Berchialla, Paola
Gregori, Dario
Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title_full Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title_fullStr Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title_full_unstemmed Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title_short Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
title_sort propensity score analysis with partially observed baseline covariates: a practical comparison of methods for handling missing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8293809/
https://www.ncbi.nlm.nih.gov/pubmed/34206234
http://dx.doi.org/10.3390/ijerph18136694
work_keys_str_mv AT bottigliengodaniele propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata
AT lorenzonigiulia propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata
AT ocaglihonoria propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata
AT martinatomatteo propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata
AT berchiallapaola propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata
AT gregoridario propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata