Cargando…
Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data
(1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to d...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8293809/ https://www.ncbi.nlm.nih.gov/pubmed/34206234 http://dx.doi.org/10.3390/ijerph18136694 |
_version_ | 1783725121813348352 |
---|---|
author | Bottigliengo, Daniele Lorenzoni, Giulia Ocagli, Honoria Martinato, Matteo Berchialla, Paola Gregori, Dario |
author_facet | Bottigliengo, Daniele Lorenzoni, Giulia Ocagli, Honoria Martinato, Matteo Berchialla, Paola Gregori, Dario |
author_sort | Bottigliengo, Daniele |
collection | PubMed |
description | (1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to deal with missing data in propensity score analysis. (2) Methods: Methods that account for missing data during the estimation process and methods based on the imputation of missing values, such as multiple imputations, were considered. The methods were applied on the dataset of an ongoing prospective registry for the treatment of unprotected left main coronary artery disease. The performances were assessed in terms of the overall balance of baseline covariates. (3) Results: Methods that explicitly deal with missing data were superior to classical complete case analysis. The best balance was observed when propensity scores were estimated with a method that accounts for missing data using a stochastic approximation of the expectation-maximization algorithm. (4) Conclusions: If missing at random mechanism is plausible, methods that use missing data to estimate propensity score or impute them should be preferred. Sensitivity analyses are encouraged to evaluate the implications methods used to handle missing data and estimate propensity score. |
format | Online Article Text |
id | pubmed-8293809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-82938092021-07-22 Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data Bottigliengo, Daniele Lorenzoni, Giulia Ocagli, Honoria Martinato, Matteo Berchialla, Paola Gregori, Dario Int J Environ Res Public Health Article (1) Background: Propensity score methods gained popularity in non-interventional clinical studies. As it may often occur in observational datasets, some values in baseline covariates are missing for some patients. The present study aims to compare the performances of popular statistical methods to deal with missing data in propensity score analysis. (2) Methods: Methods that account for missing data during the estimation process and methods based on the imputation of missing values, such as multiple imputations, were considered. The methods were applied on the dataset of an ongoing prospective registry for the treatment of unprotected left main coronary artery disease. The performances were assessed in terms of the overall balance of baseline covariates. (3) Results: Methods that explicitly deal with missing data were superior to classical complete case analysis. The best balance was observed when propensity scores were estimated with a method that accounts for missing data using a stochastic approximation of the expectation-maximization algorithm. (4) Conclusions: If missing at random mechanism is plausible, methods that use missing data to estimate propensity score or impute them should be preferred. Sensitivity analyses are encouraged to evaluate the implications methods used to handle missing data and estimate propensity score. MDPI 2021-06-22 /pmc/articles/PMC8293809/ /pubmed/34206234 http://dx.doi.org/10.3390/ijerph18136694 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bottigliengo, Daniele Lorenzoni, Giulia Ocagli, Honoria Martinato, Matteo Berchialla, Paola Gregori, Dario Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title | Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title_full | Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title_fullStr | Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title_full_unstemmed | Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title_short | Propensity Score Analysis with Partially Observed Baseline Covariates: A Practical Comparison of Methods for Handling Missing Data |
title_sort | propensity score analysis with partially observed baseline covariates: a practical comparison of methods for handling missing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8293809/ https://www.ncbi.nlm.nih.gov/pubmed/34206234 http://dx.doi.org/10.3390/ijerph18136694 |
work_keys_str_mv | AT bottigliengodaniele propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata AT lorenzonigiulia propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata AT ocaglihonoria propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata AT martinatomatteo propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata AT berchiallapaola propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata AT gregoridario propensityscoreanalysiswithpartiallyobservedbaselinecovariatesapracticalcomparisonofmethodsforhandlingmissingdata |