Cargando…

Adding propensity scores to pure prediction models fails to improve predictive performance

Background. Propensity score usage seems to be growing in popularity leading researchers to question the possible role of propensity scores in prediction modeling, despite the lack of a theoretical rationale. It is suspected that such requests are due to the lack of differentiation regarding the goa...

Descripción completa

Detalles Bibliográficos
Autores principales: Nowacki, Amy S., Wells, Brian J., Yu, Changhong, Kattan, Michael W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3740143/
https://www.ncbi.nlm.nih.gov/pubmed/23940836
http://dx.doi.org/10.7717/peerj.123
_version_ 1782476980753530880
author Nowacki, Amy S.
Wells, Brian J.
Yu, Changhong
Kattan, Michael W.
author_facet Nowacki, Amy S.
Wells, Brian J.
Yu, Changhong
Kattan, Michael W.
author_sort Nowacki, Amy S.
collection PubMed
description Background. Propensity score usage seems to be growing in popularity leading researchers to question the possible role of propensity scores in prediction modeling, despite the lack of a theoretical rationale. It is suspected that such requests are due to the lack of differentiation regarding the goals of predictive modeling versus causal inference modeling. Therefore, the purpose of this study is to formally examine the effect of propensity scores on predictive performance. Our hypothesis is that a multivariable regression model that adjusts for all covariates will perform as well as or better than those models utilizing propensity scores with respect to model discrimination and calibration. Methods. The most commonly encountered statistical scenarios for medical prediction (logistic and proportional hazards regression) were used to investigate this research question. Random cross-validation was performed 500 times to correct for optimism. The multivariable regression models adjusting for all covariates were compared with models that included adjustment for or weighting with the propensity scores. The methods were compared based on three predictive performance measures: (1) concordance indices; (2) Brier scores; and (3) calibration curves. Results. Multivariable models adjusting for all covariates had the highest average concordance index, the lowest average Brier score, and the best calibration. Propensity score adjustment and inverse probability weighting models without adjustment for all covariates performed worse than full models and failed to improve predictive performance with full covariate adjustment. Conclusion. Propensity score techniques did not improve prediction performance measures beyond multivariable adjustment. Propensity scores are not recommended if the analytical goal is pure prediction modeling.
format Online
Article
Text
id pubmed-3740143
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-37401432013-08-12 Adding propensity scores to pure prediction models fails to improve predictive performance Nowacki, Amy S. Wells, Brian J. Yu, Changhong Kattan, Michael W. PeerJ Epidemiology Background. Propensity score usage seems to be growing in popularity leading researchers to question the possible role of propensity scores in prediction modeling, despite the lack of a theoretical rationale. It is suspected that such requests are due to the lack of differentiation regarding the goals of predictive modeling versus causal inference modeling. Therefore, the purpose of this study is to formally examine the effect of propensity scores on predictive performance. Our hypothesis is that a multivariable regression model that adjusts for all covariates will perform as well as or better than those models utilizing propensity scores with respect to model discrimination and calibration. Methods. The most commonly encountered statistical scenarios for medical prediction (logistic and proportional hazards regression) were used to investigate this research question. Random cross-validation was performed 500 times to correct for optimism. The multivariable regression models adjusting for all covariates were compared with models that included adjustment for or weighting with the propensity scores. The methods were compared based on three predictive performance measures: (1) concordance indices; (2) Brier scores; and (3) calibration curves. Results. Multivariable models adjusting for all covariates had the highest average concordance index, the lowest average Brier score, and the best calibration. Propensity score adjustment and inverse probability weighting models without adjustment for all covariates performed worse than full models and failed to improve predictive performance with full covariate adjustment. Conclusion. Propensity score techniques did not improve prediction performance measures beyond multivariable adjustment. Propensity scores are not recommended if the analytical goal is pure prediction modeling. PeerJ Inc. 2013-08-01 /pmc/articles/PMC3740143/ /pubmed/23940836 http://dx.doi.org/10.7717/peerj.123 Text en © 2013 Nowacki et al. http://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Epidemiology
Nowacki, Amy S.
Wells, Brian J.
Yu, Changhong
Kattan, Michael W.
Adding propensity scores to pure prediction models fails to improve predictive performance
title Adding propensity scores to pure prediction models fails to improve predictive performance
title_full Adding propensity scores to pure prediction models fails to improve predictive performance
title_fullStr Adding propensity scores to pure prediction models fails to improve predictive performance
title_full_unstemmed Adding propensity scores to pure prediction models fails to improve predictive performance
title_short Adding propensity scores to pure prediction models fails to improve predictive performance
title_sort adding propensity scores to pure prediction models fails to improve predictive performance
topic Epidemiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3740143/
https://www.ncbi.nlm.nih.gov/pubmed/23940836
http://dx.doi.org/10.7717/peerj.123
work_keys_str_mv AT nowackiamys addingpropensityscorestopurepredictionmodelsfailstoimprovepredictiveperformance
AT wellsbrianj addingpropensityscorestopurepredictionmodelsfailstoimprovepredictiveperformance
AT yuchanghong addingpropensityscorestopurepredictionmodelsfailstoimprovepredictiveperformance
AT kattanmichaelw addingpropensityscorestopurepredictionmodelsfailstoimprovepredictiveperformance