Cargando…

Propensity score matching with R: conventional methods and new features

It is increasingly important to accurately and comprehensively estimate the effects of particular clinical treatments. Although randomization is the current gold standard, randomized controlled trials (RCTs) are often limited in practice due to ethical and cost issues. Observational studies have als...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Qin-Yu, Luo, Jing-Chao, Su, Ying, Zhang, Yi-Jie, Tu, Guo-Wei, Luo, Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246231/
https://www.ncbi.nlm.nih.gov/pubmed/34268425
http://dx.doi.org/10.21037/atm-20-3998
_version_ 1783716271884337152
author Zhao, Qin-Yu
Luo, Jing-Chao
Su, Ying
Zhang, Yi-Jie
Tu, Guo-Wei
Luo, Zhe
author_facet Zhao, Qin-Yu
Luo, Jing-Chao
Su, Ying
Zhang, Yi-Jie
Tu, Guo-Wei
Luo, Zhe
author_sort Zhao, Qin-Yu
collection PubMed
description It is increasingly important to accurately and comprehensively estimate the effects of particular clinical treatments. Although randomization is the current gold standard, randomized controlled trials (RCTs) are often limited in practice due to ethical and cost issues. Observational studies have also attracted a great deal of attention as, quite often, large historical datasets are available for these kinds of studies. However, observational studies also have their drawbacks, mainly including the systematic differences in baseline covariates, which relate to outcomes between treatment and control groups that can potentially bias results. Propensity score methods, which are a series of balancing methods in these studies, have become increasingly popular by virtue of the two major advantages of dimension reduction and design separation. Within this approach, propensity score matching (PSM) has been empirically proven, with outstanding performances across observational datasets. While PSM tutorials are available in the literature, there is still room for improvement. Some PSM tutorials provide step-by-step guidance, but only one or two packages have been covered, thereby limiting their scope and practicality. Several articles and books have expounded upon propensity scores in detail, exploring statistical principles and theories; however, the lack of explanations on function usage in programming language has made it difficult for researchers to understand and follow these materials. To this end, this tutorial was developed with a six-step PSM framework, in which we summarize the recent updates and provide step-by-step guidance to the R programming language. This tutorial offers researchers with a broad survey of PSM, ranging from data preprocessing to estimations of propensity scores, and from matching to analyses. We also explain generalized propensity scoring for multiple or continuous treatments, as well as time-dependent PSM. Lastly, we discuss the advantages and disadvantages of propensity score methods.
format Online
Article
Text
id pubmed-8246231
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-82462312021-07-14 Propensity score matching with R: conventional methods and new features Zhao, Qin-Yu Luo, Jing-Chao Su, Ying Zhang, Yi-Jie Tu, Guo-Wei Luo, Zhe Ann Transl Med Review Article It is increasingly important to accurately and comprehensively estimate the effects of particular clinical treatments. Although randomization is the current gold standard, randomized controlled trials (RCTs) are often limited in practice due to ethical and cost issues. Observational studies have also attracted a great deal of attention as, quite often, large historical datasets are available for these kinds of studies. However, observational studies also have their drawbacks, mainly including the systematic differences in baseline covariates, which relate to outcomes between treatment and control groups that can potentially bias results. Propensity score methods, which are a series of balancing methods in these studies, have become increasingly popular by virtue of the two major advantages of dimension reduction and design separation. Within this approach, propensity score matching (PSM) has been empirically proven, with outstanding performances across observational datasets. While PSM tutorials are available in the literature, there is still room for improvement. Some PSM tutorials provide step-by-step guidance, but only one or two packages have been covered, thereby limiting their scope and practicality. Several articles and books have expounded upon propensity scores in detail, exploring statistical principles and theories; however, the lack of explanations on function usage in programming language has made it difficult for researchers to understand and follow these materials. To this end, this tutorial was developed with a six-step PSM framework, in which we summarize the recent updates and provide step-by-step guidance to the R programming language. This tutorial offers researchers with a broad survey of PSM, ranging from data preprocessing to estimations of propensity scores, and from matching to analyses. We also explain generalized propensity scoring for multiple or continuous treatments, as well as time-dependent PSM. Lastly, we discuss the advantages and disadvantages of propensity score methods. AME Publishing Company 2021-05 /pmc/articles/PMC8246231/ /pubmed/34268425 http://dx.doi.org/10.21037/atm-20-3998 Text en 2021 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Review Article
Zhao, Qin-Yu
Luo, Jing-Chao
Su, Ying
Zhang, Yi-Jie
Tu, Guo-Wei
Luo, Zhe
Propensity score matching with R: conventional methods and new features
title Propensity score matching with R: conventional methods and new features
title_full Propensity score matching with R: conventional methods and new features
title_fullStr Propensity score matching with R: conventional methods and new features
title_full_unstemmed Propensity score matching with R: conventional methods and new features
title_short Propensity score matching with R: conventional methods and new features
title_sort propensity score matching with r: conventional methods and new features
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246231/
https://www.ncbi.nlm.nih.gov/pubmed/34268425
http://dx.doi.org/10.21037/atm-20-3998
work_keys_str_mv AT zhaoqinyu propensityscorematchingwithrconventionalmethodsandnewfeatures
AT luojingchao propensityscorematchingwithrconventionalmethodsandnewfeatures
AT suying propensityscorematchingwithrconventionalmethodsandnewfeatures
AT zhangyijie propensityscorematchingwithrconventionalmethodsandnewfeatures
AT tuguowei propensityscorematchingwithrconventionalmethodsandnewfeatures
AT luozhe propensityscorematchingwithrconventionalmethodsandnewfeatures