Cargando…
Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research
BACKGROUND: Cardinality matching (CM), a novel matching technique, finds the largest matched sample meeting prespecified balance criteria thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146256/ https://www.ncbi.nlm.nih.gov/pubmed/34030640 http://dx.doi.org/10.1186/s12874-021-01282-1 |
_version_ | 1783697356989923328 |
---|---|
author | Fortin, Stephen P. Johnston, Stephen S Schuemie, Martijn J |
author_facet | Fortin, Stephen P. Johnston, Stephen S Schuemie, Martijn J |
author_sort | Fortin, Stephen P. |
collection | PubMed |
description | BACKGROUND: Cardinality matching (CM), a novel matching technique, finds the largest matched sample meeting prespecified balance criteria thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small sample sizes. The current study proposes a framework for large-scale CM (LS-CM); and compares large-scale PSM (LS-PSM) and LS-CM in terms of post-match sample size, covariate balance and residual confounding at progressively smaller sample sizes. METHODS: Evaluation of LS-PSM and LS-CM within a comparative cohort study of new users of angiotensin-converting enzyme inhibitor (ACEI) and thiazide or thiazide-like diuretic monotherapy identified from a U.S. insurance claims database. Candidate covariates included patient demographics, and all observed prior conditions, drug exposures and procedures. Propensity scores were calculated using LASSO regression, and candidate covariates with non-zero beta coefficients in the propensity model were defined as matching covariates for use in LS-CM. One-to-one matching was performed using progressively tighter parameter settings. Covariate balance was assessed using standardized mean differences. Hazard ratios for negative control outcomes perceived as unassociated with treatment (i.e., true hazard ratio of 1) were estimated using unconditional Cox models. Residual confounding was assessed using the expected systematic error of the empirical null distribution of negative control effect estimates compared to the ground truth. To simulate diverse research conditions, analyses were repeated within 10 %, 1 and 0.5 % subsample groups with increasingly limited covariate overlap. RESULTS: A total of 172,117 patients (ACEI: 129,078; thiazide: 43,039) met the study criteria. As compared to LS-PSM, LS-CM was associated with increased sample retention. Although LS-PSM achieved balance across all matching covariates within the full study population, substantial matching covariate imbalance was observed within the 1 and 0.5 % subsample groups. Meanwhile, LS-CM achieved matching covariate balance across all analyses. LS-PSM was associated with better candidate covariate balance within the full study population. Otherwise, both matching techniques achieved comparable candidate covariate balance and expected systematic error. CONCLUSIONS: LS-CM found the largest matched sample meeting prespecified balance criteria while achieving comparable candidate covariate balance and residual confounding. We recommend LS-CM as an alternative to LS-PSM in studies with small sample sizes or limited covariate overlap. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-021-01282-1. |
format | Online Article Text |
id | pubmed-8146256 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81462562021-05-25 Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research Fortin, Stephen P. Johnston, Stephen S Schuemie, Martijn J BMC Med Res Methodol Research Article BACKGROUND: Cardinality matching (CM), a novel matching technique, finds the largest matched sample meeting prespecified balance criteria thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small sample sizes. The current study proposes a framework for large-scale CM (LS-CM); and compares large-scale PSM (LS-PSM) and LS-CM in terms of post-match sample size, covariate balance and residual confounding at progressively smaller sample sizes. METHODS: Evaluation of LS-PSM and LS-CM within a comparative cohort study of new users of angiotensin-converting enzyme inhibitor (ACEI) and thiazide or thiazide-like diuretic monotherapy identified from a U.S. insurance claims database. Candidate covariates included patient demographics, and all observed prior conditions, drug exposures and procedures. Propensity scores were calculated using LASSO regression, and candidate covariates with non-zero beta coefficients in the propensity model were defined as matching covariates for use in LS-CM. One-to-one matching was performed using progressively tighter parameter settings. Covariate balance was assessed using standardized mean differences. Hazard ratios for negative control outcomes perceived as unassociated with treatment (i.e., true hazard ratio of 1) were estimated using unconditional Cox models. Residual confounding was assessed using the expected systematic error of the empirical null distribution of negative control effect estimates compared to the ground truth. To simulate diverse research conditions, analyses were repeated within 10 %, 1 and 0.5 % subsample groups with increasingly limited covariate overlap. RESULTS: A total of 172,117 patients (ACEI: 129,078; thiazide: 43,039) met the study criteria. As compared to LS-PSM, LS-CM was associated with increased sample retention. Although LS-PSM achieved balance across all matching covariates within the full study population, substantial matching covariate imbalance was observed within the 1 and 0.5 % subsample groups. Meanwhile, LS-CM achieved matching covariate balance across all analyses. LS-PSM was associated with better candidate covariate balance within the full study population. Otherwise, both matching techniques achieved comparable candidate covariate balance and expected systematic error. CONCLUSIONS: LS-CM found the largest matched sample meeting prespecified balance criteria while achieving comparable candidate covariate balance and residual confounding. We recommend LS-CM as an alternative to LS-PSM in studies with small sample sizes or limited covariate overlap. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-021-01282-1. BioMed Central 2021-05-24 /pmc/articles/PMC8146256/ /pubmed/34030640 http://dx.doi.org/10.1186/s12874-021-01282-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Fortin, Stephen P. Johnston, Stephen S Schuemie, Martijn J Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title | Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title_full | Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title_fullStr | Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title_full_unstemmed | Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title_short | Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
title_sort | applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146256/ https://www.ncbi.nlm.nih.gov/pubmed/34030640 http://dx.doi.org/10.1186/s12874-021-01282-1 |
work_keys_str_mv | AT fortinstephenp appliedcomparisonoflargescalepropensityscorematchingandcardinalitymatchingforcausalinferenceinobservationalresearch AT johnstonstephens appliedcomparisonoflargescalepropensityscorematchingandcardinalitymatchingforcausalinferenceinobservationalresearch AT schuemiemartijnj appliedcomparisonoflargescalepropensityscorematchingandcardinalitymatchingforcausalinferenceinobservationalresearch |