
Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research

BACKGROUND: Cardinality matching (CM), a novel matching technique, finds the largest matched sample meeting prespecified balance criteria thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small...


Bibliographic Details
Main Authors: Fortin, Stephen P.; Johnston, Stephen S; Schuemie, Martijn J
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146256/
https://www.ncbi.nlm.nih.gov/pubmed/34030640
http://dx.doi.org/10.1186/s12874-021-01282-1
author Fortin, Stephen P.
Johnston, Stephen S
Schuemie, Martijn J
collection PubMed
description BACKGROUND: Cardinality matching (CM), a novel matching technique, finds the largest matched sample that meets prespecified balance criteria, thereby overcoming limitations of propensity score matching (PSM) associated with limited covariate overlap, which are especially pronounced in studies with small sample sizes. The current study proposes a framework for large-scale CM (LS-CM) and compares large-scale PSM (LS-PSM) and LS-CM in terms of post-match sample size, covariate balance and residual confounding at progressively smaller sample sizes.
METHODS: LS-PSM and LS-CM were evaluated within a comparative cohort study of new users of angiotensin-converting enzyme inhibitor (ACEI) and thiazide or thiazide-like diuretic monotherapy identified from a U.S. insurance claims database. Candidate covariates included patient demographics and all observed prior conditions, drug exposures and procedures. Propensity scores were calculated using LASSO regression, and candidate covariates with non-zero beta coefficients in the propensity model were defined as matching covariates for use in LS-CM. One-to-one matching was performed using progressively tighter parameter settings. Covariate balance was assessed using standardized mean differences. Hazard ratios for negative control outcomes perceived as unassociated with treatment (i.e., true hazard ratio of 1) were estimated using unconditional Cox models. Residual confounding was assessed using the expected systematic error of the empirical null distribution of negative control effect estimates compared to the ground truth. To simulate diverse research conditions, analyses were repeated within 10%, 1% and 0.5% subsample groups with increasingly limited covariate overlap.
RESULTS: A total of 172,117 patients (ACEI: 129,078; thiazide: 43,039) met the study criteria. Compared with LS-PSM, LS-CM was associated with increased sample retention. Although LS-PSM achieved balance across all matching covariates within the full study population, substantial matching covariate imbalance was observed within the 1% and 0.5% subsample groups. Meanwhile, LS-CM achieved matching covariate balance across all analyses. LS-PSM was associated with better candidate covariate balance within the full study population. Otherwise, both matching techniques achieved comparable candidate covariate balance and expected systematic error.
CONCLUSIONS: LS-CM found the largest matched sample meeting prespecified balance criteria while achieving comparable candidate covariate balance and residual confounding. We recommend LS-CM as an alternative to LS-PSM in studies with small sample sizes or limited covariate overlap.
SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-021-01282-1.
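The abstract states that covariate balance was assessed using standardized mean differences. As a rough illustration only, the following Python sketch computes one common form of that statistic for a single covariate. The function names, the use of sample variances in the pooled denominator, and the 0.1 balance threshold are assumptions made for this example; the record does not state the exact formula or threshold used in the study.

```python
from math import sqrt
from statistics import mean, variance


def standardized_mean_difference(treated, control):
    """Difference in covariate means divided by the pooled standard deviation.

    Assumption: the pooled SD is the square root of the average of the two
    sample variances. Other definitions exist.
    """
    diff = mean(treated) - mean(control)
    pooled_sd = sqrt((variance(treated) + variance(control)) / 2)
    if pooled_sd == 0:
        # Both groups are constant: balanced if the means agree, otherwise
        # the imbalance is unbounded.
        return 0.0 if diff == 0 else float("inf")
    return diff / pooled_sd


def is_balanced(smd, threshold=0.1):
    """Hypothetical balance rule: |SMD| below an assumed threshold of 0.1."""
    return abs(smd) < threshold


# Hypothetical binary covariate (1 = condition present) in each matched group.
treated_group = [1, 1, 0, 1]
control_group = [0, 1, 0, 0]
example_smd = standardized_mean_difference(treated_group, control_group)
```

In this toy input the covariate is far more common among treated patients, so `is_balanced(example_smd)` is false. In a matched analysis the same check would be repeated for every matching or candidate covariate.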
format Online
Article
Text
id pubmed-8146256
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-8146256 2021-05-25 BMC Med Res Methodol Research Article BioMed Central 2021-05-24 /pmc/articles/PMC8146256/ /pubmed/34030640 http://dx.doi.org/10.1186/s12874-021-01282-1 Text en
© The Author(s) 2021. Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
title Applied comparison of large‐scale propensity score matching and cardinality matching for causal inference in observational research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8146256/
https://www.ncbi.nlm.nih.gov/pubmed/34030640
http://dx.doi.org/10.1186/s12874-021-01282-1