Cargando…

A comparison of 12 algorithms for matching on the propensity score

Propensity-score matching is increasingly being used to reduce the confounding that can occur in observational studies examining the effects of treatments or interventions on outcomes. We used Monte Carlo simulations to examine the following algorithms for forming matched pairs of treated and untrea...

Descripción completa

Detalles Bibliográficos
Autor principal: Austin, Peter C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285163/
https://www.ncbi.nlm.nih.gov/pubmed/24123228
http://dx.doi.org/10.1002/sim.6004
_version_ 1782351540414054400
author Austin, Peter C
author_facet Austin, Peter C
author_sort Austin, Peter C
collection PubMed
description Propensity-score matching is increasingly being used to reduce the confounding that can occur in observational studies examining the effects of treatments or interventions on outcomes. We used Monte Carlo simulations to examine the following algorithms for forming matched pairs of treated and untreated subjects: optimal matching, greedy nearest neighbor matching without replacement, and greedy nearest neighbor matching without replacement within specified caliper widths. For each of the latter two algorithms, we examined four different sub-algorithms defined by the order in which treated subjects were selected for matching to an untreated subject: lowest to highest propensity score, highest to lowest propensity score, best match first, and random order. We also examined matching with replacement. We found that (i) nearest neighbor matching induced the same balance in baseline covariates as did optimal matching; (ii) when at least some of the covariates were continuous, caliper matching tended to induce balance on baseline covariates that was at least as good as the other algorithms; (iii) caliper matching tended to result in estimates of treatment effect with less bias compared with optimal and nearest neighbor matching; (iv) optimal and nearest neighbor matching resulted in estimates of treatment effect with negligibly less variability than did caliper matching; (v) caliper matching had amongst the best performance when assessed using mean squared error; (vi) the order in which treated subjects were selected for matching had at most a modest effect on estimation; and (vii) matching with replacement did not have superior performance compared with caliper matching without replacement. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd.
format Online
Article
Text
id pubmed-4285163
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-42851632015-01-26 A comparison of 12 algorithms for matching on the propensity score Austin, Peter C Stat Med Research Articles Propensity-score matching is increasingly being used to reduce the confounding that can occur in observational studies examining the effects of treatments or interventions on outcomes. We used Monte Carlo simulations to examine the following algorithms for forming matched pairs of treated and untreated subjects: optimal matching, greedy nearest neighbor matching without replacement, and greedy nearest neighbor matching without replacement within specified caliper widths. For each of the latter two algorithms, we examined four different sub-algorithms defined by the order in which treated subjects were selected for matching to an untreated subject: lowest to highest propensity score, highest to lowest propensity score, best match first, and random order. We also examined matching with replacement. We found that (i) nearest neighbor matching induced the same balance in baseline covariates as did optimal matching; (ii) when at least some of the covariates were continuous, caliper matching tended to induce balance on baseline covariates that was at least as good as the other algorithms; (iii) caliper matching tended to result in estimates of treatment effect with less bias compared with optimal and nearest neighbor matching; (iv) optimal and nearest neighbor matching resulted in estimates of treatment effect with negligibly less variability than did caliper matching; (v) caliper matching had amongst the best performance when assessed using mean squared error; (vi) the order in which treated subjects were selected for matching had at most a modest effect on estimation; and (vii) matching with replacement did not have superior performance compared with caliper matching without replacement. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. BlackWell Publishing Ltd 2014-03-15 2013-10-07 /pmc/articles/PMC4285163/ /pubmed/24123228 http://dx.doi.org/10.1002/sim.6004 Text en © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Austin, Peter C
A comparison of 12 algorithms for matching on the propensity score
title A comparison of 12 algorithms for matching on the propensity score
title_full A comparison of 12 algorithms for matching on the propensity score
title_fullStr A comparison of 12 algorithms for matching on the propensity score
title_full_unstemmed A comparison of 12 algorithms for matching on the propensity score
title_short A comparison of 12 algorithms for matching on the propensity score
title_sort comparison of 12 algorithms for matching on the propensity score
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285163/
https://www.ncbi.nlm.nih.gov/pubmed/24123228
http://dx.doi.org/10.1002/sim.6004
work_keys_str_mv AT austinpeterc acomparisonof12algorithmsformatchingonthepropensityscore
AT austinpeterc comparisonof12algorithmsformatchingonthepropensityscore