Cargando…
Comparing person-level matching algorithms to identify risk across disparate datasets among patients with a controlled substance prescription: retrospective analysis
BACKGROUND: The opioid epidemic in the United States has precipitated a need for public health agencies to better understand risk factors associated with fatal overdoses. Matching person-level information stored in public health, medical, and human services datasets can enhance the understanding of...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9097759/ https://www.ncbi.nlm.nih.gov/pubmed/35571361 http://dx.doi.org/10.1093/jamiaopen/ooac020 |
_version_ | 1784706239219892224 |
---|---|
author | Ferris, Lindsey M Weiner, Jonathan P Saloner, Brendan Kharrazi, Hadi |
author_facet | Ferris, Lindsey M Weiner, Jonathan P Saloner, Brendan Kharrazi, Hadi |
author_sort | Ferris, Lindsey M |
collection | PubMed |
description | BACKGROUND: The opioid epidemic in the United States has precipitated a need for public health agencies to better understand risk factors associated with fatal overdoses. Matching person-level information stored in public health, medical, and human services datasets can enhance the understanding of opioid overdose risk factors and interventions. OBJECTIVE: This study compares approximate match versus exact match algorithms to link disparate datasets together for identifying persons at risk from an applied perspective. METHODS: This study used statewide prescription drug monitoring program (PDMP), arrest, and mortality data matched at the person-level using an approximate match and 2 exact match algorithms. Impact of matching was assessed by analyzing 3 independent concepts: (1) the prevalence of key risk indicators used by PDMP programs in practice, (2) the prevalence of arrests and fatal opioid overdose, and (3) the performance of a multivariate logistic regression for fatal opioid overdose. The PDMP key risk indicators included (1) multiple provider episodes (MPE), or patients with prescriptions from multiple prescribers and dispensers, (2) high morphine milligram equivalents (MMEs), which represents an opioid’s potency relative to morphine, and (3) overlapping opioid and benzodiazepine prescriptions. RESULTS: Prevalence of PDMP-based risk indicators were higher in the approximate match population for MPEs (n = 4893/1 859 445 [0.26%]) and overlapping opioid/benzodiazepines (n = 57 888/1 859 445 [4.71%]), but the exact-basic match population had the highest prevalence of individuals with high MMEs (n = 664/1 910 741 [3.11%]). Prevalence of arrests and deaths were highest for the approximate match population compared with the exact match populations. Model performance was comparable across the 3 matching algorithms (exact-basic validation area under the receiver operating characteristic curve [AUC]: 0.854; approximate validation AUC: 0.847; exact + zip validation AUC: 0.826) but resulted in different cutoff points balancing sensitivity and specificity. CONCLUSIONS: Our study illustrates the specific tradeoffs of different matching methods. Further research should be performed to compare matching algorithms and its impact on the prevalence of key risk indicators in an applied setting that can improve understanding of risk within a population. |
format | Online Article Text |
id | pubmed-9097759 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-90977592022-05-13 Comparing person-level matching algorithms to identify risk across disparate datasets among patients with a controlled substance prescription: retrospective analysis Ferris, Lindsey M Weiner, Jonathan P Saloner, Brendan Kharrazi, Hadi JAMIA Open Research and Applications BACKGROUND: The opioid epidemic in the United States has precipitated a need for public health agencies to better understand risk factors associated with fatal overdoses. Matching person-level information stored in public health, medical, and human services datasets can enhance the understanding of opioid overdose risk factors and interventions. OBJECTIVE: This study compares approximate match versus exact match algorithms to link disparate datasets together for identifying persons at risk from an applied perspective. METHODS: This study used statewide prescription drug monitoring program (PDMP), arrest, and mortality data matched at the person-level using an approximate match and 2 exact match algorithms. Impact of matching was assessed by analyzing 3 independent concepts: (1) the prevalence of key risk indicators used by PDMP programs in practice, (2) the prevalence of arrests and fatal opioid overdose, and (3) the performance of a multivariate logistic regression for fatal opioid overdose. The PDMP key risk indicators included (1) multiple provider episodes (MPE), or patients with prescriptions from multiple prescribers and dispensers, (2) high morphine milligram equivalents (MMEs), which represents an opioid’s potency relative to morphine, and (3) overlapping opioid and benzodiazepine prescriptions. RESULTS: Prevalence of PDMP-based risk indicators were higher in the approximate match population for MPEs (n = 4893/1 859 445 [0.26%]) and overlapping opioid/benzodiazepines (n = 57 888/1 859 445 [4.71%]), but the exact-basic match population had the highest prevalence of individuals with high MMEs (n = 664/1 910 741 [3.11%]). Prevalence of arrests and deaths were highest for the approximate match population compared with the exact match populations. Model performance was comparable across the 3 matching algorithms (exact-basic validation area under the receiver operating characteristic curve [AUC]: 0.854; approximate validation AUC: 0.847; exact + zip validation AUC: 0.826) but resulted in different cutoff points balancing sensitivity and specificity. CONCLUSIONS: Our study illustrates the specific tradeoffs of different matching methods. Further research should be performed to compare matching algorithms and its impact on the prevalence of key risk indicators in an applied setting that can improve understanding of risk within a population. Oxford University Press 2022-03-30 /pmc/articles/PMC9097759/ /pubmed/35571361 http://dx.doi.org/10.1093/jamiaopen/ooac020 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research and Applications Ferris, Lindsey M Weiner, Jonathan P Saloner, Brendan Kharrazi, Hadi Comparing person-level matching algorithms to identify risk across disparate datasets among patients with a controlled substance prescription: retrospective analysis |
title | Comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
title_full | Comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
title_fullStr | Comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
title_full_unstemmed | Comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
title_short | Comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
title_sort | comparing person-level matching algorithms to identify risk across disparate
datasets among patients with a controlled substance prescription: retrospective
analysis |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9097759/ https://www.ncbi.nlm.nih.gov/pubmed/35571361 http://dx.doi.org/10.1093/jamiaopen/ooac020 |
work_keys_str_mv | AT ferrislindseym comparingpersonlevelmatchingalgorithmstoidentifyriskacrossdisparatedatasetsamongpatientswithacontrolledsubstanceprescriptionretrospectiveanalysis AT weinerjonathanp comparingpersonlevelmatchingalgorithmstoidentifyriskacrossdisparatedatasetsamongpatientswithacontrolledsubstanceprescriptionretrospectiveanalysis AT salonerbrendan comparingpersonlevelmatchingalgorithmstoidentifyriskacrossdisparatedatasetsamongpatientswithacontrolledsubstanceprescriptionretrospectiveanalysis AT kharrazihadi comparingpersonlevelmatchingalgorithmstoidentifyriskacrossdisparatedatasetsamongpatientswithacontrolledsubstanceprescriptionretrospectiveanalysis |