Cargando…

A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension

Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which is associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes including hospitaliza...

Descripción completa

Detalles Bibliográficos
Autores principales: Hyde, Bethany, Paoli, Carly J., Panjabi, Sumeet, Bettencourt, Katherine C., Bell Lynum, Karimah S., Selej, Mona
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243208/
https://www.ncbi.nlm.nih.gov/pubmed/37287599
http://dx.doi.org/10.1002/pul2.12237
_version_ 1785054380514344960
author Hyde, Bethany
Paoli, Carly J.
Panjabi, Sumeet
Bettencourt, Katherine C.
Bell Lynum, Karimah S.
Selej, Mona
author_facet Hyde, Bethany
Paoli, Carly J.
Panjabi, Sumeet
Bettencourt, Katherine C.
Bell Lynum, Karimah S.
Selej, Mona
author_sort Hyde, Bethany
collection PubMed
description Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which is associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes including hospitalization and death. We developed a machine‐learning (ML) algorithm to identify patients at risk for PAH earlier in their symptom journey and distinguish them from patients with similar early symptoms not at risk for developing PAH. Our supervised ML model analyzed retrospective, de‐identified data from the US‐based Optum® Clinformatics® Data Mart claims database (January 2015 to December 2019). Propensity score matched PAH and non‐PAH (control) cohorts were established based on observed differences. Random forest models were used to classify patients as PAH or non‐PAH at diagnosis and at 6 months prediagnosis. The PAH and non‐PAH cohorts included 1339 and 4222 patients, respectively. At 6 months prediagnosis, the model performed well in distinguishing PAH and non‐PAH patients, with area under the curve of the receiver operating characteristic of 0.84, recall (sensitivity) of 0.73, and precision of 0.50. Key features distinguishing PAH from non‐PAH cohorts were a longer time between first symptom and the prediagnosis model date (i.e., 6 months before diagnosis); more diagnostic and prescription claims, circulatory claims, and imaging procedures, leading to higher overall healthcare resource utilization; and more hospitalizations. Our model distinguishes between patients with and without PAH at 6 months before diagnosis and illustrates the feasibility of using routine claims data to identify patients at a population level who might benefit from PAH‐specific screening and/or earlier specialist referral.
format Online
Article
Text
id pubmed-10243208
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-102432082023-06-07 A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension Hyde, Bethany Paoli, Carly J. Panjabi, Sumeet Bettencourt, Katherine C. Bell Lynum, Karimah S. Selej, Mona Pulm Circ Research Articles Many patients with pulmonary arterial hypertension (PAH) experience substantial delays in diagnosis, which is associated with worse outcomes and higher costs. Tools for diagnosing PAH sooner may lead to earlier treatment, which may delay disease progression and adverse outcomes including hospitalization and death. We developed a machine‐learning (ML) algorithm to identify patients at risk for PAH earlier in their symptom journey and distinguish them from patients with similar early symptoms not at risk for developing PAH. Our supervised ML model analyzed retrospective, de‐identified data from the US‐based Optum® Clinformatics® Data Mart claims database (January 2015 to December 2019). Propensity score matched PAH and non‐PAH (control) cohorts were established based on observed differences. Random forest models were used to classify patients as PAH or non‐PAH at diagnosis and at 6 months prediagnosis. The PAH and non‐PAH cohorts included 1339 and 4222 patients, respectively. At 6 months prediagnosis, the model performed well in distinguishing PAH and non‐PAH patients, with area under the curve of the receiver operating characteristic of 0.84, recall (sensitivity) of 0.73, and precision of 0.50. Key features distinguishing PAH from non‐PAH cohorts were a longer time between first symptom and the prediagnosis model date (i.e., 6 months before diagnosis); more diagnostic and prescription claims, circulatory claims, and imaging procedures, leading to higher overall healthcare resource utilization; and more hospitalizations. Our model distinguishes between patients with and without PAH at 6 months before diagnosis and illustrates the feasibility of using routine claims data to identify patients at a population level who might benefit from PAH‐specific screening and/or earlier specialist referral. John Wiley and Sons Inc. 2023-06-06 /pmc/articles/PMC10243208/ /pubmed/37287599 http://dx.doi.org/10.1002/pul2.12237 Text en © 2023 Janssen Research & Development, LLC. Pulmonary Circulation published by John Wiley & Sons Ltd on behalf of Pulmonary Vascular Research Institute. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Hyde, Bethany
Paoli, Carly J.
Panjabi, Sumeet
Bettencourt, Katherine C.
Bell Lynum, Karimah S.
Selej, Mona
A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title_full A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title_fullStr A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title_full_unstemmed A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title_short A claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
title_sort claims‐based, machine‐learning algorithm to identify patients with pulmonary arterial hypertension
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243208/
https://www.ncbi.nlm.nih.gov/pubmed/37287599
http://dx.doi.org/10.1002/pul2.12237
work_keys_str_mv AT hydebethany aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT paolicarlyj aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT panjabisumeet aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT bettencourtkatherinec aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT belllynumkarimahs aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT selejmona aclaimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT hydebethany claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT paolicarlyj claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT panjabisumeet claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT bettencourtkatherinec claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT belllynumkarimahs claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension
AT selejmona claimsbasedmachinelearningalgorithmtoidentifypatientswithpulmonaryarterialhypertension