Cargando…
Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study
IMPORTANCE: Acute Hepatic Porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of fifteen years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, pre...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491361/ https://www.ncbi.nlm.nih.gov/pubmed/37693437 http://dx.doi.org/10.1101/2023.08.30.23293130 |
_version_ | 1785104044888424448 |
---|---|
author | Bhasuran, Balu Schmolly, Katharina Kapoor, Yuvraaj Jayakumar, Nanditha Lakshmi Doan, Raymond Amin, Jigar Meninger, Stephen Cheng, Nathan Deering, Robert Anderson, Karl Beaven, Simon W. Wang, Bruce Rudrapatna, Vivek A. |
author_facet | Bhasuran, Balu Schmolly, Katharina Kapoor, Yuvraaj Jayakumar, Nanditha Lakshmi Doan, Raymond Amin, Jigar Meninger, Stephen Cheng, Nathan Deering, Robert Anderson, Karl Beaven, Simon W. Wang, Bruce Rudrapatna, Vivek A. |
author_sort | Bhasuran, Balu |
collection | PubMed |
description | IMPORTANCE: Acute Hepatic Porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of fifteen years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. OBJECTIVE: To train and characterize models for identifying patients with AHP. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study used structured and notes-based EHR data from two centers at the University of California, UCSF (2012–2022) and UCLA (2019–2022). The data were split into two cohorts (referral, diagnosis) and used to develop models that predict: 1) who will be referred for testing of acute porphyria, amongst those who presented with abdominal pain (a cardinal symptom of AHP), and 2) who will test positive, amongst those referred. The referral cohort consisted of 747 patients referred for testing and 99,849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. Cases were female predominant and 6–75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. MAIN OUTCOMES AND MEASURES: F-score on an outcome-stratified test set RESULTS: The best center-specific referral models achieved an F-score of 86–91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥ 10% probability of referral, ≥ 50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years. CONCLUSIONS AND RELEVANCE: ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed. |
format | Online Article Text |
id | pubmed-10491361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104913612023-09-09 Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study Bhasuran, Balu Schmolly, Katharina Kapoor, Yuvraaj Jayakumar, Nanditha Lakshmi Doan, Raymond Amin, Jigar Meninger, Stephen Cheng, Nathan Deering, Robert Anderson, Karl Beaven, Simon W. Wang, Bruce Rudrapatna, Vivek A. medRxiv Article IMPORTANCE: Acute Hepatic Porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of fifteen years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. OBJECTIVE: To train and characterize models for identifying patients with AHP. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study used structured and notes-based EHR data from two centers at the University of California, UCSF (2012–2022) and UCLA (2019–2022). The data were split into two cohorts (referral, diagnosis) and used to develop models that predict: 1) who will be referred for testing of acute porphyria, amongst those who presented with abdominal pain (a cardinal symptom of AHP), and 2) who will test positive, amongst those referred. The referral cohort consisted of 747 patients referred for testing and 99,849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. Cases were female predominant and 6–75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. MAIN OUTCOMES AND MEASURES: F-score on an outcome-stratified test set RESULTS: The best center-specific referral models achieved an F-score of 86–91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥ 10% probability of referral, ≥ 50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years. CONCLUSIONS AND RELEVANCE: ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed. Cold Spring Harbor Laboratory 2023-08-31 /pmc/articles/PMC10491361/ /pubmed/37693437 http://dx.doi.org/10.1101/2023.08.30.23293130 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Bhasuran, Balu Schmolly, Katharina Kapoor, Yuvraaj Jayakumar, Nanditha Lakshmi Doan, Raymond Amin, Jigar Meninger, Stephen Cheng, Nathan Deering, Robert Anderson, Karl Beaven, Simon W. Wang, Bruce Rudrapatna, Vivek A. Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title | Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title_full | Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title_fullStr | Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title_full_unstemmed | Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title_short | Reducing diagnostic delays in Acute Hepatic Porphyria using electronic health records data and machine learning: a multicenter development and validation study |
title_sort | reducing diagnostic delays in acute hepatic porphyria using electronic health records data and machine learning: a multicenter development and validation study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10491361/ https://www.ncbi.nlm.nih.gov/pubmed/37693437 http://dx.doi.org/10.1101/2023.08.30.23293130 |
work_keys_str_mv | AT bhasuranbalu reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT schmollykatharina reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT kapooryuvraaj reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT jayakumarnandithalakshmi reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT doanraymond reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT aminjigar reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT meningerstephen reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT chengnathan reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT deeringrobert reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT andersonkarl reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT beavensimonw reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT wangbruce reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy AT rudrapatnaviveka reducingdiagnosticdelaysinacutehepaticporphyriausingelectronichealthrecordsdataandmachinelearningamulticenterdevelopmentandvalidationstudy |