The Use of Synthetic Electronic Health Record Data and Deep Learning to Improve Timing of High-Risk Heart Failure Surgical Intervention by Predicting Proximity to Catastrophic Decompensation
Objective: Although many clinical metrics are associated with proximity to decompensation in heart failure (HF), none are individually accurate enough to risk-stratify HF patients on a patient-by-patient basis. The dire consequences of this inaccuracy in risk stratification have profoundly lowered t...
Main authors: | Guo, Aixia; Foraker, Randi E.; MacGregor, Robert M.; Masood, Faraz M.; Cupps, Brian P.; Pasque, Michael K. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2020 |
Subjects: | Digital Health |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521851/ https://www.ncbi.nlm.nih.gov/pubmed/34713050 http://dx.doi.org/10.3389/fdgth.2020.576945 |
_version_ | 1784584971935023104 |
---|---|
author | Guo, Aixia Foraker, Randi E. MacGregor, Robert M. Masood, Faraz M. Cupps, Brian P. Pasque, Michael K. |
author_facet | Guo, Aixia Foraker, Randi E. MacGregor, Robert M. Masood, Faraz M. Cupps, Brian P. Pasque, Michael K. |
author_sort | Guo, Aixia |
collection | PubMed |
description | Objective: Although many clinical metrics are associated with proximity to decompensation in heart failure (HF), none are individually accurate enough to risk-stratify HF patients on a patient-by-patient basis. The dire consequences of this inaccuracy in risk stratification have profoundly lowered the clinical threshold for application of high-risk surgical intervention, such as ventricular assist device placement. Machine learning can detect non-intuitive classifier patterns that allow for innovative combination of patient feature predictive capability. A machine learning-based clinical tool to identify proximity to catastrophic HF deterioration on a patient-specific basis would enable more efficient direction of high-risk surgical intervention to those patients who have the most to gain from it, while sparing others. Synthetic electronic health record (EHR) data are statistically indistinguishable from the original protected health information, and can be analyzed as if they were original data but without any privacy concerns. We demonstrate that synthetic EHR data can be easily accessed and analyzed and are amenable to machine learning analyses. Methods: We developed synthetic data from EHR data of 26,575 HF patients admitted to a single institution during the decade ending on 12/31/2018. Twenty-seven clinically-relevant features were synthesized and utilized in supervised deep learning and machine learning algorithms (i.e., deep neural networks [DNN], random forest [RF], and logistic regression [LR]) to explore their ability to predict 1-year mortality by five-fold cross validation methods. We conducted analyses leveraging features from prior to/at and after/at the time of HF diagnosis. Results: The area under the receiver operating curve (AUC) was used to evaluate the performance of the three models: the mean AUC was 0.80 for DNN, 0.72 for RF, and 0.74 for LR. Age, creatinine, body mass index, and blood pressure levels were especially important features in predicting death within 1-year among HF patients. Conclusions: Machine learning models have considerable potential to improve accuracy in mortality prediction, such that high-risk surgical intervention can be applied only in those patients who stand to benefit from it. Access to EHR-based synthetic data derivatives eliminates risk of exposure of EHR data, speeds time-to-insight, and facilitates data sharing. As more clinical, imaging, and contractile features with proven predictive capability are added to these models, the development of a clinical tool to assist in timing of intervention in surgical candidates may be possible. |
format | Online Article Text |
id | pubmed-8521851 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8521851 2021-10-27 The Use of Synthetic Electronic Health Record Data and Deep Learning to Improve Timing of High-Risk Heart Failure Surgical Intervention by Predicting Proximity to Catastrophic Decompensation Guo, Aixia Foraker, Randi E. MacGregor, Robert M. Masood, Faraz M. Cupps, Brian P. Pasque, Michael K. Front Digit Health Digital Health Objective: Although many clinical metrics are associated with proximity to decompensation in heart failure (HF), none are individually accurate enough to risk-stratify HF patients on a patient-by-patient basis. The dire consequences of this inaccuracy in risk stratification have profoundly lowered the clinical threshold for application of high-risk surgical intervention, such as ventricular assist device placement. Machine learning can detect non-intuitive classifier patterns that allow for innovative combination of patient feature predictive capability. A machine learning-based clinical tool to identify proximity to catastrophic HF deterioration on a patient-specific basis would enable more efficient direction of high-risk surgical intervention to those patients who have the most to gain from it, while sparing others. Synthetic electronic health record (EHR) data are statistically indistinguishable from the original protected health information, and can be analyzed as if they were original data but without any privacy concerns. We demonstrate that synthetic EHR data can be easily accessed and analyzed and are amenable to machine learning analyses. Methods: We developed synthetic data from EHR data of 26,575 HF patients admitted to a single institution during the decade ending on 12/31/2018. Twenty-seven clinically-relevant features were synthesized and utilized in supervised deep learning and machine learning algorithms (i.e., deep neural networks [DNN], random forest [RF], and logistic regression [LR]) to explore their ability to predict 1-year mortality by five-fold cross validation methods. We conducted analyses leveraging features from prior to/at and after/at the time of HF diagnosis. Results: The area under the receiver operating curve (AUC) was used to evaluate the performance of the three models: the mean AUC was 0.80 for DNN, 0.72 for RF, and 0.74 for LR. Age, creatinine, body mass index, and blood pressure levels were especially important features in predicting death within 1-year among HF patients. Conclusions: Machine learning models have considerable potential to improve accuracy in mortality prediction, such that high-risk surgical intervention can be applied only in those patients who stand to benefit from it. Access to EHR-based synthetic data derivatives eliminates risk of exposure of EHR data, speeds time-to-insight, and facilitates data sharing. As more clinical, imaging, and contractile features with proven predictive capability are added to these models, the development of a clinical tool to assist in timing of intervention in surgical candidates may be possible. Frontiers Media S.A. 2020-12-07 /pmc/articles/PMC8521851/ /pubmed/34713050 http://dx.doi.org/10.3389/fdgth.2020.576945 Text en Copyright © 2020 Guo, Foraker, MacGregor, Masood, Cupps and Pasque. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Digital Health Guo, Aixia Foraker, Randi E. MacGregor, Robert M. Masood, Faraz M. Cupps, Brian P. Pasque, Michael K. The Use of Synthetic Electronic Health Record Data and Deep Learning to Improve Timing of High-Risk Heart Failure Surgical Intervention by Predicting Proximity to Catastrophic Decompensation |
title | The Use of Synthetic Electronic Health Record Data and Deep Learning to Improve Timing of High-Risk Heart Failure Surgical Intervention by Predicting Proximity to Catastrophic Decompensation |
title_sort | use of synthetic electronic health record data and deep learning to improve timing of high-risk heart failure surgical intervention by predicting proximity to catastrophic decompensation |
topic | Digital Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8521851/ https://www.ncbi.nlm.nih.gov/pubmed/34713050 http://dx.doi.org/10.3389/fdgth.2020.576945 |