Cargando…
Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation
IMPORTANCE: Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia, and its early detection could lead to significant improvements in outcomes through the appropriate prescription of anticoagulation medication. Although a variety of methods exist for screening for AF, a targeted ap...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Association
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6991266/ https://www.ncbi.nlm.nih.gov/pubmed/31951272 http://dx.doi.org/10.1001/jamanetworkopen.2019.19396 |
_version_ | 1783492623742271488 |
---|---|
author | Tiwari, Premanand Colborn, Kathryn L. Smith, Derek E. Xing, Fuyong Ghosh, Debashis Rosenberg, Michael A. |
author_facet | Tiwari, Premanand Colborn, Kathryn L. Smith, Derek E. Xing, Fuyong Ghosh, Debashis Rosenberg, Michael A. |
author_sort | Tiwari, Premanand |
collection | PubMed |
description | IMPORTANCE: Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia, and its early detection could lead to significant improvements in outcomes through the appropriate prescription of anticoagulation medication. Although a variety of methods exist for screening for AF, a targeted approach, which requires an efficient method for identifying patients at risk, would be preferred. OBJECTIVE: To examine machine learning approaches applied to electronic health record data that have been harmonized to the Observational Medical Outcomes Partnership Common Data Model for identifying risk of AF. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study used data from 2 252 219 individuals cared for in the UCHealth hospital system, which comprises 3 large hospitals in Colorado, from January 1, 2011, to October 1, 2018. Initial analysis was performed in December 2018; follow-up analysis was performed in July 2019. EXPOSURES: All Observational Medical Outcomes Partnership Common Data Model–harmonized electronic health record features, including diagnoses, procedures, medications, age, and sex. MAIN OUTCOMES AND MEASURES: Classification of incident AF in designated 6-month intervals, adjudicated retrospectively, based on area under the receiver operating characteristic curve and F1 statistic. RESULTS: Of 2 252 219 individuals (1 225 533 [54.4%] women; mean [SD] age, 42.9 [22.3] years), 28 036 (1.2%) developed incident AF during a designated 6-month interval. The machine learning model that used the 200 most common electronic health record features, including age and sex, and random oversampling with a single-layer, fully connected neural network provided the optimal prediction of 6-month incident AF, with an area under the receiver operating characteristic curve of 0.800 and an F1 score of 0.110. This model performed only slightly better than a more basic logistic regression model composed of known clinical risk factors for AF, which had an area under the receiver operating characteristic curve of 0.794 and an F1 score of 0.079. CONCLUSIONS AND RELEVANCE: Machine learning approaches to electronic health record data offer a promising method for improving risk prediction for incident AF, but more work is needed to show improvement beyond standard risk factors. |
format | Online Article Text |
id | pubmed-6991266 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Medical Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-69912662020-02-11 Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation Tiwari, Premanand Colborn, Kathryn L. Smith, Derek E. Xing, Fuyong Ghosh, Debashis Rosenberg, Michael A. JAMA Netw Open Original Investigation IMPORTANCE: Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia, and its early detection could lead to significant improvements in outcomes through the appropriate prescription of anticoagulation medication. Although a variety of methods exist for screening for AF, a targeted approach, which requires an efficient method for identifying patients at risk, would be preferred. OBJECTIVE: To examine machine learning approaches applied to electronic health record data that have been harmonized to the Observational Medical Outcomes Partnership Common Data Model for identifying risk of AF. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study used data from 2 252 219 individuals cared for in the UCHealth hospital system, which comprises 3 large hospitals in Colorado, from January 1, 2011, to October 1, 2018. Initial analysis was performed in December 2018; follow-up analysis was performed in July 2019. EXPOSURES: All Observational Medical Outcomes Partnership Common Data Model–harmonized electronic health record features, including diagnoses, procedures, medications, age, and sex. MAIN OUTCOMES AND MEASURES: Classification of incident AF in designated 6-month intervals, adjudicated retrospectively, based on area under the receiver operating characteristic curve and F1 statistic. RESULTS: Of 2 252 219 individuals (1 225 533 [54.4%] women; mean [SD] age, 42.9 [22.3] years), 28 036 (1.2%) developed incident AF during a designated 6-month interval. The machine learning model that used the 200 most common electronic health record features, including age and sex, and random oversampling with a single-layer, fully connected neural network provided the optimal prediction of 6-month incident AF, with an area under the receiver operating characteristic curve of 0.800 and an F1 score of 0.110. This model performed only slightly better than a more basic logistic regression model composed of known clinical risk factors for AF, which had an area under the receiver operating characteristic curve of 0.794 and an F1 score of 0.079. CONCLUSIONS AND RELEVANCE: Machine learning approaches to electronic health record data offer a promising method for improving risk prediction for incident AF, but more work is needed to show improvement beyond standard risk factors. American Medical Association 2020-01-17 /pmc/articles/PMC6991266/ /pubmed/31951272 http://dx.doi.org/10.1001/jamanetworkopen.2019.19396 Text en Copyright 2020 Tiwari P et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License. |
spellingShingle | Original Investigation Tiwari, Premanand Colborn, Kathryn L. Smith, Derek E. Xing, Fuyong Ghosh, Debashis Rosenberg, Michael A. Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title | Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title_full | Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title_fullStr | Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title_full_unstemmed | Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title_short | Assessment of a Machine Learning Model Applied to Harmonized Electronic Health Record Data for the Prediction of Incident Atrial Fibrillation |
title_sort | assessment of a machine learning model applied to harmonized electronic health record data for the prediction of incident atrial fibrillation |
topic | Original Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6991266/ https://www.ncbi.nlm.nih.gov/pubmed/31951272 http://dx.doi.org/10.1001/jamanetworkopen.2019.19396 |
work_keys_str_mv | AT tiwaripremanand assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation AT colbornkathrynl assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation AT smithdereke assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation AT xingfuyong assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation AT ghoshdebashis assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation AT rosenbergmichaela assessmentofamachinelearningmodelappliedtoharmonizedelectronichealthrecorddataforthepredictionofincidentatrialfibrillation |