
Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation

BACKGROUND: Although there is a growing interest in prediction models based on electronic medical records (EMRs) to identify patients at risk of adverse cardiac events following invasive coronary treatment, robust models fully utilizing EMR data are limited. OBJECTIVE: We aimed to develop and valida...


Bibliographic Details
Main authors: Kwon, Osung, Na, Wonjun, Kang, Heejun, Jun, Tae Joon, Kweon, Jihoon, Park, Gyung-Min, Cho, YongHyun, Hur, Cinyoung, Chae, Jungwoo, Kang, Do-Yoon, Lee, Pil Hyung, Ahn, Jung-Min, Park, Duk-Woo, Kang, Soo-Jin, Lee, Seung-Whan, Lee, Cheol Whan, Park, Seong-Wook, Park, Seung-Jung, Yang, Dong Hyun, Kim, Young-Hak
Format: Online Article Text
Language: English
Published: JMIR Publications 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133980/
https://www.ncbi.nlm.nih.gov/pubmed/35544292
http://dx.doi.org/10.2196/26801
author Kwon, Osung
Na, Wonjun
Kang, Heejun
Jun, Tae Joon
Kweon, Jihoon
Park, Gyung-Min
Cho, YongHyun
Hur, Cinyoung
Chae, Jungwoo
Kang, Do-Yoon
Lee, Pil Hyung
Ahn, Jung-Min
Park, Duk-Woo
Kang, Soo-Jin
Lee, Seung-Whan
Lee, Cheol Whan
Park, Seong-Wook
Park, Seung-Jung
Yang, Dong Hyun
Kim, Young-Hak
author_sort Kwon, Osung
collection PubMed
description BACKGROUND: Although there is a growing interest in prediction models based on electronic medical records (EMRs) to identify patients at risk of adverse cardiac events following invasive coronary treatment, robust models fully utilizing EMR data are limited. OBJECTIVE: We aimed to develop and validate machine learning (ML) models by using diverse fields of EMR to predict the risk of 30-day adverse cardiac events after percutaneous intervention or bypass surgery. METHODS: EMR data of 5,184,565 records of 16,793 patients at a quaternary hospital between 2006 and 2016 were categorized into static basic (eg, demographics), dynamic time-series (eg, laboratory values), and cardiac-specific data (eg, coronary angiography). The data were randomly split into training, tuning, and testing sets in a ratio of 3:1:1. Each model was evaluated with 5-fold cross-validation and with an external EMR-based cohort at a tertiary hospital. Logistic regression (LR), random forest (RF), gradient boosting machine (GBM), and feedforward neural network (FNN) algorithms were applied. The primary outcome was 30-day mortality following invasive treatment. RESULTS: GBM showed the best performance with area under the receiver operating characteristic curve (AUROC) of 0.99; RF had a similar AUROC of 0.98. AUROCs of FNN and LR were 0.96 and 0.93, respectively. GBM had the highest area under the precision-recall curve (AUPRC) of 0.80, and the AUPRCs of RF, LR, and FNN were 0.73, 0.68, and 0.63, respectively. All models showed low Brier scores of <0.1 as well as highly fitted calibration plots, indicating a good fit of the ML-based models. On external validation, the GBM model demonstrated maximal performance with an AUROC of 0.90, while FNN had an AUROC of 0.85. The AUROCs of LR and RF were slightly lower at 0.80 and 0.79, respectively. The AUPRCs of GBM, LR, and FNN were similar at 0.47, 0.43, and 0.41, respectively, while that of RF was lower at 0.33. 
Among the categories in the GBM model, time-series dynamic data demonstrated a high AUROC of >0.95, contributing majorly to the excellent results. CONCLUSIONS: Exploiting the diverse fields of the EMR data set, the ML-based 30-day adverse cardiac event prediction models demonstrated outstanding results, and the applied framework could be generalized for various health care prediction models.
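The evaluation steps named in the abstract (a random 3:1:1 split, AUROC, and the Brier score) can be illustrated with a minimal sketch. This is not the authors' code: the function names are hypothetical, the split is assumed to be done over patient identifiers (the abstract does not say whether records or patients were split), and AUROC is computed with the plain pairwise definition rather than any particular library routine.

```python
import random


def split_3_1_1(ids, seed=0):
    """Shuffle IDs and split them 3:1:1 into training, tuning, and testing sets.

    Assumption: the split unit is a patient ID; the source only says the
    data were "randomly split".
    """
    ids = list(ids)
    random.Random(seed).shuffle(ids)
    n = len(ids)
    n_train = (3 * n) // 5
    n_tune = n // 5
    train = ids[:n_train]
    tune = ids[n_train:n_train + n_tune]
    test = ids[n_train + n_tune:]
    return train, tune, test


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and 0/1 outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auroc(y_true, y_prob):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))
```

A Brier score near 0 means predicted probabilities sit close to the observed outcomes, which is why the abstract reads scores below 0.1 as good fit. The pairwise AUROC above is quadratic in the number of patients, so it suits illustration rather than a cohort of this size.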
format Online
Article
Text
id pubmed-9133980
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-9133980 2022-05-27 Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation. JMIR Med Inform. Original Paper. JMIR Publications 2022-05-11 /pmc/articles/PMC9133980/ /pubmed/35544292 http://dx.doi.org/10.2196/26801 Text en ©Osung Kwon, Wonjun Na, Heejun Kang, Tae Joon Jun, Jihoon Kweon, Gyung-Min Park, YongHyun Cho, Cinyoung Hur, Jungwoo Chae, Do-Yoon Kang, Pil Hyung Lee, Jung-Min Ahn, Duk-Woo Park, Soo-Jin Kang, Seung-Whan Lee, Cheol Whan Lee, Seong-Wook Park, Seung-Jung Park, Dong Hyun Yang, Young-Hak Kim. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.05.2022. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
title Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9133980/
https://www.ncbi.nlm.nih.gov/pubmed/35544292
http://dx.doi.org/10.2196/26801