Cargando…
Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach
BACKGROUND: In-hospital cardiac arrest (IHCA) is associated with high mortality and health care costs in the recovery phase. Predicting adverse outcome events, including readmission, improves the chance for appropriate interventions and reduces health care costs. However, studies related to the earl...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8477292/ https://www.ncbi.nlm.nih.gov/pubmed/34515639 http://dx.doi.org/10.2196/27798 |
_version_ | 1784575813789679616 |
---|---|
author | Chi, Chien-Yu Ao, Shuang Winkler, Adrian Fu, Kuan-Chun Xu, Jie Ho, Yi-Lwun Huang, Chien-Hua Soltani, Rohollah |
author_facet | Chi, Chien-Yu Ao, Shuang Winkler, Adrian Fu, Kuan-Chun Xu, Jie Ho, Yi-Lwun Huang, Chien-Hua Soltani, Rohollah |
author_sort | Chi, Chien-Yu |
collection | PubMed |
description | BACKGROUND: In-hospital cardiac arrest (IHCA) is associated with high mortality and health care costs in the recovery phase. Predicting adverse outcome events, including readmission, improves the chance for appropriate interventions and reduces health care costs. However, studies related to the early prediction of adverse events of IHCA survivors are rare. Therefore, we used a deep learning model for prediction in this study. OBJECTIVE: This study aimed to demonstrate that with the proper data set and learning strategies, we can predict the 30-day mortality and readmission of IHCA survivors based on their historical claims. METHODS: National Health Insurance Research Database claims data, including 168,693 patients who had experienced IHCA at least once and 1,569,478 clinical records, were obtained to generate a data set for outcome prediction. We predicted the 30-day mortality/readmission after each current record (ALL-mortality/ALL-readmission) and 30-day mortality/readmission after IHCA (cardiac arrest [CA]-mortality/CA-readmission). We developed a hierarchical vectorizer (HVec) deep learning model to extract patients’ information and predict mortality and readmission. To embed the textual medical concepts of the clinical records into our deep learning model, we used Text2Node to compute the distributed representations of all medical concept codes as a 128-dimensional vector. Along with the patient’s demographic information, our novel HVec model generated embedding vectors to hierarchically describe the health status at the record-level and patient-level. Multitask learning involving two main tasks and auxiliary tasks was proposed. As CA-mortality and CA-readmission were rare, person upsampling of patients with CA and weighting of CA records were used to improve prediction performance. RESULTS: With the multitask learning setting in the model learning process, we achieved an area under the receiver operating characteristic of 0.752 for CA-mortality, 0.711 for ALL-mortality, 0.852 for CA-readmission, and 0.889 for ALL-readmission. The area under the receiver operating characteristic was improved to 0.808 for CA-mortality and 0.862 for CA-readmission after solving the extremely imbalanced issue for CA-mortality/CA-readmission by upsampling and weighting. CONCLUSIONS: This study demonstrated the potential of predicting future outcomes for IHCA survivors by machine learning. The results showed that our proposed approach could effectively alleviate data imbalance problems and train a better model for outcome prediction. |
format | Online Article Text |
id | pubmed-8477292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-84772922021-10-18 Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach Chi, Chien-Yu Ao, Shuang Winkler, Adrian Fu, Kuan-Chun Xu, Jie Ho, Yi-Lwun Huang, Chien-Hua Soltani, Rohollah J Med Internet Res Original Paper BACKGROUND: In-hospital cardiac arrest (IHCA) is associated with high mortality and health care costs in the recovery phase. Predicting adverse outcome events, including readmission, improves the chance for appropriate interventions and reduces health care costs. However, studies related to the early prediction of adverse events of IHCA survivors are rare. Therefore, we used a deep learning model for prediction in this study. OBJECTIVE: This study aimed to demonstrate that with the proper data set and learning strategies, we can predict the 30-day mortality and readmission of IHCA survivors based on their historical claims. METHODS: National Health Insurance Research Database claims data, including 168,693 patients who had experienced IHCA at least once and 1,569,478 clinical records, were obtained to generate a data set for outcome prediction. We predicted the 30-day mortality/readmission after each current record (ALL-mortality/ALL-readmission) and 30-day mortality/readmission after IHCA (cardiac arrest [CA]-mortality/CA-readmission). We developed a hierarchical vectorizer (HVec) deep learning model to extract patients’ information and predict mortality and readmission. To embed the textual medical concepts of the clinical records into our deep learning model, we used Text2Node to compute the distributed representations of all medical concept codes as a 128-dimensional vector. Along with the patient’s demographic information, our novel HVec model generated embedding vectors to hierarchically describe the health status at the record-level and patient-level. Multitask learning involving two main tasks and auxiliary tasks was proposed. As CA-mortality and CA-readmission were rare, person upsampling of patients with CA and weighting of CA records were used to improve prediction performance. RESULTS: With the multitask learning setting in the model learning process, we achieved an area under the receiver operating characteristic of 0.752 for CA-mortality, 0.711 for ALL-mortality, 0.852 for CA-readmission, and 0.889 for ALL-readmission. The area under the receiver operating characteristic was improved to 0.808 for CA-mortality and 0.862 for CA-readmission after solving the extremely imbalanced issue for CA-mortality/CA-readmission by upsampling and weighting. CONCLUSIONS: This study demonstrated the potential of predicting future outcomes for IHCA survivors by machine learning. The results showed that our proposed approach could effectively alleviate data imbalance problems and train a better model for outcome prediction. JMIR Publications 2021-09-13 /pmc/articles/PMC8477292/ /pubmed/34515639 http://dx.doi.org/10.2196/27798 Text en ©Chien-Yu Chi, Shuang Ao, Adrian Winkler, Kuan-Chun Fu, Jie Xu, Yi-Lwun Ho, Chien-Hua Huang, Rohollah Soltani. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 13.09.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Chi, Chien-Yu Ao, Shuang Winkler, Adrian Fu, Kuan-Chun Xu, Jie Ho, Yi-Lwun Huang, Chien-Hua Soltani, Rohollah Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title | Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title_full | Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title_fullStr | Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title_full_unstemmed | Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title_short | Predicting the Mortality and Readmission of In-Hospital Cardiac Arrest Patients With Electronic Health Records: A Machine Learning Approach |
title_sort | predicting the mortality and readmission of in-hospital cardiac arrest patients with electronic health records: a machine learning approach |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8477292/ https://www.ncbi.nlm.nih.gov/pubmed/34515639 http://dx.doi.org/10.2196/27798 |
work_keys_str_mv | AT chichienyu predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT aoshuang predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT winkleradrian predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT fukuanchun predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT xujie predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT hoyilwun predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT huangchienhua predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach AT soltanirohollah predictingthemortalityandreadmissionofinhospitalcardiacarrestpatientswithelectronichealthrecordsamachinelearningapproach |