Cargando…

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machin...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Donghun, Kim, Jimin, Yoo, Junsang, Cha, Won Chul, Paik, Hyojung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244654/
https://www.ncbi.nlm.nih.gov/pubmed/35704364
http://dx.doi.org/10.2196/37689
_version_ 1784738575416295424
author Yang, Donghun
Kim, Jimin
Yoo, Junsang
Cha, Won Chul
Paik, Hyojung
author_facet Yang, Donghun
Kim, Jimin
Yoo, Junsang
Cha, Won Chul
Paik, Hyojung
author_sort Yang, Donghun
collection PubMed
description BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machine learning–based approach to identify the risk of sepsis in patients with cancer using electronic health records (EHRs). METHODS: We utilized deidentified anonymized EHRs of 8580 patients with cancer from the Samsung Medical Center in Korea in a longitudinal manner between 2014 and 2019. To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2462 laboratory test results and 2266 medication prescriptions using graph network and statistical analyses. The medication relationships and lab test results from each analysis were used as additional learning features to train our predictive model. RESULTS: Patients with sepsis showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab tests, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (P<.001). Our model outperformed the model trained using only common EHRs, showing an improved accuracy, area under the receiver operating characteristic (AUROC), and F1 score by 11.9%, 11.3%, and 13.6%, respectively. For the random forest–based model, the accuracy, AUROC, and F1 score were 0.692, 0.753, and 0.602, respectively. CONCLUSIONS: We showed that lab tests and medication relationships can be used as efficient features for predicting sepsis in patients with cancer. Consequently, identifying the risk of sepsis in patients with cancer using EHRs and machine learning is feasible.
format Online
Article
Text
id pubmed-9244654
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-92446542022-07-01 Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach Yang, Donghun Kim, Jimin Yoo, Junsang Cha, Won Chul Paik, Hyojung JMIR Med Inform Original Paper BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machine learning–based approach to identify the risk of sepsis in patients with cancer using electronic health records (EHRs). METHODS: We utilized deidentified anonymized EHRs of 8580 patients with cancer from the Samsung Medical Center in Korea in a longitudinal manner between 2014 and 2019. To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2462 laboratory test results and 2266 medication prescriptions using graph network and statistical analyses. The medication relationships and lab test results from each analysis were used as additional learning features to train our predictive model. RESULTS: Patients with sepsis showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab tests, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (P<.001). Our model outperformed the model trained using only common EHRs, showing an improved accuracy, area under the receiver operating characteristic (AUROC), and F1 score by 11.9%, 11.3%, and 13.6%, respectively. For the random forest–based model, the accuracy, AUROC, and F1 score were 0.692, 0.753, and 0.602, respectively. CONCLUSIONS: We showed that lab tests and medication relationships can be used as efficient features for predicting sepsis in patients with cancer. Consequently, identifying the risk of sepsis in patients with cancer using EHRs and machine learning is feasible. JMIR Publications 2022-06-15 /pmc/articles/PMC9244654/ /pubmed/35704364 http://dx.doi.org/10.2196/37689 Text en ©Donghun Yang, Jimin Kim, Junsang Yoo, Won Chul Cha, Hyojung Paik. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 15.06.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Yang, Donghun
Kim, Jimin
Yoo, Junsang
Cha, Won Chul
Paik, Hyojung
Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_full Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_fullStr Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_full_unstemmed Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_short Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_sort identifying the risk of sepsis in patients with cancer using digital health care records: machine learning–based approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244654/
https://www.ncbi.nlm.nih.gov/pubmed/35704364
http://dx.doi.org/10.2196/37689
work_keys_str_mv AT yangdonghun identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach
AT kimjimin identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach
AT yoojunsang identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach
AT chawonchul identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach
AT paikhyojung identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach