Cargando…

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machin...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yang, Donghun, Kim, Jimin, Yoo, Junsang, Cha, Won Chul, Paik, Hyojung
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244654/ https://www.ncbi.nlm.nih.gov/pubmed/35704364 http://dx.doi.org/10.2196/37689

_version_	1784738575416295424
author	Yang, Donghun Kim, Jimin Yoo, Junsang Cha, Won Chul Paik, Hyojung
author_facet	Yang, Donghun Kim, Jimin Yoo, Junsang Cha, Won Chul Paik, Hyojung
author_sort	Yang, Donghun
collection	PubMed
description	BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machine learning–based approach to identify the risk of sepsis in patients with cancer using electronic health records (EHRs). METHODS: We utilized deidentified anonymized EHRs of 8580 patients with cancer from the Samsung Medical Center in Korea in a longitudinal manner between 2014 and 2019. To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2462 laboratory test results and 2266 medication prescriptions using graph network and statistical analyses. The medication relationships and lab test results from each analysis were used as additional learning features to train our predictive model. RESULTS: Patients with sepsis showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab tests, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (P<.001). Our model outperformed the model trained using only common EHRs, showing an improved accuracy, area under the receiver operating characteristic (AUROC), and F1 score by 11.9%, 11.3%, and 13.6%, respectively. For the random forest–based model, the accuracy, AUROC, and F1 score were 0.692, 0.753, and 0.602, respectively. CONCLUSIONS: We showed that lab tests and medication relationships can be used as efficient features for predicting sepsis in patients with cancer. Consequently, identifying the risk of sepsis in patients with cancer using EHRs and machine learning is feasible.
format	Online Article Text
id	pubmed-9244654
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-92446542022-07-01 Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach Yang, Donghun Kim, Jimin Yoo, Junsang Cha, Won Chul Paik, Hyojung JMIR Med Inform Original Paper BACKGROUND: Sepsis is diagnosed in millions of people every year, resulting in a high mortality rate. Although patients with sepsis present multimorbid conditions, including cancer, sepsis predictions have mainly focused on patients with severe injuries. OBJECTIVE: In this paper, we present a machine learning–based approach to identify the risk of sepsis in patients with cancer using electronic health records (EHRs). METHODS: We utilized deidentified anonymized EHRs of 8580 patients with cancer from the Samsung Medical Center in Korea in a longitudinal manner between 2014 and 2019. To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2462 laboratory test results and 2266 medication prescriptions using graph network and statistical analyses. The medication relationships and lab test results from each analysis were used as additional learning features to train our predictive model. RESULTS: Patients with sepsis showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab tests, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (P<.001). Our model outperformed the model trained using only common EHRs, showing an improved accuracy, area under the receiver operating characteristic (AUROC), and F1 score by 11.9%, 11.3%, and 13.6%, respectively. For the random forest–based model, the accuracy, AUROC, and F1 score were 0.692, 0.753, and 0.602, respectively. CONCLUSIONS: We showed that lab tests and medication relationships can be used as efficient features for predicting sepsis in patients with cancer. Consequently, identifying the risk of sepsis in patients with cancer using EHRs and machine learning is feasible. JMIR Publications 2022-06-15 /pmc/articles/PMC9244654/ /pubmed/35704364 http://dx.doi.org/10.2196/37689 Text en ©Donghun Yang, Jimin Kim, Junsang Yoo, Won Chul Cha, Hyojung Paik. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 15.06.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Yang, Donghun Kim, Jimin Yoo, Junsang Cha, Won Chul Paik, Hyojung Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title	Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_full	Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_fullStr	Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_full_unstemmed	Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_short	Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach
title_sort	identifying the risk of sepsis in patients with cancer using digital health care records: machine learning–based approach
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9244654/ https://www.ncbi.nlm.nih.gov/pubmed/35704364 http://dx.doi.org/10.2196/37689
work_keys_str_mv	AT yangdonghun identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach AT kimjimin identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach AT yoojunsang identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach AT chawonchul identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach AT paikhyojung identifyingtheriskofsepsisinpatientswithcancerusingdigitalhealthcarerecordsmachinelearningbasedapproach

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

Ejemplares similares