Cargando…

Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation

BACKGROUND: The number of deaths from COVID-19 continues to surge worldwide. In particular, if a patient’s condition is sufficiently severe to require invasive ventilation, it is more likely to lead to death than to recovery. OBJECTIVE: The goal of our study was to analyze the factors related to COV...

Descripción completa

Detalles Bibliográficos
Autores principales: Chung, Heewon, Ko, Hoon, Kang, Wu Seong, Kim, Kyung Won, Lee, Hooseok, Park, Chul, Song, Hyun-Ok, Choi, Tae-Young, Seo, Jae Ho, Lee, Jinseok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8057199/
https://www.ncbi.nlm.nih.gov/pubmed/33764883
http://dx.doi.org/10.2196/27060
_version_ 1783680792415698944
author Chung, Heewon
Ko, Hoon
Kang, Wu Seong
Kim, Kyung Won
Lee, Hooseok
Park, Chul
Song, Hyun-Ok
Choi, Tae-Young
Seo, Jae Ho
Lee, Jinseok
author_facet Chung, Heewon
Ko, Hoon
Kang, Wu Seong
Kim, Kyung Won
Lee, Hooseok
Park, Chul
Song, Hyun-Ok
Choi, Tae-Young
Seo, Jae Ho
Lee, Jinseok
author_sort Chung, Heewon
collection PubMed
description BACKGROUND: The number of deaths from COVID-19 continues to surge worldwide. In particular, if a patient’s condition is sufficiently severe to require invasive ventilation, it is more likely to lead to death than to recovery. OBJECTIVE: The goal of our study was to analyze the factors related to COVID-19 severity in patients and to develop an artificial intelligence (AI) model to predict the severity of COVID-19 at an early stage. METHODS: We developed an AI model that predicts severity based on data from 5601 COVID-19 patients from all national and regional hospitals across South Korea as of April 2020. The clinical severity of COVID-19 was divided into two categories: low and high severity. The condition of patients in the low-severity group corresponded to no limit of activity, oxygen support with nasal prong or facial mask, and noninvasive ventilation. The condition of patients in the high-severity group corresponded to invasive ventilation, multi-organ failure with extracorporeal membrane oxygenation required, and death. For the AI model input, we used 37 variables from the medical records, including basic patient information, a physical index, initial examination findings, clinical findings, comorbid diseases, and general blood test results at an early stage. Feature importance analysis was performed with AdaBoost, random forest, and eXtreme Gradient Boosting (XGBoost); the AI model for predicting COVID-19 severity among patients was developed with a 5-layer deep neural network (DNN) with the 20 most important features, which were selected based on ranked feature importance analysis of 37 features from the comprehensive data set. The selection procedure was performed using sensitivity, specificity, accuracy, balanced accuracy, and area under the curve (AUC). RESULTS: We found that age was the most important factor for predicting disease severity, followed by lymphocyte level, platelet count, and shortness of breath or dyspnea. Our proposed 5-layer DNN with the 20 most important features provided high sensitivity (90.2%), specificity (90.4%), accuracy (90.4%), balanced accuracy (90.3%), and AUC (0.96). CONCLUSIONS: Our proposed AI model with the selected features was able to predict the severity of COVID-19 accurately. We also made a web application so that anyone can access the model. We believe that sharing the AI model with the public will be helpful in validating and improving its performance.
format Online
Article
Text
id pubmed-8057199
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-80571992021-05-07 Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation Chung, Heewon Ko, Hoon Kang, Wu Seong Kim, Kyung Won Lee, Hooseok Park, Chul Song, Hyun-Ok Choi, Tae-Young Seo, Jae Ho Lee, Jinseok J Med Internet Res Original Paper BACKGROUND: The number of deaths from COVID-19 continues to surge worldwide. In particular, if a patient’s condition is sufficiently severe to require invasive ventilation, it is more likely to lead to death than to recovery. OBJECTIVE: The goal of our study was to analyze the factors related to COVID-19 severity in patients and to develop an artificial intelligence (AI) model to predict the severity of COVID-19 at an early stage. METHODS: We developed an AI model that predicts severity based on data from 5601 COVID-19 patients from all national and regional hospitals across South Korea as of April 2020. The clinical severity of COVID-19 was divided into two categories: low and high severity. The condition of patients in the low-severity group corresponded to no limit of activity, oxygen support with nasal prong or facial mask, and noninvasive ventilation. The condition of patients in the high-severity group corresponded to invasive ventilation, multi-organ failure with extracorporeal membrane oxygenation required, and death. For the AI model input, we used 37 variables from the medical records, including basic patient information, a physical index, initial examination findings, clinical findings, comorbid diseases, and general blood test results at an early stage. Feature importance analysis was performed with AdaBoost, random forest, and eXtreme Gradient Boosting (XGBoost); the AI model for predicting COVID-19 severity among patients was developed with a 5-layer deep neural network (DNN) with the 20 most important features, which were selected based on ranked feature importance analysis of 37 features from the comprehensive data set. The selection procedure was performed using sensitivity, specificity, accuracy, balanced accuracy, and area under the curve (AUC). RESULTS: We found that age was the most important factor for predicting disease severity, followed by lymphocyte level, platelet count, and shortness of breath or dyspnea. Our proposed 5-layer DNN with the 20 most important features provided high sensitivity (90.2%), specificity (90.4%), accuracy (90.4%), balanced accuracy (90.3%), and AUC (0.96). CONCLUSIONS: Our proposed AI model with the selected features was able to predict the severity of COVID-19 accurately. We also made a web application so that anyone can access the model. We believe that sharing the AI model with the public will be helpful in validating and improving its performance. JMIR Publications 2021-04-19 /pmc/articles/PMC8057199/ /pubmed/33764883 http://dx.doi.org/10.2196/27060 Text en ©Heewon Chung, Hoon Ko, Wu Seong Kang, Kyung Won Kim, Hooseok Lee, Chul Park, Hyun-Ok Song, Tae-Young Choi, Jae Ho Seo, Jinseok Lee. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 19.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Chung, Heewon
Ko, Hoon
Kang, Wu Seong
Kim, Kyung Won
Lee, Hooseok
Park, Chul
Song, Hyun-Ok
Choi, Tae-Young
Seo, Jae Ho
Lee, Jinseok
Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title_full Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title_fullStr Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title_full_unstemmed Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title_short Prediction and Feature Importance Analysis for Severity of COVID-19 in South Korea Using Artificial Intelligence: Model Development and Validation
title_sort prediction and feature importance analysis for severity of covid-19 in south korea using artificial intelligence: model development and validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8057199/
https://www.ncbi.nlm.nih.gov/pubmed/33764883
http://dx.doi.org/10.2196/27060
work_keys_str_mv AT chungheewon predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT kohoon predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT kangwuseong predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT kimkyungwon predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT leehooseok predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT parkchul predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT songhyunok predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT choitaeyoung predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT seojaeho predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation
AT leejinseok predictionandfeatureimportanceanalysisforseverityofcovid19insouthkoreausingartificialintelligencemodeldevelopmentandvalidation