Cargando…

Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data

Background: The novel 2019 Coronavirus disease (COVID-19) poses a great threat to global public health and the economy. The earlier detection of COVID-19 is the key to its treatment and mitigating the transmission of the virus. Given that Machine Learning (ML) could be potentially useful in COVID-19...

Descripción completa

Detalles Bibliográficos
Autores principales: Shanbehzadeh, Mostafa, Kazemi-Arpanahi, Hadi, Nopour, Raoof
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Iran University of Medical Sciences 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214035/
https://www.ncbi.nlm.nih.gov/pubmed/34169041
http://dx.doi.org/10.47176/mjiri.35.29
_version_ 1783709976488837120
author Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Nopour, Raoof
author_facet Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Nopour, Raoof
author_sort Shanbehzadeh, Mostafa
collection PubMed
description Background: The novel 2019 Coronavirus disease (COVID-19) poses a great threat to global public health and the economy. The earlier detection of COVID-19 is the key to its treatment and mitigating the transmission of the virus. Given that Machine Learning (ML) could be potentially useful in COVID-19 identification, we compared 7 decision tree (DT) algorithms to select the best clinical diagnostic model. Methods: A hospital-based retrospective dataset was used to train the selected DT algorithms. The performance of DT models was measured using performance criteria, such as accuracy, sensitivity, specificity, receiver operating characteristic (ROC), and precision-recall curves (PRC). Finally, the best decision model was obtained based on comparing the mentioned performance criteria. Results: Based on the Gini Index (GI) scoring model, 13 diagnostic criteria, including the lung lesion existence (GI= 0217), fever (GI= 0.205), history of contact with suspected people (GI= 0.188), O(2) saturation rate in the blood (GI= 0.181), rhinorrhea (GI= 0.177), dyspnea (GI = 0.177), cough (GI = 0.159), history of taking the immunosuppressive drug (GI= 0.145), history of respiratory failure (ARDS) (GI= 0.141), lung lesion situation (GI= 0.133) and appearance (GI= 0.126), diarrhea (GI= 0.112), and nausea and vomiting (GI = 0.092) have been obtained as the most important criteria in diagnosing COVID-19. The results indicated that the J-48, with the accuracy= 0.85, F-Score= 0.85, ROC= 0.926, and PRC= 0.93, had the best performance for diagnosing COVID-19. Conclusion: According to the empirical results, it is promising to implement J-48 in health care settings to increase the accuracy and speed of COVID-19 diagnosis.
format Online
Article
Text
id pubmed-8214035
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Iran University of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-82140352021-06-23 Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Nopour, Raoof Med J Islam Repub Iran Original Article Background: The novel 2019 Coronavirus disease (COVID-19) poses a great threat to global public health and the economy. The earlier detection of COVID-19 is the key to its treatment and mitigating the transmission of the virus. Given that Machine Learning (ML) could be potentially useful in COVID-19 identification, we compared 7 decision tree (DT) algorithms to select the best clinical diagnostic model. Methods: A hospital-based retrospective dataset was used to train the selected DT algorithms. The performance of DT models was measured using performance criteria, such as accuracy, sensitivity, specificity, receiver operating characteristic (ROC), and precision-recall curves (PRC). Finally, the best decision model was obtained based on comparing the mentioned performance criteria. Results: Based on the Gini Index (GI) scoring model, 13 diagnostic criteria, including the lung lesion existence (GI= 0217), fever (GI= 0.205), history of contact with suspected people (GI= 0.188), O(2) saturation rate in the blood (GI= 0.181), rhinorrhea (GI= 0.177), dyspnea (GI = 0.177), cough (GI = 0.159), history of taking the immunosuppressive drug (GI= 0.145), history of respiratory failure (ARDS) (GI= 0.141), lung lesion situation (GI= 0.133) and appearance (GI= 0.126), diarrhea (GI= 0.112), and nausea and vomiting (GI = 0.092) have been obtained as the most important criteria in diagnosing COVID-19. The results indicated that the J-48, with the accuracy= 0.85, F-Score= 0.85, ROC= 0.926, and PRC= 0.93, had the best performance for diagnosing COVID-19. Conclusion: According to the empirical results, it is promising to implement J-48 in health care settings to increase the accuracy and speed of COVID-19 diagnosis. Iran University of Medical Sciences 2021-03-01 /pmc/articles/PMC8214035/ /pubmed/34169041 http://dx.doi.org/10.47176/mjiri.35.29 Text en © 2021 Iran University of Medical Sciences https://creativecommons.org/licenses/by-nc-sa/1.0/This is an open-access article distributed under the terms of the Creative Commons Attribution NonCommercial-ShareAlike 1.0 License (CC BY-NC-SA 1.0), which allows users to read, copy, distribute and make derivative works for non-commercial purposes from the material, as long as the author of the original work is cited properly.
spellingShingle Original Article
Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Nopour, Raoof
Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title_full Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title_fullStr Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title_full_unstemmed Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title_short Performance evaluation of selected decision tree algorithms for COVID-19 diagnosis using routine clinical data
title_sort performance evaluation of selected decision tree algorithms for covid-19 diagnosis using routine clinical data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8214035/
https://www.ncbi.nlm.nih.gov/pubmed/34169041
http://dx.doi.org/10.47176/mjiri.35.29
work_keys_str_mv AT shanbehzadehmostafa performanceevaluationofselecteddecisiontreealgorithmsforcovid19diagnosisusingroutineclinicaldata
AT kazemiarpanahihadi performanceevaluationofselecteddecisiontreealgorithmsforcovid19diagnosisusingroutineclinicaldata
AT nopourraoof performanceevaluationofselecteddecisiontreealgorithmsforcovid19diagnosisusingroutineclinicaldata