Cargando…

Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features

BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patt...

Descripción completa

Detalles Bibliográficos
Autores principales: Shanbehzadeh, Mostafa, Kazemi-Arpanahi, Hadi, Orooji, Azam, Mobarak, Sara, Jelvay, Saeed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer - Medknow 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8459865/
https://www.ncbi.nlm.nih.gov/pubmed/34667785
http://dx.doi.org/10.4103/jehp.jehp_1424_20
_version_ 1784571619367190528
author Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Orooji, Azam
Mobarak, Sara
Jelvay, Saeed
author_facet Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Orooji, Azam
Mobarak, Sara
Jelvay, Saeed
author_sort Shanbehzadeh, Mostafa
collection PubMed
description BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patterns from mining of huge raw datasets then it establishes high-quality predictive models. At this juncture, we aimed to apply different ML techniques to develop clinical predictive models and select the best performance of them. MATERIALS AND METHODS: The dataset of Ayatollah Talleghani hospital, COVID-19 focal center affiliated to Abadan University of Medical Sciences have been taken into consideration. The dataset used in this study consists of 501 case records with two classes (COVID-19 and non COVID-19) and 32 columns for the diagnostic features. ML algorithms such as Naïve Bayesian, Bayesian Net, random forest (RF), multilayer perceptron, K-star, C4.5, and support vector machine were developed. Then, the recital of selected ML models was assessed by the comparison of some performance indices such as accuracy, sensitivity, specificity, precision, F-score, and receiver operating characteristic (ROC). RESULTS: The experimental results indicate that RF algorithm with the accuracy of 92.42%, specificity of 75.70%, precision of 92.30%, sensitivity of 92.40%, F-measure of 92.00%, and ROC of 97.15% has the best capability for COVID-19 diagnosis and screening. CONCLUSION: The empirical results reveal that RF model yielded higher performance as compared to other six classification models. It is promising to the implementation of RF model in the health-care settings to increase the accuracy and speed of disease diagnosis for primary prevention, screening, surveillance, and early treatment.
format Online
Article
Text
id pubmed-8459865
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Wolters Kluwer - Medknow
record_format MEDLINE/PubMed
spelling pubmed-84598652021-10-18 Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Orooji, Azam Mobarak, Sara Jelvay, Saeed J Educ Health Promot Original Article BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patterns from mining of huge raw datasets then it establishes high-quality predictive models. At this juncture, we aimed to apply different ML techniques to develop clinical predictive models and select the best performance of them. MATERIALS AND METHODS: The dataset of Ayatollah Talleghani hospital, COVID-19 focal center affiliated to Abadan University of Medical Sciences have been taken into consideration. The dataset used in this study consists of 501 case records with two classes (COVID-19 and non COVID-19) and 32 columns for the diagnostic features. ML algorithms such as Naïve Bayesian, Bayesian Net, random forest (RF), multilayer perceptron, K-star, C4.5, and support vector machine were developed. Then, the recital of selected ML models was assessed by the comparison of some performance indices such as accuracy, sensitivity, specificity, precision, F-score, and receiver operating characteristic (ROC). RESULTS: The experimental results indicate that RF algorithm with the accuracy of 92.42%, specificity of 75.70%, precision of 92.30%, sensitivity of 92.40%, F-measure of 92.00%, and ROC of 97.15% has the best capability for COVID-19 diagnosis and screening. CONCLUSION: The empirical results reveal that RF model yielded higher performance as compared to other six classification models. It is promising to the implementation of RF model in the health-care settings to increase the accuracy and speed of disease diagnosis for primary prevention, screening, surveillance, and early treatment. Wolters Kluwer - Medknow 2021-08-31 /pmc/articles/PMC8459865/ /pubmed/34667785 http://dx.doi.org/10.4103/jehp.jehp_1424_20 Text en Copyright: © 2021 Journal of Education and Health Promotion https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Original Article
Shanbehzadeh, Mostafa
Kazemi-Arpanahi, Hadi
Orooji, Azam
Mobarak, Sara
Jelvay, Saeed
Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title_full Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title_fullStr Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title_full_unstemmed Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title_short Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
title_sort performance evaluation of selected machine learning algorithms for covid-19 prediction using routine clinical data: with versus without ct scan features
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8459865/
https://www.ncbi.nlm.nih.gov/pubmed/34667785
http://dx.doi.org/10.4103/jehp.jehp_1424_20
work_keys_str_mv AT shanbehzadehmostafa performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures
AT kazemiarpanahihadi performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures
AT oroojiazam performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures
AT mobaraksara performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures
AT jelvaysaeed performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures