Cargando…
Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features
BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patt...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer - Medknow
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8459865/ https://www.ncbi.nlm.nih.gov/pubmed/34667785 http://dx.doi.org/10.4103/jehp.jehp_1424_20 |
_version_ | 1784571619367190528 |
---|---|
author | Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Orooji, Azam Mobarak, Sara Jelvay, Saeed |
author_facet | Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Orooji, Azam Mobarak, Sara Jelvay, Saeed |
author_sort | Shanbehzadeh, Mostafa |
collection | PubMed |
description | BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patterns from mining of huge raw datasets then it establishes high-quality predictive models. At this juncture, we aimed to apply different ML techniques to develop clinical predictive models and select the best performance of them. MATERIALS AND METHODS: The dataset of Ayatollah Talleghani hospital, COVID-19 focal center affiliated to Abadan University of Medical Sciences have been taken into consideration. The dataset used in this study consists of 501 case records with two classes (COVID-19 and non COVID-19) and 32 columns for the diagnostic features. ML algorithms such as Naïve Bayesian, Bayesian Net, random forest (RF), multilayer perceptron, K-star, C4.5, and support vector machine were developed. Then, the recital of selected ML models was assessed by the comparison of some performance indices such as accuracy, sensitivity, specificity, precision, F-score, and receiver operating characteristic (ROC). RESULTS: The experimental results indicate that RF algorithm with the accuracy of 92.42%, specificity of 75.70%, precision of 92.30%, sensitivity of 92.40%, F-measure of 92.00%, and ROC of 97.15% has the best capability for COVID-19 diagnosis and screening. CONCLUSION: The empirical results reveal that RF model yielded higher performance as compared to other six classification models. It is promising to the implementation of RF model in the health-care settings to increase the accuracy and speed of disease diagnosis for primary prevention, screening, surveillance, and early treatment. |
format | Online Article Text |
id | pubmed-8459865 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Wolters Kluwer - Medknow |
record_format | MEDLINE/PubMed |
spelling | pubmed-84598652021-10-18 Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Orooji, Azam Mobarak, Sara Jelvay, Saeed J Educ Health Promot Original Article BACKGROUND: Given coronavirus disease (COVID-19's) unknown nature, diagnosis, and treatment is very complex up to the present time. Thus, it is essential to have a framework for an early prediction of the disease. In this regard, machines learning (ML) could be crucial to extract concealed patterns from mining of huge raw datasets then it establishes high-quality predictive models. At this juncture, we aimed to apply different ML techniques to develop clinical predictive models and select the best performance of them. MATERIALS AND METHODS: The dataset of Ayatollah Talleghani hospital, COVID-19 focal center affiliated to Abadan University of Medical Sciences have been taken into consideration. The dataset used in this study consists of 501 case records with two classes (COVID-19 and non COVID-19) and 32 columns for the diagnostic features. ML algorithms such as Naïve Bayesian, Bayesian Net, random forest (RF), multilayer perceptron, K-star, C4.5, and support vector machine were developed. Then, the recital of selected ML models was assessed by the comparison of some performance indices such as accuracy, sensitivity, specificity, precision, F-score, and receiver operating characteristic (ROC). RESULTS: The experimental results indicate that RF algorithm with the accuracy of 92.42%, specificity of 75.70%, precision of 92.30%, sensitivity of 92.40%, F-measure of 92.00%, and ROC of 97.15% has the best capability for COVID-19 diagnosis and screening. CONCLUSION: The empirical results reveal that RF model yielded higher performance as compared to other six classification models. It is promising to the implementation of RF model in the health-care settings to increase the accuracy and speed of disease diagnosis for primary prevention, screening, surveillance, and early treatment. Wolters Kluwer - Medknow 2021-08-31 /pmc/articles/PMC8459865/ /pubmed/34667785 http://dx.doi.org/10.4103/jehp.jehp_1424_20 Text en Copyright: © 2021 Journal of Education and Health Promotion https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms. |
spellingShingle | Original Article Shanbehzadeh, Mostafa Kazemi-Arpanahi, Hadi Orooji, Azam Mobarak, Sara Jelvay, Saeed Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title | Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title_full | Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title_fullStr | Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title_full_unstemmed | Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title_short | Performance evaluation of selected machine learning algorithms for COVID-19 prediction using routine clinical data: With versus Without CT scan features |
title_sort | performance evaluation of selected machine learning algorithms for covid-19 prediction using routine clinical data: with versus without ct scan features |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8459865/ https://www.ncbi.nlm.nih.gov/pubmed/34667785 http://dx.doi.org/10.4103/jehp.jehp_1424_20 |
work_keys_str_mv | AT shanbehzadehmostafa performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures AT kazemiarpanahihadi performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures AT oroojiazam performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures AT mobaraksara performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures AT jelvaysaeed performanceevaluationofselectedmachinelearningalgorithmsforcovid19predictionusingroutineclinicaldatawithversuswithoutctscanfeatures |