Cargando…

Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods

The monkeypox virus poses a novel public health risk that might quickly escalate into a worldwide epidemic. Machine learning (ML) has recently shown much promise in diagnosing diseases like cancer, finding tumor cells, and finding COVID-19 patients. In this study, we have created a dataset based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Farzipour, Alireza, Elmi, Roya, Nasiri, Hamid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378557/
https://www.ncbi.nlm.nih.gov/pubmed/37510135
http://dx.doi.org/10.3390/diagnostics13142391
_version_ 1785079796803305472
author Farzipour, Alireza
Elmi, Roya
Nasiri, Hamid
author_facet Farzipour, Alireza
Elmi, Roya
Nasiri, Hamid
author_sort Farzipour, Alireza
collection PubMed
description The monkeypox virus poses a novel public health risk that might quickly escalate into a worldwide epidemic. Machine learning (ML) has recently shown much promise in diagnosing diseases like cancer, finding tumor cells, and finding COVID-19 patients. In this study, we have created a dataset based on the data both collected and published by Global Health and used by the World Health Organization (WHO). Being entirely textual, this dataset shows the relationship between the symptoms and the monkeypox disease. The data have been analyzed, using gradient boosting methods such as Extreme Gradient Boosting (XGBoost), CatBoost, and LightGBM along with other standard machine learning methods such as Support Vector Machine (SVM) and Random Forest. All these methods have been compared. The research aims to provide an ML model based on symptoms for the diagnosis of monkeypox. Previous studies have only examined disease diagnosis using images. The best performance has belonged to XGBoost, with an accuracy of 1.0 in reviews. To check the model’s flexibility, k-fold cross-validation is used, reaching an average accuracy of 0.9 in 5 different splits of the test set. In addition, Shapley Additive Explanations (SHAP) helps in examining and explaining the output of the XGBoost model.
format Online
Article
Text
id pubmed-10378557
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103785572023-07-29 Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods Farzipour, Alireza Elmi, Roya Nasiri, Hamid Diagnostics (Basel) Article The monkeypox virus poses a novel public health risk that might quickly escalate into a worldwide epidemic. Machine learning (ML) has recently shown much promise in diagnosing diseases like cancer, finding tumor cells, and finding COVID-19 patients. In this study, we have created a dataset based on the data both collected and published by Global Health and used by the World Health Organization (WHO). Being entirely textual, this dataset shows the relationship between the symptoms and the monkeypox disease. The data have been analyzed, using gradient boosting methods such as Extreme Gradient Boosting (XGBoost), CatBoost, and LightGBM along with other standard machine learning methods such as Support Vector Machine (SVM) and Random Forest. All these methods have been compared. The research aims to provide an ML model based on symptoms for the diagnosis of monkeypox. Previous studies have only examined disease diagnosis using images. The best performance has belonged to XGBoost, with an accuracy of 1.0 in reviews. To check the model’s flexibility, k-fold cross-validation is used, reaching an average accuracy of 0.9 in 5 different splits of the test set. In addition, Shapley Additive Explanations (SHAP) helps in examining and explaining the output of the XGBoost model. MDPI 2023-07-17 /pmc/articles/PMC10378557/ /pubmed/37510135 http://dx.doi.org/10.3390/diagnostics13142391 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Farzipour, Alireza
Elmi, Roya
Nasiri, Hamid
Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title_full Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title_fullStr Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title_full_unstemmed Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title_short Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods
title_sort detection of monkeypox cases based on symptoms using xgboost and shapley additive explanations methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10378557/
https://www.ncbi.nlm.nih.gov/pubmed/37510135
http://dx.doi.org/10.3390/diagnostics13142391
work_keys_str_mv AT farzipouralireza detectionofmonkeypoxcasesbasedonsymptomsusingxgboostandshapleyadditiveexplanationsmethods
AT elmiroya detectionofmonkeypoxcasesbasedonsymptomsusingxgboostandshapleyadditiveexplanationsmethods
AT nasirihamid detectionofmonkeypoxcasesbasedonsymptomsusingxgboostandshapleyadditiveexplanationsmethods