Cargando…

Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning

Dengue is a viral disease that primarily affects tropical and subtropical regions and is especially prevalent in South-East Asia. This mosquito-borne disease sometimes triggers nationwide epidemics, which results in a large number of fatalities. The development of Dengue Haemorrhagic Fever (DHF) is...

Descripción completa

Detalles Bibliográficos
Autores principales: Chowdhury, Shihab Uddin, Sayeed, Sanjana, Rashid, Iktisad, Alam, Md. Golam Rabiul, Masum, Abdul Kadar Muhammad, Dewan, M. Ali Akber
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9506144/
https://www.ncbi.nlm.nih.gov/pubmed/36135395
http://dx.doi.org/10.3390/jimaging8090229
_version_ 1784796649630990336
author Chowdhury, Shihab Uddin
Sayeed, Sanjana
Rashid, Iktisad
Alam, Md. Golam Rabiul
Masum, Abdul Kadar Muhammad
Dewan, M. Ali Akber
author_facet Chowdhury, Shihab Uddin
Sayeed, Sanjana
Rashid, Iktisad
Alam, Md. Golam Rabiul
Masum, Abdul Kadar Muhammad
Dewan, M. Ali Akber
author_sort Chowdhury, Shihab Uddin
collection PubMed
description Dengue is a viral disease that primarily affects tropical and subtropical regions and is especially prevalent in South-East Asia. This mosquito-borne disease sometimes triggers nationwide epidemics, which results in a large number of fatalities. The development of Dengue Haemorrhagic Fever (DHF) is where most cases occur, and a large portion of them are detected among children under the age of ten, with severe conditions often progressing to a critical state known as Dengue Shock Syndrome (DSS). In this study, we analysed two separate datasets from two different countries– Vietnam and Bangladesh, which we referred as VDengu and BDengue, respectively. For the VDengu dataset, as it was structured, supervised learning models were effective for predictive analysis, among which, the decision tree classifier XGBoost in particular produced the best outcome. Furthermore, Shapley Additive Explanation (SHAP) was used over the XGBoost model to assess the significance of individual attributes of the dataset. Among the significant attributes, we applied the SHAP dependence plot to identify the range for each attribute against the number of DHF or DSS cases. In parallel, the dataset from Bangladesh was unstructured; therefore, we applied an unsupervised learning technique, i.e., hierarchical clustering, to find clusters of vital blood components of the patients according to their complete blood count reports. The clusters were further analysed to find the attributes in the dataset that led to DSS or DHF.
format Online
Article
Text
id pubmed-9506144
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95061442022-09-24 Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning Chowdhury, Shihab Uddin Sayeed, Sanjana Rashid, Iktisad Alam, Md. Golam Rabiul Masum, Abdul Kadar Muhammad Dewan, M. Ali Akber J Imaging Article Dengue is a viral disease that primarily affects tropical and subtropical regions and is especially prevalent in South-East Asia. This mosquito-borne disease sometimes triggers nationwide epidemics, which results in a large number of fatalities. The development of Dengue Haemorrhagic Fever (DHF) is where most cases occur, and a large portion of them are detected among children under the age of ten, with severe conditions often progressing to a critical state known as Dengue Shock Syndrome (DSS). In this study, we analysed two separate datasets from two different countries– Vietnam and Bangladesh, which we referred as VDengu and BDengue, respectively. For the VDengu dataset, as it was structured, supervised learning models were effective for predictive analysis, among which, the decision tree classifier XGBoost in particular produced the best outcome. Furthermore, Shapley Additive Explanation (SHAP) was used over the XGBoost model to assess the significance of individual attributes of the dataset. Among the significant attributes, we applied the SHAP dependence plot to identify the range for each attribute against the number of DHF or DSS cases. In parallel, the dataset from Bangladesh was unstructured; therefore, we applied an unsupervised learning technique, i.e., hierarchical clustering, to find clusters of vital blood components of the patients according to their complete blood count reports. The clusters were further analysed to find the attributes in the dataset that led to DSS or DHF. MDPI 2022-08-26 /pmc/articles/PMC9506144/ /pubmed/36135395 http://dx.doi.org/10.3390/jimaging8090229 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chowdhury, Shihab Uddin
Sayeed, Sanjana
Rashid, Iktisad
Alam, Md. Golam Rabiul
Masum, Abdul Kadar Muhammad
Dewan, M. Ali Akber
Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title_full Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title_fullStr Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title_full_unstemmed Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title_short Shapley-Additive-Explanations-Based Factor Analysis for Dengue Severity Prediction using Machine Learning
title_sort shapley-additive-explanations-based factor analysis for dengue severity prediction using machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9506144/
https://www.ncbi.nlm.nih.gov/pubmed/36135395
http://dx.doi.org/10.3390/jimaging8090229
work_keys_str_mv AT chowdhuryshihabuddin shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning
AT sayeedsanjana shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning
AT rashidiktisad shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning
AT alammdgolamrabiul shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning
AT masumabdulkadarmuhammad shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning
AT dewanmaliakber shapleyadditiveexplanationsbasedfactoranalysisfordengueseveritypredictionusingmachinelearning