Cargando…

XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI

Machine learning (ML) has been used for classification of heart diseases for almost a decade, although understanding of the internal working of the black boxes, i.e., non-interpretable models, remain a demanding problem. Another major challenge in such ML models is the curse of dimensionality leadin...

Descripción completa

Detalles Bibliográficos
Autores principales: Das, Surajit, Sultana, Mahamuda, Bhattacharya, Suman, Sengupta, Diganta, De, Debashis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10177719/
https://www.ncbi.nlm.nih.gov/pubmed/37359323
http://dx.doi.org/10.1007/s11227-023-05356-3
_version_ 1785040693342765056
author Das, Surajit
Sultana, Mahamuda
Bhattacharya, Suman
Sengupta, Diganta
De, Debashis
author_facet Das, Surajit
Sultana, Mahamuda
Bhattacharya, Suman
Sengupta, Diganta
De, Debashis
author_sort Das, Surajit
collection PubMed
description Machine learning (ML) has been used for classification of heart diseases for almost a decade, although understanding of the internal working of the black boxes, i.e., non-interpretable models, remain a demanding problem. Another major challenge in such ML models is the curse of dimensionality leading to resource intensive classification using the comprehensive set of feature vector (CFV). This study focuses on dimensionality reduction using explainable artificial intelligence, without negotiating on accuracy for heart disease classification. Four explainable ML models, using SHAP, were used for classification which reflected the feature contributions (FC) and feature weights (FW) for each feature in the CFV for generating the final results. FC and FW were taken into account in generating the reduced dimensional feature subset (FS). The findings of the study are as follows: (a) XGBoost classifies heart diseases best with explanations, with an increase in 2% in model accuracy over existing best proposals, (b) explainable classification using FS exhibits better accuracy than most of the literary proposals, and (c) with the increase in explainability, accuracy can be preserved using XGBoost classifier for classifying heart diseases, and (d) the top four features responsible for diagnosis of heart disease have been exhibited which have common occurrences in all the explanations reflected by the five explainable techniques used on XGBoost classifier based on feature contributions. To the best of our knowledge, this is first attempt to explain XGBoost classification for diagnosis of heart diseases using five explainable techniques.
format Online
Article
Text
id pubmed-10177719
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-101777192023-05-14 XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI Das, Surajit Sultana, Mahamuda Bhattacharya, Suman Sengupta, Diganta De, Debashis J Supercomput Article Machine learning (ML) has been used for classification of heart diseases for almost a decade, although understanding of the internal working of the black boxes, i.e., non-interpretable models, remain a demanding problem. Another major challenge in such ML models is the curse of dimensionality leading to resource intensive classification using the comprehensive set of feature vector (CFV). This study focuses on dimensionality reduction using explainable artificial intelligence, without negotiating on accuracy for heart disease classification. Four explainable ML models, using SHAP, were used for classification which reflected the feature contributions (FC) and feature weights (FW) for each feature in the CFV for generating the final results. FC and FW were taken into account in generating the reduced dimensional feature subset (FS). The findings of the study are as follows: (a) XGBoost classifies heart diseases best with explanations, with an increase in 2% in model accuracy over existing best proposals, (b) explainable classification using FS exhibits better accuracy than most of the literary proposals, and (c) with the increase in explainability, accuracy can be preserved using XGBoost classifier for classifying heart diseases, and (d) the top four features responsible for diagnosis of heart disease have been exhibited which have common occurrences in all the explanations reflected by the five explainable techniques used on XGBoost classifier based on feature contributions. To the best of our knowledge, this is first attempt to explain XGBoost classification for diagnosis of heart diseases using five explainable techniques. Springer US 2023-05-12 /pmc/articles/PMC10177719/ /pubmed/37359323 http://dx.doi.org/10.1007/s11227-023-05356-3 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Das, Surajit
Sultana, Mahamuda
Bhattacharya, Suman
Sengupta, Diganta
De, Debashis
XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title_full XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title_fullStr XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title_full_unstemmed XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title_short XAI–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable AI
title_sort xai–reduct: accuracy preservation despite dimensionality reduction for heart disease classification using explainable ai
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10177719/
https://www.ncbi.nlm.nih.gov/pubmed/37359323
http://dx.doi.org/10.1007/s11227-023-05356-3
work_keys_str_mv AT dassurajit xaireductaccuracypreservationdespitedimensionalityreductionforheartdiseaseclassificationusingexplainableai
AT sultanamahamuda xaireductaccuracypreservationdespitedimensionalityreductionforheartdiseaseclassificationusingexplainableai
AT bhattacharyasuman xaireductaccuracypreservationdespitedimensionalityreductionforheartdiseaseclassificationusingexplainableai
AT senguptadiganta xaireductaccuracypreservationdespitedimensionalityreductionforheartdiseaseclassificationusingexplainableai
AT dedebashis xaireductaccuracypreservationdespitedimensionalityreductionforheartdiseaseclassificationusingexplainableai