Cargando…

Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences

BACKGROUND: Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients’ disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD syste...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hatwell, Julian, Gaber, Mohamed Medhat, Atif Azad, R. Muhammad
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2020
Materias:	Technical Advance
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531148/ https://www.ncbi.nlm.nih.gov/pubmed/33008388 http://dx.doi.org/10.1186/s12911-020-01201-2

_version_	1783589707733532672
author	Hatwell, Julian Gaber, Mohamed Medhat Atif Azad, R. Muhammad
author_facet	Hatwell, Julian Gaber, Mohamed Medhat Atif Azad, R. Muhammad
author_sort	Hatwell, Julian
collection	PubMed
description	BACKGROUND: Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients’ disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning models and high dimensional data sources such as electronic health records, magnetic resonance imaging scans, cardiotocograms, etc. These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. METHODS: In this work, we focus on AdaBoost, a black box model that has been widely adopted in the CAD literature. We address the challenge – to explain AdaBoost classification – with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost’s adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model’s decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting. RESULTS: Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for specificity (mean precision 80%-99%). A very small trade-off in specificity is shown to guard against over-fitting which is a known problem in the state of the art methods. CONCLUSIONS: The experimental results demonstrate the benefits of using our novel algorithm for explaining CAD AdaBoost classifiers widely found in the literature. Our tightly coupled, AdaBoost-specific approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models.
format	Online Article Text
id	pubmed-7531148
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-75311482020-10-05 Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences Hatwell, Julian Gaber, Mohamed Medhat Atif Azad, R. Muhammad BMC Med Inform Decis Mak Technical Advance BACKGROUND: Computer Aided Diagnostics (CAD) can support medical practitioners to make critical decisions about their patients’ disease conditions. Practitioners require access to the chain of reasoning behind CAD to build trust in the CAD advice and to supplement their own expertise. Yet, CAD systems might be based on black box machine learning models and high dimensional data sources such as electronic health records, magnetic resonance imaging scans, cardiotocograms, etc. These foundations make interpretation and explanation of the CAD advice very challenging. This challenge is recognised throughout the machine learning research community. eXplainable Artificial Intelligence (XAI) is emerging as one of the most important research areas of recent years because it addresses the interpretability and trust concerns of critical decision makers, including those in clinical and medical practice. METHODS: In this work, we focus on AdaBoost, a black box model that has been widely adopted in the CAD literature. We address the challenge – to explain AdaBoost classification – with a novel algorithm that extracts simple, logical rules from AdaBoost models. Our algorithm, Adaptive-Weighted High Importance Path Snippets (Ada-WHIPS), makes use of AdaBoost’s adaptive classifier weights. Using a novel formulation, Ada-WHIPS uniquely redistributes the weights among individual decision nodes of the internal decision trees of the AdaBoost model. Then, a simple heuristic search of the weighted nodes finds a single rule that dominated the model’s decision. We compare the explanations generated by our novel approach with the state of the art in an experimental study. We evaluate the derived explanations with simple statistical tests of well-known quality measures, precision and coverage, and a novel measure stability that is better suited to the XAI setting. RESULTS: Experiments on 9 CAD-related data sets showed that Ada-WHIPS explanations consistently generalise better (mean coverage 15%-68%) than the state of the art while remaining competitive for specificity (mean precision 80%-99%). A very small trade-off in specificity is shown to guard against over-fitting which is a known problem in the state of the art methods. CONCLUSIONS: The experimental results demonstrate the benefits of using our novel algorithm for explaining CAD AdaBoost classifiers widely found in the literature. Our tightly coupled, AdaBoost-specific approach outperforms model-agnostic explanation methods and should be considered by practitioners looking for an XAI solution for this class of models. BioMed Central 2020-10-02 /pmc/articles/PMC7531148/ /pubmed/33008388 http://dx.doi.org/10.1186/s12911-020-01201-2 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Technical Advance Hatwell, Julian Gaber, Mohamed Medhat Atif Azad, R. Muhammad Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title	Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title_full	Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title_fullStr	Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title_full_unstemmed	Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title_short	Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences
title_sort	ada-whips: explaining adaboost classification with applications in the health sciences
topic	Technical Advance
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7531148/ https://www.ncbi.nlm.nih.gov/pubmed/33008388 http://dx.doi.org/10.1186/s12911-020-01201-2
work_keys_str_mv	AT hatwelljulian adawhipsexplainingadaboostclassificationwithapplicationsinthehealthsciences AT gabermohamedmedhat adawhipsexplainingadaboostclassificationwithapplicationsinthehealthsciences AT atifazadrmuhammad adawhipsexplainingadaboostclassificationwithapplicationsinthehealthsciences

Ada-WHIPS: explaining AdaBoost classification with applications in the health sciences

Ejemplares similares