Cargando…

Scrutinizing XAI using linear ground-truth data with suppressor variables

Machine learning (ML) is increasingly often used to inform high-stakes decisions. As complex ML models (e.g., deep neural networks) are often considered black boxes, a wealth of procedures has been developed to shed light on their inner workings and the ways in which their predictions come about, de...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilming, Rick, Budding, Céline, Müller, Klaus-Robert, Haufe, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123083/
https://www.ncbi.nlm.nih.gov/pubmed/35611184
http://dx.doi.org/10.1007/s10994-022-06167-y
_version_ 1784711486208212992
author Wilming, Rick
Budding, Céline
Müller, Klaus-Robert
Haufe, Stefan
author_facet Wilming, Rick
Budding, Céline
Müller, Klaus-Robert
Haufe, Stefan
author_sort Wilming, Rick
collection PubMed
description Machine learning (ML) is increasingly often used to inform high-stakes decisions. As complex ML models (e.g., deep neural networks) are often considered black boxes, a wealth of procedures has been developed to shed light on their inner workings and the ways in which their predictions come about, defining the field of ‘explainable AI’ (XAI). Saliency methods rank input features according to some measure of ‘importance’. Such methods are difficult to validate since a formal definition of feature importance is, thus far, lacking. It has been demonstrated that some saliency methods can highlight features that have no statistical association with the prediction target (suppressor variables). To avoid misinterpretations due to such behavior, we propose the actual presence of such an association as a necessary condition and objective preliminary definition for feature importance. We carefully crafted a ground-truth dataset in which all statistical dependencies are well-defined and linear, serving as a benchmark to study the problem of suppressor variables. We evaluate common explanation methods including LRP, DTD, PatternNet, PatternAttribution, LIME, Anchors, SHAP, and permutation-based methods with respect to our objective definition. We show that most of these methods are unable to distinguish important features from suppressors in this setting. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10994-022-06167-y.
format Online
Article
Text
id pubmed-9123083
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-91230832022-05-22 Scrutinizing XAI using linear ground-truth data with suppressor variables Wilming, Rick Budding, Céline Müller, Klaus-Robert Haufe, Stefan Mach Learn Article Machine learning (ML) is increasingly often used to inform high-stakes decisions. As complex ML models (e.g., deep neural networks) are often considered black boxes, a wealth of procedures has been developed to shed light on their inner workings and the ways in which their predictions come about, defining the field of ‘explainable AI’ (XAI). Saliency methods rank input features according to some measure of ‘importance’. Such methods are difficult to validate since a formal definition of feature importance is, thus far, lacking. It has been demonstrated that some saliency methods can highlight features that have no statistical association with the prediction target (suppressor variables). To avoid misinterpretations due to such behavior, we propose the actual presence of such an association as a necessary condition and objective preliminary definition for feature importance. We carefully crafted a ground-truth dataset in which all statistical dependencies are well-defined and linear, serving as a benchmark to study the problem of suppressor variables. We evaluate common explanation methods including LRP, DTD, PatternNet, PatternAttribution, LIME, Anchors, SHAP, and permutation-based methods with respect to our objective definition. We show that most of these methods are unable to distinguish important features from suppressors in this setting. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10994-022-06167-y. Springer US 2022-04-13 2022 /pmc/articles/PMC9123083/ /pubmed/35611184 http://dx.doi.org/10.1007/s10994-022-06167-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wilming, Rick
Budding, Céline
Müller, Klaus-Robert
Haufe, Stefan
Scrutinizing XAI using linear ground-truth data with suppressor variables
title Scrutinizing XAI using linear ground-truth data with suppressor variables
title_full Scrutinizing XAI using linear ground-truth data with suppressor variables
title_fullStr Scrutinizing XAI using linear ground-truth data with suppressor variables
title_full_unstemmed Scrutinizing XAI using linear ground-truth data with suppressor variables
title_short Scrutinizing XAI using linear ground-truth data with suppressor variables
title_sort scrutinizing xai using linear ground-truth data with suppressor variables
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9123083/
https://www.ncbi.nlm.nih.gov/pubmed/35611184
http://dx.doi.org/10.1007/s10994-022-06167-y
work_keys_str_mv AT wilmingrick scrutinizingxaiusinglineargroundtruthdatawithsuppressorvariables
AT buddingceline scrutinizingxaiusinglineargroundtruthdatawithsuppressorvariables
AT mullerklausrobert scrutinizingxaiusinglineargroundtruthdatawithsuppressorvariables
AT haufestefan scrutinizingxaiusinglineargroundtruthdatawithsuppressorvariables