Cargando…

Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values

Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explai...

Descripción completa

Detalles Bibliográficos
Autores principales: Lamens, Alec, Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383571/
https://www.ncbi.nlm.nih.gov/pubmed/37513472
http://dx.doi.org/10.3390/molecules28145601
_version_ 1785080943192571904
author Lamens, Alec
Bajorath, Jürgen
author_facet Lamens, Alec
Bajorath, Jürgen
author_sort Lamens, Alec
collection PubMed
description Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explainable ML, which is a part of explainable artificial intelligence (XAI), to better understand prediction outcomes. Herein, we have devised a test system for the rationalization of multiclass compound activity prediction models that combines two approaches from XAI for feature relevance or importance analysis, including counterfactuals (CFs) and Shapley additive explanations (SHAP). For compounds with different single- and dual-target activities, we identified small compound modifications that induce feature changes inverting class label predictions. In combination with feature mapping, CFs and SHAP value calculations provide chemically intuitive explanations for model decisions.
format Online
Article
Text
id pubmed-10383571
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103835712023-07-30 Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values Lamens, Alec Bajorath, Jürgen Molecules Article Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explainable ML, which is a part of explainable artificial intelligence (XAI), to better understand prediction outcomes. Herein, we have devised a test system for the rationalization of multiclass compound activity prediction models that combines two approaches from XAI for feature relevance or importance analysis, including counterfactuals (CFs) and Shapley additive explanations (SHAP). For compounds with different single- and dual-target activities, we identified small compound modifications that induce feature changes inverting class label predictions. In combination with feature mapping, CFs and SHAP value calculations provide chemically intuitive explanations for model decisions. MDPI 2023-07-24 /pmc/articles/PMC10383571/ /pubmed/37513472 http://dx.doi.org/10.3390/molecules28145601 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lamens, Alec
Bajorath, Jürgen
Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title_full Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title_fullStr Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title_full_unstemmed Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title_short Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
title_sort explaining multiclass compound activity predictions using counterfactuals and shapley values
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383571/
https://www.ncbi.nlm.nih.gov/pubmed/37513472
http://dx.doi.org/10.3390/molecules28145601
work_keys_str_mv AT lamensalec explainingmulticlasscompoundactivitypredictionsusingcounterfactualsandshapleyvalues
AT bajorathjurgen explainingmulticlasscompoundactivitypredictionsusingcounterfactualsandshapleyvalues