Cargando…
Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values
Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explai...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383571/ https://www.ncbi.nlm.nih.gov/pubmed/37513472 http://dx.doi.org/10.3390/molecules28145601 |
_version_ | 1785080943192571904 |
---|---|
author | Lamens, Alec Bajorath, Jürgen |
author_facet | Lamens, Alec Bajorath, Jürgen |
author_sort | Lamens, Alec |
collection | PubMed |
description | Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explainable ML, which is a part of explainable artificial intelligence (XAI), to better understand prediction outcomes. Herein, we have devised a test system for the rationalization of multiclass compound activity prediction models that combines two approaches from XAI for feature relevance or importance analysis, including counterfactuals (CFs) and Shapley additive explanations (SHAP). For compounds with different single- and dual-target activities, we identified small compound modifications that induce feature changes inverting class label predictions. In combination with feature mapping, CFs and SHAP value calculations provide chemically intuitive explanations for model decisions. |
format | Online Article Text |
id | pubmed-10383571 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103835712023-07-30 Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values Lamens, Alec Bajorath, Jürgen Molecules Article Most machine learning (ML) models produce black box predictions that are difficult, if not impossible, to understand. In pharmaceutical research, black box predictions work against the acceptance of ML models for guiding experimental work. Hence, there is increasing interest in approaches for explainable ML, which is a part of explainable artificial intelligence (XAI), to better understand prediction outcomes. Herein, we have devised a test system for the rationalization of multiclass compound activity prediction models that combines two approaches from XAI for feature relevance or importance analysis, including counterfactuals (CFs) and Shapley additive explanations (SHAP). For compounds with different single- and dual-target activities, we identified small compound modifications that induce feature changes inverting class label predictions. In combination with feature mapping, CFs and SHAP value calculations provide chemically intuitive explanations for model decisions. MDPI 2023-07-24 /pmc/articles/PMC10383571/ /pubmed/37513472 http://dx.doi.org/10.3390/molecules28145601 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lamens, Alec Bajorath, Jürgen Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title | Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title_full | Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title_fullStr | Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title_full_unstemmed | Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title_short | Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values |
title_sort | explaining multiclass compound activity predictions using counterfactuals and shapley values |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10383571/ https://www.ncbi.nlm.nih.gov/pubmed/37513472 http://dx.doi.org/10.3390/molecules28145601 |
work_keys_str_mv | AT lamensalec explainingmulticlasscompoundactivitypredictionsusingcounterfactualsandshapleyvalues AT bajorathjurgen explainingmulticlasscompoundactivitypredictionsusingcounterfactualsandshapleyvalues |