Benchmarks for interpretation of QSAR models

Interpretation of QSAR models is useful to understand the complex nature of biological or physicochemical processes, guide structural optimization or perform knowledge-based validation of QSAR models. Highly predictive models are usually complex and their interpretation is non-trivial. This is parti...


Bibliographic Details
Main Authors:	Matveieva, Mariia; Polishchuk, Pavel
Format:	Online Article Text
Language:	English
Published:	Springer International Publishing 2021
Subjects:	Research Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8157407/ https://www.ncbi.nlm.nih.gov/pubmed/34039411 http://dx.doi.org/10.1186/s13321-021-00519-x

_version_	1783699676890923008
author	Matveieva, Mariia Polishchuk, Pavel
author_facet	Matveieva, Mariia Polishchuk, Pavel
author_sort	Matveieva, Mariia
collection	PubMed
description	Interpretation of QSAR models is useful to understand the complex nature of biological or physicochemical processes, guide structural optimization or perform knowledge-based validation of QSAR models. Highly predictive models are usually complex and their interpretation is non-trivial. This is particularly true for modern neural networks. Various approaches to interpretation of these models exist. However, it is difficult to evaluate and compare performance and applicability of these ever-emerging methods. Herein, we developed several benchmark data sets with end-points determined by pre-defined patterns. These data sets are purposed for evaluation of the ability of interpretation approaches to retrieve these patterns. They represent tasks with different complexity levels: from simple atom-based additive properties to pharmacophore hypothesis. We proposed several quantitative metrics of interpretation performance. Applicability of benchmarks and metrics was demonstrated on a set of conventional models and end-to-end graph convolutional neural networks, interpreted by the previously suggested universal ML-agnostic approach for structural interpretation. We anticipate these benchmarks to be useful in evaluation of new interpretation approaches and investigation of decision making of complex “black box” models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-021-00519-x.
format	Online Article Text
id	pubmed-8157407
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-8157407 2021-05-28 Benchmarks for interpretation of QSAR models Matveieva, Mariia Polishchuk, Pavel J Cheminform Research Article Springer International Publishing 2021-05-26 /pmc/articles/PMC8157407/ /pubmed/34039411 http://dx.doi.org/10.1186/s13321-021-00519-x Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Matveieva, Mariia Polishchuk, Pavel Benchmarks for interpretation of QSAR models
title	Benchmarks for interpretation of QSAR models
title_full	Benchmarks for interpretation of QSAR models
title_fullStr	Benchmarks for interpretation of QSAR models
title_full_unstemmed	Benchmarks for interpretation of QSAR models
title_short	Benchmarks for interpretation of QSAR models
title_sort	benchmarks for interpretation of qsar models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8157407/ https://www.ncbi.nlm.nih.gov/pubmed/34039411 http://dx.doi.org/10.1186/s13321-021-00519-x
work_keys_str_mv	AT matveievamariia benchmarksforinterpretationofqsarmodels AT polishchukpavel benchmarksforinterpretationofqsarmodels
