Cargando…

Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions

Nowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described...

Descripción completa

Detalles Bibliográficos
Autores principales: Rakhimbekova, Assima, Madzhidov, Timur I., Nugmanov, Ramil I., Gimadiev, Timur R., Baskin, Igor I., Varnek, Alexandre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7432167/
https://www.ncbi.nlm.nih.gov/pubmed/32756326
http://dx.doi.org/10.3390/ijms21155542
_version_ 1783571737580929024
author Rakhimbekova, Assima
Madzhidov, Timur I.
Nugmanov, Ramil I.
Gimadiev, Timur R.
Baskin, Igor I.
Varnek, Alexandre
author_facet Rakhimbekova, Assima
Madzhidov, Timur I.
Nugmanov, Ramil I.
Gimadiev, Timur R.
Baskin, Igor I.
Varnek, Alexandre
author_sort Rakhimbekova, Assima
collection PubMed
description Nowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described in the literature, no one for chemical reactions (Quantitative Reaction-Property Relationships (QRPR)) has been reported to date. The point is that a chemical reaction is a much more complex object than an individual molecule, and its yield, thermodynamic and kinetic characteristics depend not only on the structures of reactants and products but also on experimental conditions. The QRPR models’ performance largely depends on the way that chemical transformation is encoded. In this study, various AD definition methods extensively used in QSAR/QSPR studies of individual molecules, as well as several novel approaches suggested in this work for reactions, were benchmarked on several reaction datasets. The ability to exclude wrong reaction types, increase coverage, improve the model performance and detect Y-outliers were tested. As a result, several “best” AD definitions for the QRPR models predicting reaction characteristics have been revealed and tested on a previously published external dataset with a clear AD definition problem.
format Online
Article
Text
id pubmed-7432167
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74321672020-08-24 Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions Rakhimbekova, Assima Madzhidov, Timur I. Nugmanov, Ramil I. Gimadiev, Timur R. Baskin, Igor I. Varnek, Alexandre Int J Mol Sci Article Nowadays, the problem of the model’s applicability domain (AD) definition is an active research topic in chemoinformatics. Although many various AD definitions for the models predicting properties of molecules (Quantitative Structure-Activity/Property Relationship (QSAR/QSPR) models) were described in the literature, no one for chemical reactions (Quantitative Reaction-Property Relationships (QRPR)) has been reported to date. The point is that a chemical reaction is a much more complex object than an individual molecule, and its yield, thermodynamic and kinetic characteristics depend not only on the structures of reactants and products but also on experimental conditions. The QRPR models’ performance largely depends on the way that chemical transformation is encoded. In this study, various AD definition methods extensively used in QSAR/QSPR studies of individual molecules, as well as several novel approaches suggested in this work for reactions, were benchmarked on several reaction datasets. The ability to exclude wrong reaction types, increase coverage, improve the model performance and detect Y-outliers were tested. As a result, several “best” AD definitions for the QRPR models predicting reaction characteristics have been revealed and tested on a previously published external dataset with a clear AD definition problem. MDPI 2020-08-03 /pmc/articles/PMC7432167/ /pubmed/32756326 http://dx.doi.org/10.3390/ijms21155542 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rakhimbekova, Assima
Madzhidov, Timur I.
Nugmanov, Ramil I.
Gimadiev, Timur R.
Baskin, Igor I.
Varnek, Alexandre
Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_full Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_fullStr Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_full_unstemmed Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_short Comprehensive Analysis of Applicability Domains of QSPR Models for Chemical Reactions
title_sort comprehensive analysis of applicability domains of qspr models for chemical reactions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7432167/
https://www.ncbi.nlm.nih.gov/pubmed/32756326
http://dx.doi.org/10.3390/ijms21155542
work_keys_str_mv AT rakhimbekovaassima comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT madzhidovtimuri comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT nugmanovramili comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT gimadievtimurr comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT baskinigori comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions
AT varnekalexandre comprehensiveanalysisofapplicabilitydomainsofqsprmodelsforchemicalreactions