Cargando…

Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor

BACKGROUND: Quantitative structure-activity relationship (QSAR) models are important tools used in discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: limited amount of available biological activity data and...

Descripción completa

Detalles Bibliográficos
Autores principales: Balabin, Ilya A., Judson, Richard S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755572/
https://www.ncbi.nlm.nih.gov/pubmed/30229396
http://dx.doi.org/10.1186/s13321-018-0300-0
_version_ 1783453260890243072
author Balabin, Ilya A.
Judson, Richard S.
author_facet Balabin, Ilya A.
Judson, Richard S.
author_sort Balabin, Ilya A.
collection PubMed
description BACKGROUND: Quantitative structure-activity relationship (QSAR) models are important tools used in discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: limited amount of available biological activity data and noise or uncertainty in the activity data themselves. To address these challenges, we introduce and explore a QSAR model based on custom distance metrics in the structure-activity space. METHODS: The model is built on top of the k-nearest neighbor model, incorporating non-linearity not only in the chemical structure space, but also in the biological activity space. The model is tuned and evaluated using activity data for human estrogen receptor from the US EPA ToxCast and Tox21 databases. RESULTS: The model closely trails the CERAPP consensus model (built on top of 48 individual human estrogen receptor activity models) in agonist activity predictions and consistently outperforms the CERAPP consensus model in antagonist activity predictions. DISCUSSION: We suggest that incorporating non-linear distance metrics may significantly improve QSAR model performance when the available biological activity data are limited. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0300-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6755572
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-67555722019-09-26 Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor Balabin, Ilya A. Judson, Richard S. J Cheminform Research Article BACKGROUND: Quantitative structure-activity relationship (QSAR) models are important tools used in discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: limited amount of available biological activity data and noise or uncertainty in the activity data themselves. To address these challenges, we introduce and explore a QSAR model based on custom distance metrics in the structure-activity space. METHODS: The model is built on top of the k-nearest neighbor model, incorporating non-linearity not only in the chemical structure space, but also in the biological activity space. The model is tuned and evaluated using activity data for human estrogen receptor from the US EPA ToxCast and Tox21 databases. RESULTS: The model closely trails the CERAPP consensus model (built on top of 48 individual human estrogen receptor activity models) in agonist activity predictions and consistently outperforms the CERAPP consensus model in antagonist activity predictions. DISCUSSION: We suggest that incorporating non-linear distance metrics may significantly improve QSAR model performance when the available biological activity data are limited. [Image: see text] ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13321-018-0300-0) contains supplementary material, which is available to authorized users. Springer International Publishing 2018-09-18 /pmc/articles/PMC6755572/ /pubmed/30229396 http://dx.doi.org/10.1186/s13321-018-0300-0 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Balabin, Ilya A.
Judson, Richard S.
Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title_full Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title_fullStr Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title_full_unstemmed Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title_short Exploring non-linear distance metrics in the structure–activity space: QSAR models for human estrogen receptor
title_sort exploring non-linear distance metrics in the structure–activity space: qsar models for human estrogen receptor
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6755572/
https://www.ncbi.nlm.nih.gov/pubmed/30229396
http://dx.doi.org/10.1186/s13321-018-0300-0
work_keys_str_mv AT balabinilyaa exploringnonlineardistancemetricsinthestructureactivityspaceqsarmodelsforhumanestrogenreceptor
AT judsonrichards exploringnonlineardistancemetricsinthestructureactivityspaceqsarmodelsforhumanestrogenreceptor