Cargando…

Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning

Predicting compounds with single- and multi-target activity and exploring origins of compound specificity and promiscuity is of high interest for chemical biology and drug discovery. We present a large-scale analysis of compound promiscuity including two major components. First, high-confidence data...

Descripción completa

Detalles Bibliográficos
Autores principales: Feldmann, Christian, Yonchev, Dimitar, Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7761051/
https://www.ncbi.nlm.nih.gov/pubmed/33260876
http://dx.doi.org/10.3390/biom10121605
_version_ 1783627477492432896
author Feldmann, Christian
Yonchev, Dimitar
Bajorath, Jürgen
author_facet Feldmann, Christian
Yonchev, Dimitar
Bajorath, Jürgen
author_sort Feldmann, Christian
collection PubMed
description Predicting compounds with single- and multi-target activity and exploring origins of compound specificity and promiscuity is of high interest for chemical biology and drug discovery. We present a large-scale analysis of compound promiscuity including two major components. First, high-confidence datasets of compounds with multi- and corresponding single-target activity were extracted from biological screening data. Positive and negative assay results were taken into account and data completeness was ensured. Second, these datasets were investigated using diagnostic machine learning to systematically distinguish between compounds with multi- and single-target activity. Models built on the basis of chemical structure consistently produced meaningful predictions. These findings provided evidence for the presence of structural features differentiating promiscuous and non-promiscuous compounds. Machine learning under varying conditions using modified datasets revealed a strong influence of nearest neighbor relationship on the predictions. Many multi-target compounds were found to be more similar to other multi-target compounds than single-target compounds and vice versa, which resulted in consistently accurate predictions. The results of our study confirm the presence of structural relationships that differentiate promiscuous and non-promiscuous compounds.
format Online
Article
Text
id pubmed-7761051
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77610512020-12-26 Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning Feldmann, Christian Yonchev, Dimitar Bajorath, Jürgen Biomolecules Article Predicting compounds with single- and multi-target activity and exploring origins of compound specificity and promiscuity is of high interest for chemical biology and drug discovery. We present a large-scale analysis of compound promiscuity including two major components. First, high-confidence datasets of compounds with multi- and corresponding single-target activity were extracted from biological screening data. Positive and negative assay results were taken into account and data completeness was ensured. Second, these datasets were investigated using diagnostic machine learning to systematically distinguish between compounds with multi- and single-target activity. Models built on the basis of chemical structure consistently produced meaningful predictions. These findings provided evidence for the presence of structural features differentiating promiscuous and non-promiscuous compounds. Machine learning under varying conditions using modified datasets revealed a strong influence of nearest neighbor relationship on the predictions. Many multi-target compounds were found to be more similar to other multi-target compounds than single-target compounds and vice versa, which resulted in consistently accurate predictions. The results of our study confirm the presence of structural relationships that differentiate promiscuous and non-promiscuous compounds. MDPI 2020-11-27 /pmc/articles/PMC7761051/ /pubmed/33260876 http://dx.doi.org/10.3390/biom10121605 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Feldmann, Christian
Yonchev, Dimitar
Bajorath, Jürgen
Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title_full Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title_fullStr Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title_full_unstemmed Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title_short Analysis of Biological Screening Compounds with Single- or Multi-Target Activity via Diagnostic Machine Learning
title_sort analysis of biological screening compounds with single- or multi-target activity via diagnostic machine learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7761051/
https://www.ncbi.nlm.nih.gov/pubmed/33260876
http://dx.doi.org/10.3390/biom10121605
work_keys_str_mv AT feldmannchristian analysisofbiologicalscreeningcompoundswithsingleormultitargetactivityviadiagnosticmachinelearning
AT yonchevdimitar analysisofbiologicalscreeningcompoundswithsingleormultitargetactivityviadiagnosticmachinelearning
AT bajorathjurgen analysisofbiologicalscreeningcompoundswithsingleormultitargetactivityviadiagnosticmachinelearning