Cargando…

Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint

Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Janela, Tiago, Takeuchi, Kosuke, Bajorath, Jürgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953226/
https://www.ncbi.nlm.nih.gov/pubmed/36830761
http://dx.doi.org/10.3390/biom13020393
_version_ 1784893825710292992
author Janela, Tiago
Takeuchi, Kosuke
Bajorath, Jürgen
author_facet Janela, Tiago
Takeuchi, Kosuke
Bajorath, Jürgen
author_sort Janela, Tiago
collection PubMed
description Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed molecular fingerprint (FP) representation. This structure–potency fingerprint (SPFP) combines different modules accounting for the structural features of active compounds and their potency values in a single bit string, hence unifying structure and potency representation. This encoding enables the derivation of a conditional variational autoencoder (CVAE) using SPFPs of training compounds and apply the model to predict the SPFP potency module of test compounds using only their structure module as input. The SPFP–CVAE approach correctly predicts the potency values of compounds belonging to different activity classes with an accuracy comparable to support vector regression (SVR), representing the state-of-the-art in the field. In addition, highly potent compounds are predicted with very similar accuracy as SVR and deep neural networks.
format Online
Article
Text
id pubmed-9953226
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99532262023-02-25 Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint Janela, Tiago Takeuchi, Kosuke Bajorath, Jürgen Biomolecules Article Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed molecular fingerprint (FP) representation. This structure–potency fingerprint (SPFP) combines different modules accounting for the structural features of active compounds and their potency values in a single bit string, hence unifying structure and potency representation. This encoding enables the derivation of a conditional variational autoencoder (CVAE) using SPFPs of training compounds and apply the model to predict the SPFP potency module of test compounds using only their structure module as input. The SPFP–CVAE approach correctly predicts the potency values of compounds belonging to different activity classes with an accuracy comparable to support vector regression (SVR), representing the state-of-the-art in the field. In addition, highly potent compounds are predicted with very similar accuracy as SVR and deep neural networks. MDPI 2023-02-18 /pmc/articles/PMC9953226/ /pubmed/36830761 http://dx.doi.org/10.3390/biom13020393 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Janela, Tiago
Takeuchi, Kosuke
Bajorath, Jürgen
Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title_full Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title_fullStr Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title_full_unstemmed Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title_short Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
title_sort predicting potent compounds using a conditional variational autoencoder based upon a new structure–potency fingerprint
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953226/
https://www.ncbi.nlm.nih.gov/pubmed/36830761
http://dx.doi.org/10.3390/biom13020393
work_keys_str_mv AT janelatiago predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint
AT takeuchikosuke predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint
AT bajorathjurgen predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint