Cargando…
Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint
Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed mo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953226/ https://www.ncbi.nlm.nih.gov/pubmed/36830761 http://dx.doi.org/10.3390/biom13020393 |
_version_ | 1784893825710292992 |
---|---|
author | Janela, Tiago Takeuchi, Kosuke Bajorath, Jürgen |
author_facet | Janela, Tiago Takeuchi, Kosuke Bajorath, Jürgen |
author_sort | Janela, Tiago |
collection | PubMed |
description | Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed molecular fingerprint (FP) representation. This structure–potency fingerprint (SPFP) combines different modules accounting for the structural features of active compounds and their potency values in a single bit string, hence unifying structure and potency representation. This encoding enables the derivation of a conditional variational autoencoder (CVAE) using SPFPs of training compounds and apply the model to predict the SPFP potency module of test compounds using only their structure module as input. The SPFP–CVAE approach correctly predicts the potency values of compounds belonging to different activity classes with an accuracy comparable to support vector regression (SVR), representing the state-of-the-art in the field. In addition, highly potent compounds are predicted with very similar accuracy as SVR and deep neural networks. |
format | Online Article Text |
id | pubmed-9953226 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99532262023-02-25 Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint Janela, Tiago Takeuchi, Kosuke Bajorath, Jürgen Biomolecules Article Prediction of the potency of bioactive compounds generally relies on linear or nonlinear quantitative structure–activity relationship (QSAR) models. Nonlinear models are generated using machine learning methods. We introduce a novel approach for potency prediction that depends on a newly designed molecular fingerprint (FP) representation. This structure–potency fingerprint (SPFP) combines different modules accounting for the structural features of active compounds and their potency values in a single bit string, hence unifying structure and potency representation. This encoding enables the derivation of a conditional variational autoencoder (CVAE) using SPFPs of training compounds and apply the model to predict the SPFP potency module of test compounds using only their structure module as input. The SPFP–CVAE approach correctly predicts the potency values of compounds belonging to different activity classes with an accuracy comparable to support vector regression (SVR), representing the state-of-the-art in the field. In addition, highly potent compounds are predicted with very similar accuracy as SVR and deep neural networks. MDPI 2023-02-18 /pmc/articles/PMC9953226/ /pubmed/36830761 http://dx.doi.org/10.3390/biom13020393 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Janela, Tiago Takeuchi, Kosuke Bajorath, Jürgen Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title | Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title_full | Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title_fullStr | Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title_full_unstemmed | Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title_short | Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure–Potency Fingerprint |
title_sort | predicting potent compounds using a conditional variational autoencoder based upon a new structure–potency fingerprint |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9953226/ https://www.ncbi.nlm.nih.gov/pubmed/36830761 http://dx.doi.org/10.3390/biom13020393 |
work_keys_str_mv | AT janelatiago predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint AT takeuchikosuke predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint AT bajorathjurgen predictingpotentcompoundsusingaconditionalvariationalautoencoderbaseduponanewstructurepotencyfingerprint |