Cargando…

Bioactivity descriptors for uncharacterized chemical compounds

Chemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biologic...

Descripción completa

Detalles Bibliográficos
Autores principales: Bertoni, Martino, Duran-Frigola, Miquel, Badia-i-Mompel, Pau, Pauls, Eduardo, Orozco-Ruiz, Modesto, Guitart-Pla, Oriol, Alcalde, Víctor, Diaz, Víctor M., Berenguer-Llergo, Antoni, Brun-Heath, Isabelle, Villegas, Núria, de Herreros, Antonio García, Aloy, Patrick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8225676/
https://www.ncbi.nlm.nih.gov/pubmed/34168145
http://dx.doi.org/10.1038/s41467-021-24150-4
_version_ 1783712131993042944
author Bertoni, Martino
Duran-Frigola, Miquel
Badia-i-Mompel, Pau
Pauls, Eduardo
Orozco-Ruiz, Modesto
Guitart-Pla, Oriol
Alcalde, Víctor
Diaz, Víctor M.
Berenguer-Llergo, Antoni
Brun-Heath, Isabelle
Villegas, Núria
de Herreros, Antonio García
Aloy, Patrick
author_facet Bertoni, Martino
Duran-Frigola, Miquel
Badia-i-Mompel, Pau
Pauls, Eduardo
Orozco-Ruiz, Modesto
Guitart-Pla, Oriol
Alcalde, Víctor
Diaz, Víctor M.
Berenguer-Llergo, Antoni
Brun-Heath, Isabelle
Villegas, Núria
de Herreros, Antonio García
Aloy, Patrick
author_sort Bertoni, Martino
collection PubMed
description Chemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Unfortunately, bioactivity descriptors are not available for most small molecules, which limits their applicability to a few thousand well characterized compounds. Here we present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them. Our signaturizers relate to bioactivities of 25 different types (including target profiles, cellular response and clinical outcomes) and can be used as drop-in replacements for chemical descriptors in day-to-day chemoinformatics tasks. Indeed, we illustrate how inferred bioactivity signatures are useful to navigate the chemical space in a biologically relevant manner, unveiling higher-order organization in natural product collections, and to enrich mostly uncharacterized chemical libraries for activity against the drug-orphan target Snail1. Moreover, we implement a battery of signature-activity relationship (SigAR) models and show a substantial improvement in performance, with respect to chemistry-based classifiers, across a series of biophysics and physiology activity prediction benchmarks.
format Online
Article
Text
id pubmed-8225676
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82256762021-07-09 Bioactivity descriptors for uncharacterized chemical compounds Bertoni, Martino Duran-Frigola, Miquel Badia-i-Mompel, Pau Pauls, Eduardo Orozco-Ruiz, Modesto Guitart-Pla, Oriol Alcalde, Víctor Diaz, Víctor M. Berenguer-Llergo, Antoni Brun-Heath, Isabelle Villegas, Núria de Herreros, Antonio García Aloy, Patrick Nat Commun Article Chemical descriptors encode the physicochemical and structural properties of small molecules, and they are at the core of chemoinformatics. The broad release of bioactivity data has prompted enriched representations of compounds, reaching beyond chemical structures and capturing their known biological properties. Unfortunately, bioactivity descriptors are not available for most small molecules, which limits their applicability to a few thousand well characterized compounds. Here we present a collection of deep neural networks able to infer bioactivity signatures for any compound of interest, even when little or no experimental information is available for them. Our signaturizers relate to bioactivities of 25 different types (including target profiles, cellular response and clinical outcomes) and can be used as drop-in replacements for chemical descriptors in day-to-day chemoinformatics tasks. Indeed, we illustrate how inferred bioactivity signatures are useful to navigate the chemical space in a biologically relevant manner, unveiling higher-order organization in natural product collections, and to enrich mostly uncharacterized chemical libraries for activity against the drug-orphan target Snail1. Moreover, we implement a battery of signature-activity relationship (SigAR) models and show a substantial improvement in performance, with respect to chemistry-based classifiers, across a series of biophysics and physiology activity prediction benchmarks. Nature Publishing Group UK 2021-06-24 /pmc/articles/PMC8225676/ /pubmed/34168145 http://dx.doi.org/10.1038/s41467-021-24150-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Bertoni, Martino
Duran-Frigola, Miquel
Badia-i-Mompel, Pau
Pauls, Eduardo
Orozco-Ruiz, Modesto
Guitart-Pla, Oriol
Alcalde, Víctor
Diaz, Víctor M.
Berenguer-Llergo, Antoni
Brun-Heath, Isabelle
Villegas, Núria
de Herreros, Antonio García
Aloy, Patrick
Bioactivity descriptors for uncharacterized chemical compounds
title Bioactivity descriptors for uncharacterized chemical compounds
title_full Bioactivity descriptors for uncharacterized chemical compounds
title_fullStr Bioactivity descriptors for uncharacterized chemical compounds
title_full_unstemmed Bioactivity descriptors for uncharacterized chemical compounds
title_short Bioactivity descriptors for uncharacterized chemical compounds
title_sort bioactivity descriptors for uncharacterized chemical compounds
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8225676/
https://www.ncbi.nlm.nih.gov/pubmed/34168145
http://dx.doi.org/10.1038/s41467-021-24150-4
work_keys_str_mv AT bertonimartino bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT duranfrigolamiquel bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT badiaimompelpau bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT paulseduardo bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT orozcoruizmodesto bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT guitartplaoriol bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT alcaldevictor bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT diazvictorm bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT berenguerllergoantoni bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT brunheathisabelle bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT villegasnuria bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT deherrerosantoniogarcia bioactivitydescriptorsforuncharacterizedchemicalcompounds
AT aloypatrick bioactivitydescriptorsforuncharacterizedchemicalcompounds