Cargando…
Improving the generalizability of protein-ligand binding predictions with AI-Bind
Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We un...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082765/ https://www.ncbi.nlm.nih.gov/pubmed/37031187 http://dx.doi.org/10.1038/s41467-023-37572-z |
_version_ | 1785021380174020608 |
---|---|
author | Chatterjee, Ayan Walters, Robin Shafi, Zohair Ahmed, Omair Shafi Sebek, Michael Gysi, Deisy Yu, Rose Eliassi-Rad, Tina Barabási, Albert-László Menichetti, Giulia |
author_facet | Chatterjee, Ayan Walters, Robin Shafi, Zohair Ahmed, Omair Shafi Sebek, Michael Gysi, Deisy Yu, Rose Eliassi-Rad, Tina Barabási, Albert-László Menichetti, Giulia |
author_sort | Chatterjee, Ayan |
collection | PubMed |
description | Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. |
format | Online Article Text |
id | pubmed-10082765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100827652023-04-10 Improving the generalizability of protein-ligand binding predictions with AI-Bind Chatterjee, Ayan Walters, Robin Shafi, Zohair Ahmed, Omair Shafi Sebek, Michael Gysi, Deisy Yu, Rose Eliassi-Rad, Tina Barabási, Albert-László Menichetti, Giulia Nat Commun Article Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. Nature Publishing Group UK 2023-04-08 /pmc/articles/PMC10082765/ /pubmed/37031187 http://dx.doi.org/10.1038/s41467-023-37572-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Chatterjee, Ayan Walters, Robin Shafi, Zohair Ahmed, Omair Shafi Sebek, Michael Gysi, Deisy Yu, Rose Eliassi-Rad, Tina Barabási, Albert-László Menichetti, Giulia Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title | Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title_full | Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title_fullStr | Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title_full_unstemmed | Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title_short | Improving the generalizability of protein-ligand binding predictions with AI-Bind |
title_sort | improving the generalizability of protein-ligand binding predictions with ai-bind |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082765/ https://www.ncbi.nlm.nih.gov/pubmed/37031187 http://dx.doi.org/10.1038/s41467-023-37572-z |
work_keys_str_mv | AT chatterjeeayan improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT waltersrobin improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT shafizohair improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT ahmedomairshafi improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT sebekmichael improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT gysideisy improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT yurose improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT eliassiradtina improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT barabasialbertlaszlo improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind AT menichettigiulia improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind |