Cargando…

Improving the generalizability of protein-ligand binding predictions with AI-Bind

Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We un...

Descripción completa

Detalles Bibliográficos
Autores principales: Chatterjee, Ayan, Walters, Robin, Shafi, Zohair, Ahmed, Omair Shafi, Sebek, Michael, Gysi, Deisy, Yu, Rose, Eliassi-Rad, Tina, Barabási, Albert-László, Menichetti, Giulia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082765/
https://www.ncbi.nlm.nih.gov/pubmed/37031187
http://dx.doi.org/10.1038/s41467-023-37572-z
_version_ 1785021380174020608
author Chatterjee, Ayan
Walters, Robin
Shafi, Zohair
Ahmed, Omair Shafi
Sebek, Michael
Gysi, Deisy
Yu, Rose
Eliassi-Rad, Tina
Barabási, Albert-László
Menichetti, Giulia
author_facet Chatterjee, Ayan
Walters, Robin
Shafi, Zohair
Ahmed, Omair Shafi
Sebek, Michael
Gysi, Deisy
Yu, Rose
Eliassi-Rad, Tina
Barabási, Albert-László
Menichetti, Giulia
author_sort Chatterjee, Ayan
collection PubMed
description Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery.
format Online
Article
Text
id pubmed-10082765
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100827652023-04-10 Improving the generalizability of protein-ligand binding predictions with AI-Bind Chatterjee, Ayan Walters, Robin Shafi, Zohair Ahmed, Omair Shafi Sebek, Michael Gysi, Deisy Yu, Rose Eliassi-Rad, Tina Barabási, Albert-László Menichetti, Giulia Nat Commun Article Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. Here we introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and step up the process of interpreting machine learning prediction of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery. Nature Publishing Group UK 2023-04-08 /pmc/articles/PMC10082765/ /pubmed/37031187 http://dx.doi.org/10.1038/s41467-023-37572-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chatterjee, Ayan
Walters, Robin
Shafi, Zohair
Ahmed, Omair Shafi
Sebek, Michael
Gysi, Deisy
Yu, Rose
Eliassi-Rad, Tina
Barabási, Albert-László
Menichetti, Giulia
Improving the generalizability of protein-ligand binding predictions with AI-Bind
title Improving the generalizability of protein-ligand binding predictions with AI-Bind
title_full Improving the generalizability of protein-ligand binding predictions with AI-Bind
title_fullStr Improving the generalizability of protein-ligand binding predictions with AI-Bind
title_full_unstemmed Improving the generalizability of protein-ligand binding predictions with AI-Bind
title_short Improving the generalizability of protein-ligand binding predictions with AI-Bind
title_sort improving the generalizability of protein-ligand binding predictions with ai-bind
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082765/
https://www.ncbi.nlm.nih.gov/pubmed/37031187
http://dx.doi.org/10.1038/s41467-023-37572-z
work_keys_str_mv AT chatterjeeayan improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT waltersrobin improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT shafizohair improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT ahmedomairshafi improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT sebekmichael improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT gysideisy improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT yurose improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT eliassiradtina improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT barabasialbertlaszlo improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind
AT menichettigiulia improvingthegeneralizabilityofproteinligandbindingpredictionswithaibind