Cargando…

Contrastive learning in protein language space predicts interactions between drugs and protein targets

Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computationa...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Rohit, Sledzieski, Samuel, Bryson, Bryan, Cowen, Lenore, Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268324/
https://www.ncbi.nlm.nih.gov/pubmed/37289807
http://dx.doi.org/10.1073/pnas.2220778120
_version_ 1785059069632970752
author Singh, Rohit
Sledzieski, Samuel
Bryson, Bryan
Cowen, Lenore
Berger, Bonnie
author_facet Singh, Rohit
Sledzieski, Samuel
Bryson, Bryan
Cowen, Lenore
Berger, Bonnie
author_sort Singh, Rohit
collection PubMed
description Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models (“PLex”) and employing a protein-anchored contrastive coembedding (“Con”) to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (K(D) = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug–target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu.
format Online
Article
Text
id pubmed-10268324
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-102683242023-06-15 Contrastive learning in protein language space predicts interactions between drugs and protein targets Singh, Rohit Sledzieski, Samuel Bryson, Bryan Cowen, Lenore Berger, Bonnie Proc Natl Acad Sci U S A Biological Sciences Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models (“PLex”) and employing a protein-anchored contrastive coembedding (“Con”) to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (K(D) = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug–target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu. National Academy of Sciences 2023-06-08 2023-06-13 /pmc/articles/PMC10268324/ /pubmed/37289807 http://dx.doi.org/10.1073/pnas.2220778120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Singh, Rohit
Sledzieski, Samuel
Bryson, Bryan
Cowen, Lenore
Berger, Bonnie
Contrastive learning in protein language space predicts interactions between drugs and protein targets
title Contrastive learning in protein language space predicts interactions between drugs and protein targets
title_full Contrastive learning in protein language space predicts interactions between drugs and protein targets
title_fullStr Contrastive learning in protein language space predicts interactions between drugs and protein targets
title_full_unstemmed Contrastive learning in protein language space predicts interactions between drugs and protein targets
title_short Contrastive learning in protein language space predicts interactions between drugs and protein targets
title_sort contrastive learning in protein language space predicts interactions between drugs and protein targets
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268324/
https://www.ncbi.nlm.nih.gov/pubmed/37289807
http://dx.doi.org/10.1073/pnas.2220778120
work_keys_str_mv AT singhrohit contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets
AT sledzieskisamuel contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets
AT brysonbryan contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets
AT cowenlenore contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets
AT bergerbonnie contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets