Cargando…
Contrastive learning in protein language space predicts interactions between drugs and protein targets
Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computationa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268324/ https://www.ncbi.nlm.nih.gov/pubmed/37289807 http://dx.doi.org/10.1073/pnas.2220778120 |
_version_ | 1785059069632970752 |
---|---|
author | Singh, Rohit Sledzieski, Samuel Bryson, Bryan Cowen, Lenore Berger, Bonnie |
author_facet | Singh, Rohit Sledzieski, Samuel Bryson, Bryan Cowen, Lenore Berger, Bonnie |
author_sort | Singh, Rohit |
collection | PubMed |
description | Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models (“PLex”) and employing a protein-anchored contrastive coembedding (“Con”) to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (K(D) = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug–target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu. |
format | Online Article Text |
id | pubmed-10268324 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-102683242023-06-15 Contrastive learning in protein language space predicts interactions between drugs and protein targets Singh, Rohit Sledzieski, Samuel Bryson, Bryan Cowen, Lenore Berger, Bonnie Proc Natl Acad Sci U S A Biological Sciences Sequence-based prediction of drug–target interactions has the potential to accelerate drug discovery by complementing experimental screens. Such computational prediction needs to be generalizable and scalable while remaining sensitive to subtle variations in the inputs. However, current computational techniques fail to simultaneously meet these goals, often sacrificing performance of one to achieve the others. We develop a deep learning model, ConPLex, successfully leveraging the advances in pretrained protein language models (“PLex”) and employing a protein-anchored contrastive coembedding (“Con”) to outperform state-of-the-art approaches. ConPLex achieves high accuracy, broad adaptivity to unseen data, and specificity against decoy compounds. It makes predictions of binding based on the distance between learned representations, enabling predictions at the scale of massive compound libraries and the human proteome. Experimental testing of 19 kinase-drug interaction predictions validated 12 interactions, including four with subnanomolar affinity, plus a strongly binding EPHB1 inhibitor (K(D) = 1.3 nM). Furthermore, ConPLex embeddings are interpretable, which enables us to visualize the drug–target embedding space and use embeddings to characterize the function of human cell-surface proteins. We anticipate that ConPLex will facilitate efficient drug discovery by making highly sensitive in silico drug screening feasible at the genome scale. ConPLex is available open source at https://ConPLex.csail.mit.edu. National Academy of Sciences 2023-06-08 2023-06-13 /pmc/articles/PMC10268324/ /pubmed/37289807 http://dx.doi.org/10.1073/pnas.2220778120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Biological Sciences Singh, Rohit Sledzieski, Samuel Bryson, Bryan Cowen, Lenore Berger, Bonnie Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title | Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title_full | Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title_fullStr | Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title_full_unstemmed | Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title_short | Contrastive learning in protein language space predicts interactions between drugs and protein targets |
title_sort | contrastive learning in protein language space predicts interactions between drugs and protein targets |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10268324/ https://www.ncbi.nlm.nih.gov/pubmed/37289807 http://dx.doi.org/10.1073/pnas.2220778120 |
work_keys_str_mv | AT singhrohit contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets AT sledzieskisamuel contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets AT brysonbryan contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets AT cowenlenore contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets AT bergerbonnie contrastivelearninginproteinlanguagespacepredictsinteractionsbetweendrugsandproteintargets |