Cargando…

Peptide-binding specificity prediction using fine-tuned protein structure prediction networks

Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Motmaen, Amir, Dauparas, Justas, Baek, Minkyung, Abedi, Mohamad H., Baker, David, Bradley, Philip
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992841/
https://www.ncbi.nlm.nih.gov/pubmed/36802421
http://dx.doi.org/10.1073/pnas.2216697120
_version_ 1784902407072776192
author Motmaen, Amir
Dauparas, Justas
Baek, Minkyung
Abedi, Mohamad H.
Baker, David
Bradley, Philip
author_facet Motmaen, Amir
Dauparas, Justas
Baek, Minkyung
Abedi, Mohamad H.
Baker, David
Bradley, Philip
author_sort Motmaen, Amir
collection PubMed
description Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to model the subtle structural changes accompanying sequence substitutions. Protein structure prediction networks such as AlphaFold model sequence-structure relationships very accurately, and we reasoned that if it were possible to specifically train such networks on binding data, more generalizable models could be created. We show that placing a classifier on top of the AlphaFold network and fine-tuning the combined network parameters for both classification and structure prediction accuracy leads to a model with strong generalizable performance on a wide range of Class I and Class II peptide-MHC interactions that approaches the overall performance of the state-of-the-art NetMHCpan sequence-based method. The peptide-MHC optimized model shows excellent performance in distinguishing binding and non-binding peptides to SH3 and PDZ domains. This ability to generalize well beyond the training set far exceeds that of sequence-only models and should be particularly powerful for systems where less experimental data are available.
format Online
Article
Text
id pubmed-9992841
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-99928412023-03-09 Peptide-binding specificity prediction using fine-tuned protein structure prediction networks Motmaen, Amir Dauparas, Justas Baek, Minkyung Abedi, Mohamad H. Baker, David Bradley, Philip Proc Natl Acad Sci U S A Biological Sciences Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to model the subtle structural changes accompanying sequence substitutions. Protein structure prediction networks such as AlphaFold model sequence-structure relationships very accurately, and we reasoned that if it were possible to specifically train such networks on binding data, more generalizable models could be created. We show that placing a classifier on top of the AlphaFold network and fine-tuning the combined network parameters for both classification and structure prediction accuracy leads to a model with strong generalizable performance on a wide range of Class I and Class II peptide-MHC interactions that approaches the overall performance of the state-of-the-art NetMHCpan sequence-based method. The peptide-MHC optimized model shows excellent performance in distinguishing binding and non-binding peptides to SH3 and PDZ domains. This ability to generalize well beyond the training set far exceeds that of sequence-only models and should be particularly powerful for systems where less experimental data are available. National Academy of Sciences 2023-02-21 2023-02-28 /pmc/articles/PMC9992841/ /pubmed/36802421 http://dx.doi.org/10.1073/pnas.2216697120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Motmaen, Amir
Dauparas, Justas
Baek, Minkyung
Abedi, Mohamad H.
Baker, David
Bradley, Philip
Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title_full Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title_fullStr Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title_full_unstemmed Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title_short Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
title_sort peptide-binding specificity prediction using fine-tuned protein structure prediction networks
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992841/
https://www.ncbi.nlm.nih.gov/pubmed/36802421
http://dx.doi.org/10.1073/pnas.2216697120
work_keys_str_mv AT motmaenamir peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks
AT dauparasjustas peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks
AT baekminkyung peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks
AT abedimohamadh peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks
AT bakerdavid peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks
AT bradleyphilip peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks