Cargando…
Peptide-binding specificity prediction using fine-tuned protein structure prediction networks
Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to mo...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992841/ https://www.ncbi.nlm.nih.gov/pubmed/36802421 http://dx.doi.org/10.1073/pnas.2216697120 |
_version_ | 1784902407072776192 |
---|---|
author | Motmaen, Amir Dauparas, Justas Baek, Minkyung Abedi, Mohamad H. Baker, David Bradley, Philip |
author_facet | Motmaen, Amir Dauparas, Justas Baek, Minkyung Abedi, Mohamad H. Baker, David Bradley, Philip |
author_sort | Motmaen, Amir |
collection | PubMed |
description | Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to model the subtle structural changes accompanying sequence substitutions. Protein structure prediction networks such as AlphaFold model sequence-structure relationships very accurately, and we reasoned that if it were possible to specifically train such networks on binding data, more generalizable models could be created. We show that placing a classifier on top of the AlphaFold network and fine-tuning the combined network parameters for both classification and structure prediction accuracy leads to a model with strong generalizable performance on a wide range of Class I and Class II peptide-MHC interactions that approaches the overall performance of the state-of-the-art NetMHCpan sequence-based method. The peptide-MHC optimized model shows excellent performance in distinguishing binding and non-binding peptides to SH3 and PDZ domains. This ability to generalize well beyond the training set far exceeds that of sequence-only models and should be particularly powerful for systems where less experimental data are available. |
format | Online Article Text |
id | pubmed-9992841 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-99928412023-03-09 Peptide-binding specificity prediction using fine-tuned protein structure prediction networks Motmaen, Amir Dauparas, Justas Baek, Minkyung Abedi, Mohamad H. Baker, David Bradley, Philip Proc Natl Acad Sci U S A Biological Sciences Peptide-binding proteins play key roles in biology, and predicting their binding specificity is a long-standing challenge. While considerable protein structural information is available, the most successful current methods use sequence information alone, in part because it has been a challenge to model the subtle structural changes accompanying sequence substitutions. Protein structure prediction networks such as AlphaFold model sequence-structure relationships very accurately, and we reasoned that if it were possible to specifically train such networks on binding data, more generalizable models could be created. We show that placing a classifier on top of the AlphaFold network and fine-tuning the combined network parameters for both classification and structure prediction accuracy leads to a model with strong generalizable performance on a wide range of Class I and Class II peptide-MHC interactions that approaches the overall performance of the state-of-the-art NetMHCpan sequence-based method. The peptide-MHC optimized model shows excellent performance in distinguishing binding and non-binding peptides to SH3 and PDZ domains. This ability to generalize well beyond the training set far exceeds that of sequence-only models and should be particularly powerful for systems where less experimental data are available. National Academy of Sciences 2023-02-21 2023-02-28 /pmc/articles/PMC9992841/ /pubmed/36802421 http://dx.doi.org/10.1073/pnas.2216697120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Motmaen, Amir Dauparas, Justas Baek, Minkyung Abedi, Mohamad H. Baker, David Bradley, Philip Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title | Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title_full | Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title_fullStr | Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title_full_unstemmed | Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title_short | Peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
title_sort | peptide-binding specificity prediction using fine-tuned protein structure prediction networks |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992841/ https://www.ncbi.nlm.nih.gov/pubmed/36802421 http://dx.doi.org/10.1073/pnas.2216697120 |
work_keys_str_mv | AT motmaenamir peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks AT dauparasjustas peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks AT baekminkyung peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks AT abedimohamadh peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks AT bakerdavid peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks AT bradleyphilip peptidebindingspecificitypredictionusingfinetunedproteinstructurepredictionnetworks |