Cargando…

Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level

Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential...

Descripción completa

Detalles Bibliográficos
Autores principales: Canner, Samuel W., Shanker, Sudhanshu, Gray, Jeffrey J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054975/
https://www.ncbi.nlm.nih.gov/pubmed/36993750
http://dx.doi.org/10.1101/2023.03.14.531382
_version_ 1785015798311419904
author Canner, Samuel W.
Shanker, Sudhanshu
Gray, Jeffrey J.
author_facet Canner, Samuel W.
Shanker, Sudhanshu
Gray, Jeffrey J.
author_sort Canner, Samuel W.
collection PubMed
description Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate binding sites on any given protein. Here, we present two deep learning models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict carbohydrate binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2 predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
format Online
Article
Text
id pubmed-10054975
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100549752023-03-30 Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level Canner, Samuel W. Shanker, Sudhanshu Gray, Jeffrey J. bioRxiv Article Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate binding sites on any given protein. Here, we present two deep learning models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict carbohydrate binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2 predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures. Cold Spring Harbor Laboratory 2023-03-15 /pmc/articles/PMC10054975/ /pubmed/36993750 http://dx.doi.org/10.1101/2023.03.14.531382 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Canner, Samuel W.
Shanker, Sudhanshu
Gray, Jeffrey J.
Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title_full Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title_fullStr Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title_full_unstemmed Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title_short Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level
title_sort structure-based neural network protein-carbohydrate interaction predictions at the residue level
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054975/
https://www.ncbi.nlm.nih.gov/pubmed/36993750
http://dx.doi.org/10.1101/2023.03.14.531382
work_keys_str_mv AT cannersamuelw structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel
AT shankersudhanshu structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel
AT grayjeffreyj structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel