Cargando…
Structure-based neural network protein–carbohydrate interaction predictions at the residue level
Carbohydrates dynamically and transiently interact with proteins for cell–cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10318439/ https://www.ncbi.nlm.nih.gov/pubmed/37409346 http://dx.doi.org/10.3389/fbinf.2023.1186531 |
_version_ | 1785068038090915840 |
---|---|
author | Canner, Samuel W. Shanker, Sudhanshu Gray, Jeffrey J. |
author_facet | Canner, Samuel W. Shanker, Sudhanshu Gray, Jeffrey J. |
author_sort | Canner, Samuel W. |
collection | PubMed |
description | Carbohydrates dynamically and transiently interact with proteins for cell–cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate-binding sites on any given protein. Here, we present two deep learning (DL) models named CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) that predicts non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate-binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2-predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein–carbohydrate structures. |
format | Online Article Text |
id | pubmed-10318439 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-103184392023-07-05 Structure-based neural network protein–carbohydrate interaction predictions at the residue level Canner, Samuel W. Shanker, Sudhanshu Gray, Jeffrey J. Front Bioinform Bioinformatics Carbohydrates dynamically and transiently interact with proteins for cell–cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate-binding sites on any given protein. Here, we present two deep learning (DL) models named CArbohydrate–Protein interaction Site IdentiFier (CAPSIF) that predicts non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate-binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2-predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein–carbohydrate structures. Frontiers Media S.A. 2023-06-20 /pmc/articles/PMC10318439/ /pubmed/37409346 http://dx.doi.org/10.3389/fbinf.2023.1186531 Text en Copyright © 2023 Canner, Shanker and Gray. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Canner, Samuel W. Shanker, Sudhanshu Gray, Jeffrey J. Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title | Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title_full | Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title_fullStr | Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title_full_unstemmed | Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title_short | Structure-based neural network protein–carbohydrate interaction predictions at the residue level |
title_sort | structure-based neural network protein–carbohydrate interaction predictions at the residue level |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10318439/ https://www.ncbi.nlm.nih.gov/pubmed/37409346 http://dx.doi.org/10.3389/fbinf.2023.1186531 |
work_keys_str_mv | AT cannersamuelw structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel AT shankersudhanshu structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel AT grayjeffreyj structurebasedneuralnetworkproteincarbohydrateinteractionpredictionsattheresiduelevel |