Cargando…

Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions

BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological o...

Descripción completa

Detalles Bibliográficos
Autores principales: Baranwal, Mayank, Magner, Abram, Saldinger, Jacob, Turali-Emre, Emine S., Elvati, Paolo, Kozarekar, Shivani, VanEpps, J. Scott, Kotov, Nicholas A., Violi, Angela, Hero, Alfred O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464414/
https://www.ncbi.nlm.nih.gov/pubmed/36088285
http://dx.doi.org/10.1186/s12859-022-04910-9
_version_ 1784787576368922624
author Baranwal, Mayank
Magner, Abram
Saldinger, Jacob
Turali-Emre, Emine S.
Elvati, Paolo
Kozarekar, Shivani
VanEpps, J. Scott
Kotov, Nicholas A.
Violi, Angela
Hero, Alfred O.
author_facet Baranwal, Mayank
Magner, Abram
Saldinger, Jacob
Turali-Emre, Emine S.
Elvati, Paolo
Kozarekar, Shivani
VanEpps, J. Scott
Kotov, Nicholas A.
Violi, Angela
Hero, Alfred O.
author_sort Baranwal, Mayank
collection PubMed
description BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains. RESULTS: In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 1:10 between positive and negative pairs, Struct2Graph achieves a fivefold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein–protein complex. The identification of important residues is tested for two different interaction types: (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein–protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy. CONCLUSIONS: In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https://github.com/baranwa2/Struct2Graph). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04910-9.
format Online
Article
Text
id pubmed-9464414
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94644142022-09-12 Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions Baranwal, Mayank Magner, Abram Saldinger, Jacob Turali-Emre, Emine S. Elvati, Paolo Kozarekar, Shivani VanEpps, J. Scott Kotov, Nicholas A. Violi, Angela Hero, Alfred O. BMC Bioinformatics Research BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains. RESULTS: In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 1:10 between positive and negative pairs, Struct2Graph achieves a fivefold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein–protein complex. The identification of important residues is tested for two different interaction types: (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein–protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy. CONCLUSIONS: In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https://github.com/baranwa2/Struct2Graph). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04910-9. BioMed Central 2022-09-10 /pmc/articles/PMC9464414/ /pubmed/36088285 http://dx.doi.org/10.1186/s12859-022-04910-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Baranwal, Mayank
Magner, Abram
Saldinger, Jacob
Turali-Emre, Emine S.
Elvati, Paolo
Kozarekar, Shivani
VanEpps, J. Scott
Kotov, Nicholas A.
Violi, Angela
Hero, Alfred O.
Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title_full Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title_fullStr Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title_full_unstemmed Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title_short Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
title_sort struct2graph: a graph attention network for structure based predictions of protein–protein interactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464414/
https://www.ncbi.nlm.nih.gov/pubmed/36088285
http://dx.doi.org/10.1186/s12859-022-04910-9
work_keys_str_mv AT baranwalmayank struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT magnerabram struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT saldingerjacob struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT turaliemreemines struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT elvatipaolo struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT kozarekarshivani struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT vaneppsjscott struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT kotovnicholasa struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT violiangela struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions
AT heroalfredo struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions