Cargando…
Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions
BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological o...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464414/ https://www.ncbi.nlm.nih.gov/pubmed/36088285 http://dx.doi.org/10.1186/s12859-022-04910-9 |
_version_ | 1784787576368922624 |
---|---|
author | Baranwal, Mayank Magner, Abram Saldinger, Jacob Turali-Emre, Emine S. Elvati, Paolo Kozarekar, Shivani VanEpps, J. Scott Kotov, Nicholas A. Violi, Angela Hero, Alfred O. |
author_facet | Baranwal, Mayank Magner, Abram Saldinger, Jacob Turali-Emre, Emine S. Elvati, Paolo Kozarekar, Shivani VanEpps, J. Scott Kotov, Nicholas A. Violi, Angela Hero, Alfred O. |
author_sort | Baranwal, Mayank |
collection | PubMed |
description | BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains. RESULTS: In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 1:10 between positive and negative pairs, Struct2Graph achieves a fivefold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein–protein complex. The identification of important residues is tested for two different interaction types: (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein–protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy. CONCLUSIONS: In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https://github.com/baranwa2/Struct2Graph). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04910-9. |
format | Online Article Text |
id | pubmed-9464414 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94644142022-09-12 Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions Baranwal, Mayank Magner, Abram Saldinger, Jacob Turali-Emre, Emine S. Elvati, Paolo Kozarekar, Shivani VanEpps, J. Scott Kotov, Nicholas A. Violi, Angela Hero, Alfred O. BMC Bioinformatics Research BACKGROUND: Development of new methods for analysis of protein–protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains. RESULTS: In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 1:10 between positive and negative pairs, Struct2Graph achieves a fivefold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein–protein complex. The identification of important residues is tested for two different interaction types: (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein–protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy. CONCLUSIONS: In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https://github.com/baranwa2/Struct2Graph). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04910-9. BioMed Central 2022-09-10 /pmc/articles/PMC9464414/ /pubmed/36088285 http://dx.doi.org/10.1186/s12859-022-04910-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Baranwal, Mayank Magner, Abram Saldinger, Jacob Turali-Emre, Emine S. Elvati, Paolo Kozarekar, Shivani VanEpps, J. Scott Kotov, Nicholas A. Violi, Angela Hero, Alfred O. Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title | Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title_full | Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title_fullStr | Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title_full_unstemmed | Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title_short | Struct2Graph: a graph attention network for structure based predictions of protein–protein interactions |
title_sort | struct2graph: a graph attention network for structure based predictions of protein–protein interactions |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9464414/ https://www.ncbi.nlm.nih.gov/pubmed/36088285 http://dx.doi.org/10.1186/s12859-022-04910-9 |
work_keys_str_mv | AT baranwalmayank struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT magnerabram struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT saldingerjacob struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT turaliemreemines struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT elvatipaolo struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT kozarekarshivani struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT vaneppsjscott struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT kotovnicholasa struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT violiangela struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions AT heroalfredo struct2graphagraphattentionnetworkforstructurebasedpredictionsofproteinproteininteractions |