Cargando…

Prediction of protein–protein interaction using graph neural networks

Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biolo...

Descripción completa

Detalles Bibliográficos
Autores principales: Jha, Kanchan, Saha, Sriparna, Singh, Hiteshi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9120162/
https://www.ncbi.nlm.nih.gov/pubmed/35589837
http://dx.doi.org/10.1038/s41598-022-12201-9
_version_ 1784710874183761920
author Jha, Kanchan
Saha, Sriparna
Singh, Hiteshi
author_facet Jha, Kanchan
Saha, Sriparna
Singh, Hiteshi
author_sort Jha, Kanchan
collection PubMed
description Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biological activities. The knowledge of protein–protein interactions (PPIs) unravels the cellular behavior and its functionality. The computational methods automate the prediction of PPI and are less expensive than experimental methods in terms of resources and time. So far, most of the works on PPI have mainly focused on sequence information. Here, we use graph convolutional network (GCN) and graph attention network (GAT) to predict the interaction between proteins by utilizing protein’s structural information and sequence features. We build the graphs of proteins from their PDB files, which contain 3D coordinates of atoms. The protein graph represents the amino acid network, also known as residue contact network, where each node is a residue. Two nodes are connected if they have a pair of atoms (one from each node) within the threshold distance. To extract the node/residue features, we use the protein language model. The input to the language model is the protein sequence, and the output is the feature vector for each amino acid of the underlying sequence. We validate the predictive capability of the proposed graph-based approach on two PPI datasets: Human and S. cerevisiae. Obtained results demonstrate the effectiveness of the proposed approach as it outperforms the previous leading methods. The source code for training and data to train the model are available at https://github.com/JhaKanchan15/PPI_GNN.git.
format Online
Article
Text
id pubmed-9120162
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91201622022-05-21 Prediction of protein–protein interaction using graph neural networks Jha, Kanchan Saha, Sriparna Singh, Hiteshi Sci Rep Article Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biological activities. The knowledge of protein–protein interactions (PPIs) unravels the cellular behavior and its functionality. The computational methods automate the prediction of PPI and are less expensive than experimental methods in terms of resources and time. So far, most of the works on PPI have mainly focused on sequence information. Here, we use graph convolutional network (GCN) and graph attention network (GAT) to predict the interaction between proteins by utilizing protein’s structural information and sequence features. We build the graphs of proteins from their PDB files, which contain 3D coordinates of atoms. The protein graph represents the amino acid network, also known as residue contact network, where each node is a residue. Two nodes are connected if they have a pair of atoms (one from each node) within the threshold distance. To extract the node/residue features, we use the protein language model. The input to the language model is the protein sequence, and the output is the feature vector for each amino acid of the underlying sequence. We validate the predictive capability of the proposed graph-based approach on two PPI datasets: Human and S. cerevisiae. Obtained results demonstrate the effectiveness of the proposed approach as it outperforms the previous leading methods. The source code for training and data to train the model are available at https://github.com/JhaKanchan15/PPI_GNN.git. Nature Publishing Group UK 2022-05-19 /pmc/articles/PMC9120162/ /pubmed/35589837 http://dx.doi.org/10.1038/s41598-022-12201-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Jha, Kanchan
Saha, Sriparna
Singh, Hiteshi
Prediction of protein–protein interaction using graph neural networks
title Prediction of protein–protein interaction using graph neural networks
title_full Prediction of protein–protein interaction using graph neural networks
title_fullStr Prediction of protein–protein interaction using graph neural networks
title_full_unstemmed Prediction of protein–protein interaction using graph neural networks
title_short Prediction of protein–protein interaction using graph neural networks
title_sort prediction of protein–protein interaction using graph neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9120162/
https://www.ncbi.nlm.nih.gov/pubmed/35589837
http://dx.doi.org/10.1038/s41598-022-12201-9
work_keys_str_mv AT jhakanchan predictionofproteinproteininteractionusinggraphneuralnetworks
AT sahasriparna predictionofproteinproteininteractionusinggraphneuralnetworks
AT singhhiteshi predictionofproteinproteininteractionusinggraphneuralnetworks