Cargando…
Prediction of protein–protein interaction using graph neural networks
Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biolo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9120162/ https://www.ncbi.nlm.nih.gov/pubmed/35589837 http://dx.doi.org/10.1038/s41598-022-12201-9 |
_version_ | 1784710874183761920 |
---|---|
author | Jha, Kanchan Saha, Sriparna Singh, Hiteshi |
author_facet | Jha, Kanchan Saha, Sriparna Singh, Hiteshi |
author_sort | Jha, Kanchan |
collection | PubMed |
description | Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biological activities. The knowledge of protein–protein interactions (PPIs) unravels the cellular behavior and its functionality. The computational methods automate the prediction of PPI and are less expensive than experimental methods in terms of resources and time. So far, most of the works on PPI have mainly focused on sequence information. Here, we use graph convolutional network (GCN) and graph attention network (GAT) to predict the interaction between proteins by utilizing protein’s structural information and sequence features. We build the graphs of proteins from their PDB files, which contain 3D coordinates of atoms. The protein graph represents the amino acid network, also known as residue contact network, where each node is a residue. Two nodes are connected if they have a pair of atoms (one from each node) within the threshold distance. To extract the node/residue features, we use the protein language model. The input to the language model is the protein sequence, and the output is the feature vector for each amino acid of the underlying sequence. We validate the predictive capability of the proposed graph-based approach on two PPI datasets: Human and S. cerevisiae. Obtained results demonstrate the effectiveness of the proposed approach as it outperforms the previous leading methods. The source code for training and data to train the model are available at https://github.com/JhaKanchan15/PPI_GNN.git. |
format | Online Article Text |
id | pubmed-9120162 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-91201622022-05-21 Prediction of protein–protein interaction using graph neural networks Jha, Kanchan Saha, Sriparna Singh, Hiteshi Sci Rep Article Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein–protein interaction) present in their surroundings to complete biological activities. The knowledge of protein–protein interactions (PPIs) unravels the cellular behavior and its functionality. The computational methods automate the prediction of PPI and are less expensive than experimental methods in terms of resources and time. So far, most of the works on PPI have mainly focused on sequence information. Here, we use graph convolutional network (GCN) and graph attention network (GAT) to predict the interaction between proteins by utilizing protein’s structural information and sequence features. We build the graphs of proteins from their PDB files, which contain 3D coordinates of atoms. The protein graph represents the amino acid network, also known as residue contact network, where each node is a residue. Two nodes are connected if they have a pair of atoms (one from each node) within the threshold distance. To extract the node/residue features, we use the protein language model. The input to the language model is the protein sequence, and the output is the feature vector for each amino acid of the underlying sequence. We validate the predictive capability of the proposed graph-based approach on two PPI datasets: Human and S. cerevisiae. Obtained results demonstrate the effectiveness of the proposed approach as it outperforms the previous leading methods. The source code for training and data to train the model are available at https://github.com/JhaKanchan15/PPI_GNN.git. Nature Publishing Group UK 2022-05-19 /pmc/articles/PMC9120162/ /pubmed/35589837 http://dx.doi.org/10.1038/s41598-022-12201-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Jha, Kanchan Saha, Sriparna Singh, Hiteshi Prediction of protein–protein interaction using graph neural networks |
title | Prediction of protein–protein interaction using graph neural networks |
title_full | Prediction of protein–protein interaction using graph neural networks |
title_fullStr | Prediction of protein–protein interaction using graph neural networks |
title_full_unstemmed | Prediction of protein–protein interaction using graph neural networks |
title_short | Prediction of protein–protein interaction using graph neural networks |
title_sort | prediction of protein–protein interaction using graph neural networks |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9120162/ https://www.ncbi.nlm.nih.gov/pubmed/35589837 http://dx.doi.org/10.1038/s41598-022-12201-9 |
work_keys_str_mv | AT jhakanchan predictionofproteinproteininteractionusinggraphneuralnetworks AT sahasriparna predictionofproteinproteininteractionusinggraphneuralnetworks AT singhhiteshi predictionofproteinproteininteractionusinggraphneuralnetworks |