Cargando…
Graph-based prediction of Protein-protein interactions with attributed signed graph embedding
BACKGROUND: Protein-protein interactions (PPIs) are central to many biological processes. Considering that the experimental methods for identifying PPIs are time-consuming and expensive, it is important to develop automated computational methods to better predict PPIs. Various machine learning metho...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372763/ https://www.ncbi.nlm.nih.gov/pubmed/32693790 http://dx.doi.org/10.1186/s12859-020-03646-8 |
_version_ | 1783561376306823168 |
---|---|
author | Yang, Fang Fan, Kunjie Song, Dandan Lin, Huakang |
author_facet | Yang, Fang Fan, Kunjie Song, Dandan Lin, Huakang |
author_sort | Yang, Fang |
collection | PubMed |
description | BACKGROUND: Protein-protein interactions (PPIs) are central to many biological processes. Considering that the experimental methods for identifying PPIs are time-consuming and expensive, it is important to develop automated computational methods to better predict PPIs. Various machine learning methods have been proposed, including a deep learning technique which is sequence-based that has achieved promising results. However, it only focuses on sequence information while ignoring the structural information of PPI networks. Structural information of PPI networks such as their degree, position, and neighboring nodes in a graph has been proved to be informative in PPI prediction. RESULTS: Facing the challenge of representing graph information, we introduce an improved graph representation learning method. Our model can study PPI prediction based on both sequence information and graph structure. Moreover, our study takes advantage of a representation learning model and employs a graph-based deep learning method for PPI prediction, which shows superiority over existing sequence-based methods. Statistically, Our method achieves state-of-the-art accuracy of 99.15% on Human protein reference database (HPRD) dataset and also obtains best results on Database of Interacting Protein (DIP) Human, Drosophila, Escherichia coli (E. coli), and Caenorhabditis elegans (C. elegan) datasets. CONCLUSION: Here, we introduce signed variational graph auto-encoder (S-VGAE), an improved graph representation learning method, to automatically learn to encode graph structure into low-dimensional embeddings. Experimental results demonstrate that our method outperforms other existing sequence-based methods on several datasets. We also prove the robustness of our model for very sparse networks and the generalization for a new dataset that consists of four datasets: HPRD, E.coli, C.elegan, and Drosophila. |
format | Online Article Text |
id | pubmed-7372763 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-73727632020-07-21 Graph-based prediction of Protein-protein interactions with attributed signed graph embedding Yang, Fang Fan, Kunjie Song, Dandan Lin, Huakang BMC Bioinformatics Methodology Article BACKGROUND: Protein-protein interactions (PPIs) are central to many biological processes. Considering that the experimental methods for identifying PPIs are time-consuming and expensive, it is important to develop automated computational methods to better predict PPIs. Various machine learning methods have been proposed, including a deep learning technique which is sequence-based that has achieved promising results. However, it only focuses on sequence information while ignoring the structural information of PPI networks. Structural information of PPI networks such as their degree, position, and neighboring nodes in a graph has been proved to be informative in PPI prediction. RESULTS: Facing the challenge of representing graph information, we introduce an improved graph representation learning method. Our model can study PPI prediction based on both sequence information and graph structure. Moreover, our study takes advantage of a representation learning model and employs a graph-based deep learning method for PPI prediction, which shows superiority over existing sequence-based methods. Statistically, Our method achieves state-of-the-art accuracy of 99.15% on Human protein reference database (HPRD) dataset and also obtains best results on Database of Interacting Protein (DIP) Human, Drosophila, Escherichia coli (E. coli), and Caenorhabditis elegans (C. elegan) datasets. CONCLUSION: Here, we introduce signed variational graph auto-encoder (S-VGAE), an improved graph representation learning method, to automatically learn to encode graph structure into low-dimensional embeddings. Experimental results demonstrate that our method outperforms other existing sequence-based methods on several datasets. We also prove the robustness of our model for very sparse networks and the generalization for a new dataset that consists of four datasets: HPRD, E.coli, C.elegan, and Drosophila. BioMed Central 2020-07-21 /pmc/articles/PMC7372763/ /pubmed/32693790 http://dx.doi.org/10.1186/s12859-020-03646-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Yang, Fang Fan, Kunjie Song, Dandan Lin, Huakang Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title | Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title_full | Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title_fullStr | Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title_full_unstemmed | Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title_short | Graph-based prediction of Protein-protein interactions with attributed signed graph embedding |
title_sort | graph-based prediction of protein-protein interactions with attributed signed graph embedding |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7372763/ https://www.ncbi.nlm.nih.gov/pubmed/32693790 http://dx.doi.org/10.1186/s12859-020-03646-8 |
work_keys_str_mv | AT yangfang graphbasedpredictionofproteinproteininteractionswithattributedsignedgraphembedding AT fankunjie graphbasedpredictionofproteinproteininteractionswithattributedsignedgraphembedding AT songdandan graphbasedpredictionofproteinproteininteractionswithattributedsignedgraphembedding AT linhuakang graphbasedpredictionofproteinproteininteractionswithattributedsignedgraphembedding |