Cargando…

Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction

BACKGROUND: Protein–protein interaction (PPI) plays an important role in regulating cells and signals. Despite the ongoing efforts of the bioassay group, continued incomplete data limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computationa...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Xiao-Rui, Hu, Lun, You, Zhu-Hong, Hu, Peng-Wei, Zhao, Bo-Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205098/
https://www.ncbi.nlm.nih.gov/pubmed/35710342
http://dx.doi.org/10.1186/s12859-022-04766-z
_version_ 1784729058516402176
author Su, Xiao-Rui
Hu, Lun
You, Zhu-Hong
Hu, Peng-Wei
Zhao, Bo-Wei
author_facet Su, Xiao-Rui
Hu, Lun
You, Zhu-Hong
Hu, Peng-Wei
Zhao, Bo-Wei
author_sort Su, Xiao-Rui
collection PubMed
description BACKGROUND: Protein–protein interaction (PPI) plays an important role in regulating cells and signals. Despite the ongoing efforts of the bioassay group, continued incomplete data limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computational method to predict PPIs from the perspective of molecular system. METHODS: In this paper, a highly efficient computational model, MTV-PPI, is proposed for PPI prediction based on a heterogeneous molecular network by learning inter-view protein sequences and intra-view interactions between molecules simultaneously. On the one hand, the inter-view feature is extracted from the protein sequence by k-mer method. On the other hand, we use a popular embedding method LINE to encode the heterogeneous molecular network to obtain the intra-view feature. Thus, the protein representation used in MTV-PPI is constructed by the aggregation of its inter-view feature and intra-view feature. Finally, random forest is integrated to predict potential PPIs. RESULTS: To prove the effectiveness of MTV-PPI, we conduct extensive experiments on a collected heterogeneous molecular network with the accuracy of 86.55%, sensitivity of 82.49%, precision of 89.79%, AUC of 0.9301 and AUPR of 0.9308. Further comparison experiments are performed with various protein representations and classifiers to indicate the effectiveness of MTV-PPI in predicting PPIs based on a complex network. CONCLUSION: The achieved experimental results illustrate that MTV-PPI is a promising tool for PPI prediction, which may provide a new perspective for the future interactions prediction researches based on heterogeneous molecular network.
format Online
Article
Text
id pubmed-9205098
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92050982022-06-18 Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction Su, Xiao-Rui Hu, Lun You, Zhu-Hong Hu, Peng-Wei Zhao, Bo-Wei BMC Bioinformatics Research BACKGROUND: Protein–protein interaction (PPI) plays an important role in regulating cells and signals. Despite the ongoing efforts of the bioassay group, continued incomplete data limits our ability to understand the molecular roots of human disease. Therefore, it is urgent to develop a computational method to predict PPIs from the perspective of molecular system. METHODS: In this paper, a highly efficient computational model, MTV-PPI, is proposed for PPI prediction based on a heterogeneous molecular network by learning inter-view protein sequences and intra-view interactions between molecules simultaneously. On the one hand, the inter-view feature is extracted from the protein sequence by k-mer method. On the other hand, we use a popular embedding method LINE to encode the heterogeneous molecular network to obtain the intra-view feature. Thus, the protein representation used in MTV-PPI is constructed by the aggregation of its inter-view feature and intra-view feature. Finally, random forest is integrated to predict potential PPIs. RESULTS: To prove the effectiveness of MTV-PPI, we conduct extensive experiments on a collected heterogeneous molecular network with the accuracy of 86.55%, sensitivity of 82.49%, precision of 89.79%, AUC of 0.9301 and AUPR of 0.9308. Further comparison experiments are performed with various protein representations and classifiers to indicate the effectiveness of MTV-PPI in predicting PPIs based on a complex network. CONCLUSION: The achieved experimental results illustrate that MTV-PPI is a promising tool for PPI prediction, which may provide a new perspective for the future interactions prediction researches based on heterogeneous molecular network. BioMed Central 2022-06-16 /pmc/articles/PMC9205098/ /pubmed/35710342 http://dx.doi.org/10.1186/s12859-022-04766-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Su, Xiao-Rui
Hu, Lun
You, Zhu-Hong
Hu, Peng-Wei
Zhao, Bo-Wei
Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title_full Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title_fullStr Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title_full_unstemmed Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title_short Multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
title_sort multi-view heterogeneous molecular network representation learning for protein–protein interaction prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205098/
https://www.ncbi.nlm.nih.gov/pubmed/35710342
http://dx.doi.org/10.1186/s12859-022-04766-z
work_keys_str_mv AT suxiaorui multiviewheterogeneousmolecularnetworkrepresentationlearningforproteinproteininteractionprediction
AT hulun multiviewheterogeneousmolecularnetworkrepresentationlearningforproteinproteininteractionprediction
AT youzhuhong multiviewheterogeneousmolecularnetworkrepresentationlearningforproteinproteininteractionprediction
AT hupengwei multiviewheterogeneousmolecularnetworkrepresentationlearningforproteinproteininteractionprediction
AT zhaobowei multiviewheterogeneousmolecularnetworkrepresentationlearningforproteinproteininteractionprediction