Cargando…

Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction

BACKGROUND: Protein–protein interactions are widespread in biological systems and play an important role in cell biology. Since traditional laboratory-based methods have some drawbacks, such as time-consuming, money-consuming, etc., a large number of methods based on deep learning have emerged. Howe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhong, Wen, He, Changxiang, Xiao, Chen, Liu, Yuru, Qin, Xiaofei, Yu, Zhensheng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724439/ https://www.ncbi.nlm.nih.gov/pubmed/36471248 http://dx.doi.org/10.1186/s12859-022-05062-6

_version_	1784844416332070912
author	Zhong, Wen He, Changxiang Xiao, Chen Liu, Yuru Qin, Xiaofei Yu, Zhensheng
author_facet	Zhong, Wen He, Changxiang Xiao, Chen Liu, Yuru Qin, Xiaofei Yu, Zhensheng
author_sort	Zhong, Wen
collection	PubMed
description	BACKGROUND: Protein–protein interactions are widespread in biological systems and play an important role in cell biology. Since traditional laboratory-based methods have some drawbacks, such as time-consuming, money-consuming, etc., a large number of methods based on deep learning have emerged. However, these methods do not take into account the long-distance dependency information between each two amino acids in sequence. In addition, most existing models based on graph neural networks only aggregate the first-order neighbors in protein–protein interaction (PPI) network. Although multi-order neighbor information can be aggregated by increasing the number of layers of neural network, it is easy to cause over-fitting. So, it is necessary to design a network that can capture long distance dependency information between amino acids in the sequence and can directly capture multi-order neighbor information in protein–protein interaction network. RESULTS: In this study, we propose a multi-hop neural network (LDMGNN) model combining long distance dependency information to predict the multi-label protein–protein interactions. In the LDMGNN model, we design the protein amino acid sequence encoding (PAASE) module with the multi-head self-attention Transformer block to extract the features of amino acid sequences by calculating the interdependence between every two amino acids. And expand the receptive field in space by constructing a two-hop protein–protein interaction (THPPI) network. We combine PPI network and THPPI network with amino acid sequence features respectively, then input them into two identical GIN blocks at the same time to obtain two embeddings. Next, the two embeddings are fused and input to the classifier for predict multi-label protein–protein interactions. Compared with other state-of-the-art methods, LDMGNN shows the best performance on both the SHS27K and SHS148k datasets. Ablation experiments show that the PAASE module and the construction of THPPI network are feasible and effective. CONCLUSIONS: In general terms, our proposed LDMGNN model has achieved satisfactory results in the prediction of multi-label protein–protein interactions.
format	Online Article Text
id	pubmed-9724439
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-97244392022-12-07 Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction Zhong, Wen He, Changxiang Xiao, Chen Liu, Yuru Qin, Xiaofei Yu, Zhensheng BMC Bioinformatics Research BACKGROUND: Protein–protein interactions are widespread in biological systems and play an important role in cell biology. Since traditional laboratory-based methods have some drawbacks, such as time-consuming, money-consuming, etc., a large number of methods based on deep learning have emerged. However, these methods do not take into account the long-distance dependency information between each two amino acids in sequence. In addition, most existing models based on graph neural networks only aggregate the first-order neighbors in protein–protein interaction (PPI) network. Although multi-order neighbor information can be aggregated by increasing the number of layers of neural network, it is easy to cause over-fitting. So, it is necessary to design a network that can capture long distance dependency information between amino acids in the sequence and can directly capture multi-order neighbor information in protein–protein interaction network. RESULTS: In this study, we propose a multi-hop neural network (LDMGNN) model combining long distance dependency information to predict the multi-label protein–protein interactions. In the LDMGNN model, we design the protein amino acid sequence encoding (PAASE) module with the multi-head self-attention Transformer block to extract the features of amino acid sequences by calculating the interdependence between every two amino acids. And expand the receptive field in space by constructing a two-hop protein–protein interaction (THPPI) network. We combine PPI network and THPPI network with amino acid sequence features respectively, then input them into two identical GIN blocks at the same time to obtain two embeddings. Next, the two embeddings are fused and input to the classifier for predict multi-label protein–protein interactions. Compared with other state-of-the-art methods, LDMGNN shows the best performance on both the SHS27K and SHS148k datasets. Ablation experiments show that the PAASE module and the construction of THPPI network are feasible and effective. CONCLUSIONS: In general terms, our proposed LDMGNN model has achieved satisfactory results in the prediction of multi-label protein–protein interactions. BioMed Central 2022-12-05 /pmc/articles/PMC9724439/ /pubmed/36471248 http://dx.doi.org/10.1186/s12859-022-05062-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Zhong, Wen He, Changxiang Xiao, Chen Liu, Yuru Qin, Xiaofei Yu, Zhensheng Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title	Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title_full	Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title_fullStr	Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title_full_unstemmed	Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title_short	Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
title_sort	long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9724439/ https://www.ncbi.nlm.nih.gov/pubmed/36471248 http://dx.doi.org/10.1186/s12859-022-05062-6
work_keys_str_mv	AT zhongwen longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction AT hechangxiang longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction AT xiaochen longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction AT liuyuru longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction AT qinxiaofei longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction AT yuzhensheng longdistancedependencycombinedmultihopgraphneuralnetworksforproteinproteininteractionsprediction

Long-distance dependency combined multi-hop graph neural networks for protein–protein interactions prediction

Ejemplares similares