Cargando…

CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction

MOTIVATION: Protein function annotation is fundamental to understanding biological mechanisms. The abundant genome-scale protein–protein interaction (PPI) networks, together with other protein biological attributes, provide rich information for annotating protein functions. As PPI networks and biolo...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Zhourun, Guo, Mingyue, Jin, Xiaopeng, Chen, Junjie, Liu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10032634/
https://www.ncbi.nlm.nih.gov/pubmed/36883697
http://dx.doi.org/10.1093/bioinformatics/btad123
_version_ 1784910845920149504
author Wu, Zhourun
Guo, Mingyue
Jin, Xiaopeng
Chen, Junjie
Liu, Bin
author_facet Wu, Zhourun
Guo, Mingyue
Jin, Xiaopeng
Chen, Junjie
Liu, Bin
author_sort Wu, Zhourun
collection PubMed
description MOTIVATION: Protein function annotation is fundamental to understanding biological mechanisms. The abundant genome-scale protein–protein interaction (PPI) networks, together with other protein biological attributes, provide rich information for annotating protein functions. As PPI networks and biological attributes describe protein functions from different perspectives, it is highly challenging to cross-fuse them for protein function prediction. Recently, several methods combine the PPI networks and protein attributes via the graph neural networks (GNNs). However, GNNs may inherit or even magnify the bias caused by noisy edges in PPI networks. Besides, GNNs with stacking of many layers may cause the over-smoothing problem of node representations. RESULTS: We develop a novel protein function prediction method, CFAGO, to integrate single-species PPI networks and protein biological attributes via a multi-head attention mechanism. CFAGO is first pre-trained with an encoder–decoder architecture to capture the universal protein representation of the two sources. It is then fine-tuned to learn more effective protein representations for protein function prediction. Benchmark experiments on human and mouse datasets show CFAGO outperforms state-of-the-art single-species network-based methods by at least 7.59%, 6.90%, 11.68% in terms of m-AUPR, M-AUPR, and Fmax, respectively, demonstrating cross-fusion by multi-head attention mechanism can greatly improve the protein function prediction. We further evaluate the quality of captured protein representations in terms of Davies Bouldin Score, whose results show that cross-fused protein representations by multi-head attention mechanism are at least 2.7% better than that of original and concatenated representations. We believe CFAGO is an effective tool for protein function prediction. AVAILABILITY AND IMPLEMENTATION: The source code of CFAGO and experiments data are available at: http://bliulab.net/CFAGO/.
format Online
Article
Text
id pubmed-10032634
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100326342023-03-23 CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction Wu, Zhourun Guo, Mingyue Jin, Xiaopeng Chen, Junjie Liu, Bin Bioinformatics Original Paper MOTIVATION: Protein function annotation is fundamental to understanding biological mechanisms. The abundant genome-scale protein–protein interaction (PPI) networks, together with other protein biological attributes, provide rich information for annotating protein functions. As PPI networks and biological attributes describe protein functions from different perspectives, it is highly challenging to cross-fuse them for protein function prediction. Recently, several methods combine the PPI networks and protein attributes via the graph neural networks (GNNs). However, GNNs may inherit or even magnify the bias caused by noisy edges in PPI networks. Besides, GNNs with stacking of many layers may cause the over-smoothing problem of node representations. RESULTS: We develop a novel protein function prediction method, CFAGO, to integrate single-species PPI networks and protein biological attributes via a multi-head attention mechanism. CFAGO is first pre-trained with an encoder–decoder architecture to capture the universal protein representation of the two sources. It is then fine-tuned to learn more effective protein representations for protein function prediction. Benchmark experiments on human and mouse datasets show CFAGO outperforms state-of-the-art single-species network-based methods by at least 7.59%, 6.90%, 11.68% in terms of m-AUPR, M-AUPR, and Fmax, respectively, demonstrating cross-fusion by multi-head attention mechanism can greatly improve the protein function prediction. We further evaluate the quality of captured protein representations in terms of Davies Bouldin Score, whose results show that cross-fused protein representations by multi-head attention mechanism are at least 2.7% better than that of original and concatenated representations. We believe CFAGO is an effective tool for protein function prediction. AVAILABILITY AND IMPLEMENTATION: The source code of CFAGO and experiments data are available at: http://bliulab.net/CFAGO/. Oxford University Press 2023-03-08 /pmc/articles/PMC10032634/ /pubmed/36883697 http://dx.doi.org/10.1093/bioinformatics/btad123 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Wu, Zhourun
Guo, Mingyue
Jin, Xiaopeng
Chen, Junjie
Liu, Bin
CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title_full CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title_fullStr CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title_full_unstemmed CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title_short CFAGO: cross-fusion of network and attributes based on attention mechanism for protein function prediction
title_sort cfago: cross-fusion of network and attributes based on attention mechanism for protein function prediction
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10032634/
https://www.ncbi.nlm.nih.gov/pubmed/36883697
http://dx.doi.org/10.1093/bioinformatics/btad123
work_keys_str_mv AT wuzhourun cfagocrossfusionofnetworkandattributesbasedonattentionmechanismforproteinfunctionprediction
AT guomingyue cfagocrossfusionofnetworkandattributesbasedonattentionmechanismforproteinfunctionprediction
AT jinxiaopeng cfagocrossfusionofnetworkandattributesbasedonattentionmechanismforproteinfunctionprediction
AT chenjunjie cfagocrossfusionofnetworkandattributesbasedonattentionmechanismforproteinfunctionprediction
AT liubin cfagocrossfusionofnetworkandattributesbasedonattentionmechanismforproteinfunctionprediction