Cargando…

HNetGO: protein function prediction via heterogeneous network transformer

Protein function annotation is one of the most important research topics for revealing the essence of life at molecular level in the post-genome era. Current research shows that integrating multisource data can effectively improve the performance of protein function prediction models. However, the h...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xiaoshuai, Guo, Huannan, Zhang, Fan, Wang, Xuan, Wu, Kaitao, Qiu, Shizheng, Liu, Bo, Wang, Yadong, Hu, Yang, Li, Junyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588005/
https://www.ncbi.nlm.nih.gov/pubmed/37861172
http://dx.doi.org/10.1093/bib/bbab556
_version_ 1785123484938010624
author Zhang, Xiaoshuai
Guo, Huannan
Zhang, Fan
Wang, Xuan
Wu, Kaitao
Qiu, Shizheng
Liu, Bo
Wang, Yadong
Hu, Yang
Li, Junyi
author_facet Zhang, Xiaoshuai
Guo, Huannan
Zhang, Fan
Wang, Xuan
Wu, Kaitao
Qiu, Shizheng
Liu, Bo
Wang, Yadong
Hu, Yang
Li, Junyi
author_sort Zhang, Xiaoshuai
collection PubMed
description Protein function annotation is one of the most important research topics for revealing the essence of life at molecular level in the post-genome era. Current research shows that integrating multisource data can effectively improve the performance of protein function prediction models. However, the heavy reliance on complex feature engineering and model integration methods limits the development of existing methods. Besides, models based on deep learning only use labeled data in a certain dataset to extract sequence features, thus ignoring a large amount of existing unlabeled sequence data. Here, we propose an end-to-end protein function annotation model named HNetGO, which innovatively uses heterogeneous network to integrate protein sequence similarity and protein–protein interaction network information and combines the pretraining model to extract the semantic features of the protein sequence. In addition, we design an attention-based graph neural network model, which can effectively extract node-level features from heterogeneous networks and predict protein function by measuring the similarity between protein nodes and gene ontology term nodes. Comparative experiments on the human dataset show that HNetGO achieves state-of-the-art performance on cellular component and molecular function branches.
format Online
Article
Text
id pubmed-10588005
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105880052023-10-21 HNetGO: protein function prediction via heterogeneous network transformer Zhang, Xiaoshuai Guo, Huannan Zhang, Fan Wang, Xuan Wu, Kaitao Qiu, Shizheng Liu, Bo Wang, Yadong Hu, Yang Li, Junyi Brief Bioinform Problem Solving Protocol Protein function annotation is one of the most important research topics for revealing the essence of life at molecular level in the post-genome era. Current research shows that integrating multisource data can effectively improve the performance of protein function prediction models. However, the heavy reliance on complex feature engineering and model integration methods limits the development of existing methods. Besides, models based on deep learning only use labeled data in a certain dataset to extract sequence features, thus ignoring a large amount of existing unlabeled sequence data. Here, we propose an end-to-end protein function annotation model named HNetGO, which innovatively uses heterogeneous network to integrate protein sequence similarity and protein–protein interaction network information and combines the pretraining model to extract the semantic features of the protein sequence. In addition, we design an attention-based graph neural network model, which can effectively extract node-level features from heterogeneous networks and predict protein function by measuring the similarity between protein nodes and gene ontology term nodes. Comparative experiments on the human dataset show that HNetGO achieves state-of-the-art performance on cellular component and molecular function branches. Oxford University Press 2023-10-19 /pmc/articles/PMC10588005/ /pubmed/37861172 http://dx.doi.org/10.1093/bib/bbab556 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Zhang, Xiaoshuai
Guo, Huannan
Zhang, Fan
Wang, Xuan
Wu, Kaitao
Qiu, Shizheng
Liu, Bo
Wang, Yadong
Hu, Yang
Li, Junyi
HNetGO: protein function prediction via heterogeneous network transformer
title HNetGO: protein function prediction via heterogeneous network transformer
title_full HNetGO: protein function prediction via heterogeneous network transformer
title_fullStr HNetGO: protein function prediction via heterogeneous network transformer
title_full_unstemmed HNetGO: protein function prediction via heterogeneous network transformer
title_short HNetGO: protein function prediction via heterogeneous network transformer
title_sort hnetgo: protein function prediction via heterogeneous network transformer
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588005/
https://www.ncbi.nlm.nih.gov/pubmed/37861172
http://dx.doi.org/10.1093/bib/bbab556
work_keys_str_mv AT zhangxiaoshuai hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT guohuannan hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT zhangfan hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT wangxuan hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT wukaitao hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT qiushizheng hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT liubo hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT wangyadong hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT huyang hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer
AT lijunyi hnetgoproteinfunctionpredictionviaheterogeneousnetworktransformer