Cargando…

Protein function annotation based on heterogeneous biological networks

BACKGROUND: Accurate annotation of protein function is the key to understanding life at the molecular level and has great implications for biomedicine and pharmaceuticals. The rapid developments of high-throughput technologies have generated huge amounts of protein–protein interaction (PPI) data, wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Sai, Luo, Yingchun, Zhang, Zhihong, Xiong, Huijun, Yan, Wei, Jiang, Meiping, Zhao, Bihai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675105/
https://www.ncbi.nlm.nih.gov/pubmed/36401161
http://dx.doi.org/10.1186/s12859-022-05057-3
_version_ 1784833296349265920
author Hu, Sai
Luo, Yingchun
Zhang, Zhihong
Xiong, Huijun
Yan, Wei
Jiang, Meiping
Zhao, Bihai
author_facet Hu, Sai
Luo, Yingchun
Zhang, Zhihong
Xiong, Huijun
Yan, Wei
Jiang, Meiping
Zhao, Bihai
author_sort Hu, Sai
collection PubMed
description BACKGROUND: Accurate annotation of protein function is the key to understanding life at the molecular level and has great implications for biomedicine and pharmaceuticals. The rapid developments of high-throughput technologies have generated huge amounts of protein–protein interaction (PPI) data, which prompts the emergence of computational methods to determine protein function. Plagued by errors and noises hidden in PPI data, these computational methods have undertaken to focus on the prediction of functions by integrating the topology of protein interaction networks and multi-source biological data. Despite effective improvement of these computational methods, it is still challenging to build a suitable network model for integrating multiplex biological data. RESULTS: In this paper, we constructed a heterogeneous biological network by initially integrating original protein interaction networks, protein-domain association data and protein complexes. To prove the effectiveness of the heterogeneous biological network, we applied the propagation algorithm on this network, and proposed a novel iterative model, named Propagate on Heterogeneous Biological Networks (PHN) to score and rank functions in descending order from all functional partners, Finally, we picked out top L of these predicted functions as candidates to annotate the target protein. Our comprehensive experimental results demonstrated that PHN outperformed seven other competing approaches using cross-validation. Experimental results indicated that PHN performs significantly better than competing methods and improves the Area Under the Receiver-Operating Curve (AUROC) in Biological Process (BP), Molecular Function (MF) and Cellular Components (CC) by no less than 33%, 15% and 28%, respectively. CONCLUSIONS: We demonstrated that integrating multi-source data into a heterogeneous biological network can preserve the complex relationship among multiplex biological data and improve the prediction accuracy of protein function by getting rid of the constraints of errors in PPI networks effectively. PHN, our proposed method, is effective for protein function prediction.
format Online
Article
Text
id pubmed-9675105
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96751052022-11-20 Protein function annotation based on heterogeneous biological networks Hu, Sai Luo, Yingchun Zhang, Zhihong Xiong, Huijun Yan, Wei Jiang, Meiping Zhao, Bihai BMC Bioinformatics Research BACKGROUND: Accurate annotation of protein function is the key to understanding life at the molecular level and has great implications for biomedicine and pharmaceuticals. The rapid developments of high-throughput technologies have generated huge amounts of protein–protein interaction (PPI) data, which prompts the emergence of computational methods to determine protein function. Plagued by errors and noises hidden in PPI data, these computational methods have undertaken to focus on the prediction of functions by integrating the topology of protein interaction networks and multi-source biological data. Despite effective improvement of these computational methods, it is still challenging to build a suitable network model for integrating multiplex biological data. RESULTS: In this paper, we constructed a heterogeneous biological network by initially integrating original protein interaction networks, protein-domain association data and protein complexes. To prove the effectiveness of the heterogeneous biological network, we applied the propagation algorithm on this network, and proposed a novel iterative model, named Propagate on Heterogeneous Biological Networks (PHN) to score and rank functions in descending order from all functional partners, Finally, we picked out top L of these predicted functions as candidates to annotate the target protein. Our comprehensive experimental results demonstrated that PHN outperformed seven other competing approaches using cross-validation. Experimental results indicated that PHN performs significantly better than competing methods and improves the Area Under the Receiver-Operating Curve (AUROC) in Biological Process (BP), Molecular Function (MF) and Cellular Components (CC) by no less than 33%, 15% and 28%, respectively. CONCLUSIONS: We demonstrated that integrating multi-source data into a heterogeneous biological network can preserve the complex relationship among multiplex biological data and improve the prediction accuracy of protein function by getting rid of the constraints of errors in PPI networks effectively. PHN, our proposed method, is effective for protein function prediction. BioMed Central 2022-11-18 /pmc/articles/PMC9675105/ /pubmed/36401161 http://dx.doi.org/10.1186/s12859-022-05057-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hu, Sai
Luo, Yingchun
Zhang, Zhihong
Xiong, Huijun
Yan, Wei
Jiang, Meiping
Zhao, Bihai
Protein function annotation based on heterogeneous biological networks
title Protein function annotation based on heterogeneous biological networks
title_full Protein function annotation based on heterogeneous biological networks
title_fullStr Protein function annotation based on heterogeneous biological networks
title_full_unstemmed Protein function annotation based on heterogeneous biological networks
title_short Protein function annotation based on heterogeneous biological networks
title_sort protein function annotation based on heterogeneous biological networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675105/
https://www.ncbi.nlm.nih.gov/pubmed/36401161
http://dx.doi.org/10.1186/s12859-022-05057-3
work_keys_str_mv AT husai proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT luoyingchun proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT zhangzhihong proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT xionghuijun proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT yanwei proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT jiangmeiping proteinfunctionannotationbasedonheterogeneousbiologicalnetworks
AT zhaobihai proteinfunctionannotationbasedonheterogeneousbiologicalnetworks