Cargando…

Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network

In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain hete...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Xin, Kuang, Linai, Chen, Zhiping, Tan, Yihong, Wang, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8276041/
https://www.ncbi.nlm.nih.gov/pubmed/34267785
http://dx.doi.org/10.3389/fgene.2021.708162
_version_ 1783721835654807552
author He, Xin
Kuang, Linai
Chen, Zhiping
Tan, Yihong
Wang, Lei
author_facet He, Xin
Kuang, Linai
Chen, Zhiping
Tan, Yihong
Wang, Lei
author_sort He, Xin
collection PubMed
description In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain heterogeneous network is established first by combining known protein-protein interactions with known associations between proteins and domains. Next, based on key topological characteristics extracted from the newly constructed protein-domain network and functional characteristics extracted from multiple biological information of proteins, a new computational method is designed to effectively integrate multiple biological features to infer potential essential proteins based on an improved PageRank algorithm. Finally, in order to evaluate the performance of KFPM, we compared it with 13 state-of-the-art prediction methods, experimental results show that, among the top 1, 5, and 10% of candidate proteins predicted by KFPM, the prediction accuracy can achieve 96.08, 83.14, and 70.59%, respectively, which significantly outperform all these 13 competitive methods. It means that KFPM may be a meaningful tool for prediction of potential essential proteins in the future.
format Online
Article
Text
id pubmed-8276041
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82760412021-07-14 Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network He, Xin Kuang, Linai Chen, Zhiping Tan, Yihong Wang, Lei Front Genet Genetics In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain heterogeneous network is established first by combining known protein-protein interactions with known associations between proteins and domains. Next, based on key topological characteristics extracted from the newly constructed protein-domain network and functional characteristics extracted from multiple biological information of proteins, a new computational method is designed to effectively integrate multiple biological features to infer potential essential proteins based on an improved PageRank algorithm. Finally, in order to evaluate the performance of KFPM, we compared it with 13 state-of-the-art prediction methods, experimental results show that, among the top 1, 5, and 10% of candidate proteins predicted by KFPM, the prediction accuracy can achieve 96.08, 83.14, and 70.59%, respectively, which significantly outperform all these 13 competitive methods. It means that KFPM may be a meaningful tool for prediction of potential essential proteins in the future. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8276041/ /pubmed/34267785 http://dx.doi.org/10.3389/fgene.2021.708162 Text en Copyright © 2021 He, Kuang, Chen, Tan and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
He, Xin
Kuang, Linai
Chen, Zhiping
Tan, Yihong
Wang, Lei
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title_full Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title_fullStr Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title_full_unstemmed Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title_short Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
title_sort method for identifying essential proteins by key features of proteins in a novel protein-domain network
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8276041/
https://www.ncbi.nlm.nih.gov/pubmed/34267785
http://dx.doi.org/10.3389/fgene.2021.708162
work_keys_str_mv AT hexin methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork
AT kuanglinai methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork
AT chenzhiping methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork
AT tanyihong methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork
AT wanglei methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork