Cargando…
Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network
In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain hete...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8276041/ https://www.ncbi.nlm.nih.gov/pubmed/34267785 http://dx.doi.org/10.3389/fgene.2021.708162 |
_version_ | 1783721835654807552 |
---|---|
author | He, Xin Kuang, Linai Chen, Zhiping Tan, Yihong Wang, Lei |
author_facet | He, Xin Kuang, Linai Chen, Zhiping Tan, Yihong Wang, Lei |
author_sort | He, Xin |
collection | PubMed |
description | In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain heterogeneous network is established first by combining known protein-protein interactions with known associations between proteins and domains. Next, based on key topological characteristics extracted from the newly constructed protein-domain network and functional characteristics extracted from multiple biological information of proteins, a new computational method is designed to effectively integrate multiple biological features to infer potential essential proteins based on an improved PageRank algorithm. Finally, in order to evaluate the performance of KFPM, we compared it with 13 state-of-the-art prediction methods, experimental results show that, among the top 1, 5, and 10% of candidate proteins predicted by KFPM, the prediction accuracy can achieve 96.08, 83.14, and 70.59%, respectively, which significantly outperform all these 13 competitive methods. It means that KFPM may be a meaningful tool for prediction of potential essential proteins in the future. |
format | Online Article Text |
id | pubmed-8276041 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82760412021-07-14 Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network He, Xin Kuang, Linai Chen, Zhiping Tan, Yihong Wang, Lei Front Genet Genetics In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain heterogeneous network is established first by combining known protein-protein interactions with known associations between proteins and domains. Next, based on key topological characteristics extracted from the newly constructed protein-domain network and functional characteristics extracted from multiple biological information of proteins, a new computational method is designed to effectively integrate multiple biological features to infer potential essential proteins based on an improved PageRank algorithm. Finally, in order to evaluate the performance of KFPM, we compared it with 13 state-of-the-art prediction methods, experimental results show that, among the top 1, 5, and 10% of candidate proteins predicted by KFPM, the prediction accuracy can achieve 96.08, 83.14, and 70.59%, respectively, which significantly outperform all these 13 competitive methods. It means that KFPM may be a meaningful tool for prediction of potential essential proteins in the future. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8276041/ /pubmed/34267785 http://dx.doi.org/10.3389/fgene.2021.708162 Text en Copyright © 2021 He, Kuang, Chen, Tan and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics He, Xin Kuang, Linai Chen, Zhiping Tan, Yihong Wang, Lei Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title | Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title_full | Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title_fullStr | Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title_full_unstemmed | Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title_short | Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network |
title_sort | method for identifying essential proteins by key features of proteins in a novel protein-domain network |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8276041/ https://www.ncbi.nlm.nih.gov/pubmed/34267785 http://dx.doi.org/10.3389/fgene.2021.708162 |
work_keys_str_mv | AT hexin methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork AT kuanglinai methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork AT chenzhiping methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork AT tanyihong methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork AT wanglei methodforidentifyingessentialproteinsbykeyfeaturesofproteinsinanovelproteindomainnetwork |