Cargando…

A novel method to predict essential proteins based on tensor and HITS algorithm

BACKGROUND: Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus mainly on...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhihong, Luo, Yingchun, Hu, Sai, Li, Xueyong, Wang, Lei, Zhao, Bihai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7137323/
https://www.ncbi.nlm.nih.gov/pubmed/32252824
http://dx.doi.org/10.1186/s40246-020-00263-7
_version_ 1783518404264591360
author Zhang, Zhihong
Luo, Yingchun
Hu, Sai
Li, Xueyong
Wang, Lei
Zhao, Bihai
author_facet Zhang, Zhihong
Luo, Yingchun
Hu, Sai
Li, Xueyong
Wang, Lei
Zhao, Bihai
author_sort Zhang, Zhihong
collection PubMed
description BACKGROUND: Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus mainly on the topological structure of PPI networks. However, those methods relying solely on the PPI network have low detection accuracy for essential proteins. Therefore, it is necessary to integrate the PPI network with other biological information to identify essential proteins. RESULTS: In this paper, we proposed a novel random walk method for identifying essential proteins, called HEPT. A three-dimensional tensor is constructed first by combining the PPI network of Saccharomyces cerevisiae with multiple biological data such as gene ontology annotations and protein domains. Then, based on the newly constructed tensor, we extended the Hyperlink-Induced Topic Search (HITS) algorithm from a two-dimensional to a three-dimensional tensor model that can be utilized to infer essential proteins. Different from existing state-of-the-art methods, the importance of proteins and the types of interactions will both contribute to the essential protein prediction. To evaluate the performance of our newly proposed HEPT method, proteins are ranked in the descending order based on their ranking scores computed by our method and other competitive methods. After that, a certain number of the ranked proteins are selected as candidates for essential proteins. According to the list of known essential proteins, the number of true essential proteins is used to judge the performance of each method. Experimental results show that our method can achieve better prediction performance in comparison with other nine state-of-the-art methods in identifying essential proteins. CONCLUSIONS: Through analysis and experimental results, it is obvious that HEPT can be used to effectively improve the prediction accuracy of essential proteins by the use of HITS algorithm and the combination of network topology with gene ontology annotations and protein domains, which provides a new insight into multi-data source fusion.
format Online
Article
Text
id pubmed-7137323
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71373232020-04-11 A novel method to predict essential proteins based on tensor and HITS algorithm Zhang, Zhihong Luo, Yingchun Hu, Sai Li, Xueyong Wang, Lei Zhao, Bihai Hum Genomics Primary Research BACKGROUND: Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus mainly on the topological structure of PPI networks. However, those methods relying solely on the PPI network have low detection accuracy for essential proteins. Therefore, it is necessary to integrate the PPI network with other biological information to identify essential proteins. RESULTS: In this paper, we proposed a novel random walk method for identifying essential proteins, called HEPT. A three-dimensional tensor is constructed first by combining the PPI network of Saccharomyces cerevisiae with multiple biological data such as gene ontology annotations and protein domains. Then, based on the newly constructed tensor, we extended the Hyperlink-Induced Topic Search (HITS) algorithm from a two-dimensional to a three-dimensional tensor model that can be utilized to infer essential proteins. Different from existing state-of-the-art methods, the importance of proteins and the types of interactions will both contribute to the essential protein prediction. To evaluate the performance of our newly proposed HEPT method, proteins are ranked in the descending order based on their ranking scores computed by our method and other competitive methods. After that, a certain number of the ranked proteins are selected as candidates for essential proteins. According to the list of known essential proteins, the number of true essential proteins is used to judge the performance of each method. Experimental results show that our method can achieve better prediction performance in comparison with other nine state-of-the-art methods in identifying essential proteins. CONCLUSIONS: Through analysis and experimental results, it is obvious that HEPT can be used to effectively improve the prediction accuracy of essential proteins by the use of HITS algorithm and the combination of network topology with gene ontology annotations and protein domains, which provides a new insight into multi-data source fusion. BioMed Central 2020-04-06 /pmc/articles/PMC7137323/ /pubmed/32252824 http://dx.doi.org/10.1186/s40246-020-00263-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Primary Research
Zhang, Zhihong
Luo, Yingchun
Hu, Sai
Li, Xueyong
Wang, Lei
Zhao, Bihai
A novel method to predict essential proteins based on tensor and HITS algorithm
title A novel method to predict essential proteins based on tensor and HITS algorithm
title_full A novel method to predict essential proteins based on tensor and HITS algorithm
title_fullStr A novel method to predict essential proteins based on tensor and HITS algorithm
title_full_unstemmed A novel method to predict essential proteins based on tensor and HITS algorithm
title_short A novel method to predict essential proteins based on tensor and HITS algorithm
title_sort novel method to predict essential proteins based on tensor and hits algorithm
topic Primary Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7137323/
https://www.ncbi.nlm.nih.gov/pubmed/32252824
http://dx.doi.org/10.1186/s40246-020-00263-7
work_keys_str_mv AT zhangzhihong anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT luoyingchun anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT husai anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT lixueyong anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT wanglei anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT zhaobihai anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT zhangzhihong novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT luoyingchun novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT husai novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT lixueyong novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT wanglei novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT zhaobihai novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm