Cargando…

A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins

Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhu, Xianyou, He, Xin, Kuang, Linai, Chen, Zhiping, Lancine, Camara
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8566338/ https://www.ncbi.nlm.nih.gov/pubmed/34745230 http://dx.doi.org/10.3389/fgene.2021.763153

_version_	1784593988828790784
author	Zhu, Xianyou He, Xin Kuang, Linai Chen, Zhiping Lancine, Camara
author_facet	Zhu, Xianyou He, Xin Kuang, Linai Chen, Zhiping Lancine, Camara
author_sort	Zhu, Xianyou
collection	PubMed
description	Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated protein–domain interaction (PDI) network was constructed first by applying collaborative filtering algorithm on the original PDI network, and then, through integrating topological features of PDI networks with biological features of proteins, a calculative method was designed to infer potential essential proteins based on an improved PageRank algorithm. The novelties of CFMM lie in construction of an updated PDI network, application of the commodity-customer-based collaborative filtering algorithm, and introduction of the calculation method based on an improved PageRank algorithm, which ensured that CFMM can be applied to predict essential proteins without relying entirely on known protein–domain associations. Simulation results showed that CFMM can achieve reliable prediction accuracies of 92.16, 83.14, 71.37, 63.87, 55.84, and 52.43% in the top 1, 5, 10, 15, 20, and 25% predicted candidate key proteins based on the DIP database, which are remarkably higher than 14 competitive state-of-the-art predictive models as a whole, and in addition, CFMM can achieve satisfactory predictive performances based on different databases with various evaluation measurements, which further indicated that CFMM may be a useful tool for the identification of essential proteins in the future.
format	Online Article Text
id	pubmed-8566338
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-85663382021-11-05 A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins Zhu, Xianyou He, Xin Kuang, Linai Chen, Zhiping Lancine, Camara Front Genet Genetics Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated protein–domain interaction (PDI) network was constructed first by applying collaborative filtering algorithm on the original PDI network, and then, through integrating topological features of PDI networks with biological features of proteins, a calculative method was designed to infer potential essential proteins based on an improved PageRank algorithm. The novelties of CFMM lie in construction of an updated PDI network, application of the commodity-customer-based collaborative filtering algorithm, and introduction of the calculation method based on an improved PageRank algorithm, which ensured that CFMM can be applied to predict essential proteins without relying entirely on known protein–domain associations. Simulation results showed that CFMM can achieve reliable prediction accuracies of 92.16, 83.14, 71.37, 63.87, 55.84, and 52.43% in the top 1, 5, 10, 15, 20, and 25% predicted candidate key proteins based on the DIP database, which are remarkably higher than 14 competitive state-of-the-art predictive models as a whole, and in addition, CFMM can achieve satisfactory predictive performances based on different databases with various evaluation measurements, which further indicated that CFMM may be a useful tool for the identification of essential proteins in the future. Frontiers Media S.A. 2021-10-21 /pmc/articles/PMC8566338/ /pubmed/34745230 http://dx.doi.org/10.3389/fgene.2021.763153 Text en Copyright © 2021 Zhu, He, Kuang, Chen and Lancine. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Zhu, Xianyou He, Xin Kuang, Linai Chen, Zhiping Lancine, Camara A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title	A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title_full	A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title_fullStr	A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title_full_unstemmed	A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title_short	A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins
title_sort	novel collaborative filtering model-based method for identifying essential proteins
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8566338/ https://www.ncbi.nlm.nih.gov/pubmed/34745230 http://dx.doi.org/10.3389/fgene.2021.763153
work_keys_str_mv	AT zhuxianyou anovelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT hexin anovelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT kuanglinai anovelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT chenzhiping anovelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT lancinecamara anovelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT zhuxianyou novelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT hexin novelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT kuanglinai novelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT chenzhiping novelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins AT lancinecamara novelcollaborativefilteringmodelbasedmethodforidentifyingessentialproteins

A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins

Ejemplares similares