Cargando…

Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification

Essential genes are a group of genes that are indispensable for cell survival and cell fertility. Studying human essential genes helps scientists reveal the underlying biological mechanisms of a human cell but also guides disease treatment. Recently, the publication of human essential gene data make...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Wei, Chang, Qi, Peng, Wei, Zhong, Jiancheng, Li, Yongjiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7074227/
https://www.ncbi.nlm.nih.gov/pubmed/32023848
http://dx.doi.org/10.3390/genes11020153
_version_ 1783506785199456256
author Dai, Wei
Chang, Qi
Peng, Wei
Zhong, Jiancheng
Li, Yongjiang
author_facet Dai, Wei
Chang, Qi
Peng, Wei
Zhong, Jiancheng
Li, Yongjiang
author_sort Dai, Wei
collection PubMed
description Essential genes are a group of genes that are indispensable for cell survival and cell fertility. Studying human essential genes helps scientists reveal the underlying biological mechanisms of a human cell but also guides disease treatment. Recently, the publication of human essential gene data makes it possible for researchers to train a machine-learning classifier by using some features of the known human essential genes and to use the classifier to predict new human essential genes. Previous studies have found that the essentiality of genes closely relates to their properties in the protein–protein interaction (PPI) network. In this work, we propose a novel supervised method to predict human essential genes by network embedding the PPI network. Our approach implements a bias random walk on the network to get the node network context. Then, the node pairs are input into an artificial neural network to learn their representation vectors that maximally preserves network structure and the properties of the nodes in the network. Finally, the features are put into an SVM classifier to predict human essential genes. The prediction results on two human PPI networks show that our method achieves better performance than those that refer to either genes’ sequence information or genes’ centrality properties in the network as input features. Moreover, it also outperforms the methods that represent the PPI network by other previous approaches.
format Online
Article
Text
id pubmed-7074227
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70742272020-03-19 Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification Dai, Wei Chang, Qi Peng, Wei Zhong, Jiancheng Li, Yongjiang Genes (Basel) Article Essential genes are a group of genes that are indispensable for cell survival and cell fertility. Studying human essential genes helps scientists reveal the underlying biological mechanisms of a human cell but also guides disease treatment. Recently, the publication of human essential gene data makes it possible for researchers to train a machine-learning classifier by using some features of the known human essential genes and to use the classifier to predict new human essential genes. Previous studies have found that the essentiality of genes closely relates to their properties in the protein–protein interaction (PPI) network. In this work, we propose a novel supervised method to predict human essential genes by network embedding the PPI network. Our approach implements a bias random walk on the network to get the node network context. Then, the node pairs are input into an artificial neural network to learn their representation vectors that maximally preserves network structure and the properties of the nodes in the network. Finally, the features are put into an SVM classifier to predict human essential genes. The prediction results on two human PPI networks show that our method achieves better performance than those that refer to either genes’ sequence information or genes’ centrality properties in the network as input features. Moreover, it also outperforms the methods that represent the PPI network by other previous approaches. MDPI 2020-01-31 /pmc/articles/PMC7074227/ /pubmed/32023848 http://dx.doi.org/10.3390/genes11020153 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Dai, Wei
Chang, Qi
Peng, Wei
Zhong, Jiancheng
Li, Yongjiang
Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title_full Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title_fullStr Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title_full_unstemmed Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title_short Network Embedding the Protein–Protein Interaction Network for Human Essential Genes Identification
title_sort network embedding the protein–protein interaction network for human essential genes identification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7074227/
https://www.ncbi.nlm.nih.gov/pubmed/32023848
http://dx.doi.org/10.3390/genes11020153
work_keys_str_mv AT daiwei networkembeddingtheproteinproteininteractionnetworkforhumanessentialgenesidentification
AT changqi networkembeddingtheproteinproteininteractionnetworkforhumanessentialgenesidentification
AT pengwei networkembeddingtheproteinproteininteractionnetworkforhumanessentialgenesidentification
AT zhongjiancheng networkembeddingtheproteinproteininteractionnetworkforhumanessentialgenesidentification
AT liyongjiang networkembeddingtheproteinproteininteractionnetworkforhumanessentialgenesidentification