Cargando…

PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm

The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuab...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yan, Xiang, Ju, Tang, Liang, Yang, Jialiang, Li, Jianming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9895109/
https://www.ncbi.nlm.nih.gov/pubmed/36744177
http://dx.doi.org/10.3389/fgene.2022.1087784
_version_ 1784881880330403840
author Zhang, Yan
Xiang, Ju
Tang, Liang
Yang, Jialiang
Li, Jianming
author_facet Zhang, Yan
Xiang, Ju
Tang, Liang
Yang, Jialiang
Li, Jianming
author_sort Zhang, Yan
collection PubMed
description The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuable information from the data to predict pathogenic genes accurately and rapidly is currently a challenging and meaningful task. Therefore, we present a novel computational method called PGAGP for inferring potential pathogenic genes based on an adaptive network embedding algorithm. The PGAGP algorithm is to first extract initial features of nodes from a heterogeneous network of diseases and genes efficiently and effectively by Gaussian random projection and then optimize the features of nodes by an adaptive refining process. These low-dimensional features are used to improve the disease-gene heterogenous network, and we apply network propagation to the improved heterogenous network to predict pathogenic genes more effectively. By a series of experiments, we study the effect of PGAGP’s parameters and integrated strategies on predictive performance and confirm that PGAGP is better than the state-of-the-art algorithms. Case studies show that many of the predicted candidate genes for specific diseases have been implied to be related to these diseases by literature verification and enrichment analysis, which further verifies the effectiveness of PGAGP. Overall, this work provides a useful solution for mining disease-gene heterogeneous network to predict pathogenic genes more effectively.
format Online
Article
Text
id pubmed-9895109
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-98951092023-02-04 PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm Zhang, Yan Xiang, Ju Tang, Liang Yang, Jialiang Li, Jianming Front Genet Genetics The study of disease-gene associations is an important topic in the field of computational biology. The accumulation of massive amounts of biomedical data provides new possibilities for exploring potential relations between diseases and genes through computational strategy, but how to extract valuable information from the data to predict pathogenic genes accurately and rapidly is currently a challenging and meaningful task. Therefore, we present a novel computational method called PGAGP for inferring potential pathogenic genes based on an adaptive network embedding algorithm. The PGAGP algorithm is to first extract initial features of nodes from a heterogeneous network of diseases and genes efficiently and effectively by Gaussian random projection and then optimize the features of nodes by an adaptive refining process. These low-dimensional features are used to improve the disease-gene heterogenous network, and we apply network propagation to the improved heterogenous network to predict pathogenic genes more effectively. By a series of experiments, we study the effect of PGAGP’s parameters and integrated strategies on predictive performance and confirm that PGAGP is better than the state-of-the-art algorithms. Case studies show that many of the predicted candidate genes for specific diseases have been implied to be related to these diseases by literature verification and enrichment analysis, which further verifies the effectiveness of PGAGP. Overall, this work provides a useful solution for mining disease-gene heterogeneous network to predict pathogenic genes more effectively. Frontiers Media S.A. 2023-01-20 /pmc/articles/PMC9895109/ /pubmed/36744177 http://dx.doi.org/10.3389/fgene.2022.1087784 Text en Copyright © 2023 Zhang, Xiang, Tang, Yang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Yan
Xiang, Ju
Tang, Liang
Yang, Jialiang
Li, Jianming
PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title_full PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title_fullStr PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title_full_unstemmed PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title_short PGAGP: Predicting pathogenic genes based on adaptive network embedding algorithm
title_sort pgagp: predicting pathogenic genes based on adaptive network embedding algorithm
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9895109/
https://www.ncbi.nlm.nih.gov/pubmed/36744177
http://dx.doi.org/10.3389/fgene.2022.1087784
work_keys_str_mv AT zhangyan pgagppredictingpathogenicgenesbasedonadaptivenetworkembeddingalgorithm
AT xiangju pgagppredictingpathogenicgenesbasedonadaptivenetworkembeddingalgorithm
AT tangliang pgagppredictingpathogenicgenesbasedonadaptivenetworkembeddingalgorithm
AT yangjialiang pgagppredictingpathogenicgenesbasedonadaptivenetworkembeddingalgorithm
AT lijianming pgagppredictingpathogenicgenesbasedonadaptivenetworkembeddingalgorithm