Cargando…

Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles

BACKGROUND: Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches wer...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Jie, Qin, Yufang, Liu, Taigang, Wang, Jun, Zheng, Xiaoqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622672/
https://www.ncbi.nlm.nih.gov/pubmed/23734762
http://dx.doi.org/10.1186/1471-2105-14-S5-S5
_version_ 1782265865531555840
author Zhu, Jie
Qin, Yufang
Liu, Taigang
Wang, Jun
Zheng, Xiaoqi
author_facet Zhu, Jie
Qin, Yufang
Liu, Taigang
Wang, Jun
Zheng, Xiaoqi
author_sort Zhu, Jie
collection PubMed
description BACKGROUND: Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. RESULTS: In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. CONCLUSIONS: Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. AVAILABILITY: Programs and data are available upon request.
format Online
Article
Text
id pubmed-3622672
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36226722013-04-15 Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles Zhu, Jie Qin, Yufang Liu, Taigang Wang, Jun Zheng, Xiaoqi BMC Bioinformatics Proceedings BACKGROUND: Identification of gene-phenotype relationships is a fundamental challenge in human health clinic. Based on the observation that genes causing the same or similar phenotypes tend to correlate with each other in the protein-protein interaction network, a lot of network-based approaches were proposed based on different underlying models. A recent comparative study showed that diffusion-based methods achieve the state-of-the-art predictive performance. RESULTS: In this paper, a new diffusion-based method was proposed to prioritize candidate disease genes. Diffusion profile of a disease was defined as the stationary distribution of candidate genes given a random walk with restart where similarities between phenotypes are incorporated. Then, candidate disease genes are prioritized by comparing their diffusion profiles with that of the disease. Finally, the effectiveness of our method was demonstrated through the leave-one-out cross-validation against control genes from artificial linkage intervals and randomly chosen genes. Comparative study showed that our method achieves improved performance compared to some classical diffusion-based methods. To further illustrate our method, we used our algorithm to predict new causing genes of 16 multifactorial diseases including Prostate cancer and Alzheimer's disease, and the top predictions were in good consistent with literature reports. CONCLUSIONS: Our study indicates that integration of multiple information sources, especially the phenotype similarity profile data, and introduction of global similarity measure between disease and gene diffusion profiles are helpful for prioritizing candidate disease genes. AVAILABILITY: Programs and data are available upon request. BioMed Central 2013-04-10 /pmc/articles/PMC3622672/ /pubmed/23734762 http://dx.doi.org/10.1186/1471-2105-14-S5-S5 Text en Copyright © 2013 Zhu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Zhu, Jie
Qin, Yufang
Liu, Taigang
Wang, Jun
Zheng, Xiaoqi
Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title_full Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title_fullStr Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title_full_unstemmed Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title_short Prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
title_sort prioritization of candidate disease genes by topological similarity between disease and protein diffusion profiles
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3622672/
https://www.ncbi.nlm.nih.gov/pubmed/23734762
http://dx.doi.org/10.1186/1471-2105-14-S5-S5
work_keys_str_mv AT zhujie prioritizationofcandidatediseasegenesbytopologicalsimilaritybetweendiseaseandproteindiffusionprofiles
AT qinyufang prioritizationofcandidatediseasegenesbytopologicalsimilaritybetweendiseaseandproteindiffusionprofiles
AT liutaigang prioritizationofcandidatediseasegenesbytopologicalsimilaritybetweendiseaseandproteindiffusionprofiles
AT wangjun prioritizationofcandidatediseasegenesbytopologicalsimilaritybetweendiseaseandproteindiffusionprofiles
AT zhengxiaoqi prioritizationofcandidatediseasegenesbytopologicalsimilaritybetweendiseaseandproteindiffusionprofiles