Cargando…
Constructing a gene semantic similarity network for the inference of disease genes
MOTIVATION: The inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287482/ https://www.ncbi.nlm.nih.gov/pubmed/22784573 http://dx.doi.org/10.1186/1752-0509-5-S2-S2 |
_version_ | 1782224673244708864 |
---|---|
author | Jiang, Rui Gan, Mingxin He, Peng |
author_facet | Jiang, Rui Gan, Mingxin He, Peng |
author_sort | Jiang, Rui |
collection | PubMed |
description | MOTIVATION: The inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidate genes relying on protein-protein interaction (PPI) networks, these methods can usually cover less than half of known human genes. RESULTS: We propose to rely on the biological process domain of the gene ontology to construct a gene semantic similarity network and then use the network to infer disease genes. We show that the constructed network covers about 50% more genes than a typical PPI network. By analyzing the gene semantic similarity network with the PPI network, we show that gene pairs tend to have higher semantic similarity scores if the corresponding proteins are closer to each other in the PPI network. By analyzing the gene semantic similarity network with a phenotype similarity network, we show that semantic similarity scores of genes associated with similar diseases are significantly different from those of genes selected at random, and that genes with higher semantic similarity scores tend to be associated with diseases with higher phenotype similarity scores. We further use the gene semantic similarity network with a random walk with restart model to infer disease genes. Through a series of large-scale leave-one-out cross-validation experiments, we show that the gene semantic similarity network can achieve not only higher coverage but also higher accuracy than the PPI network in the inference of disease genes. CONTACT: ruijiang@tsinghua.edu.cn |
format | Online Article Text |
id | pubmed-3287482 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32874822012-02-28 Constructing a gene semantic similarity network for the inference of disease genes Jiang, Rui Gan, Mingxin He, Peng BMC Syst Biol Proceedings MOTIVATION: The inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidate genes relying on protein-protein interaction (PPI) networks, these methods can usually cover less than half of known human genes. RESULTS: We propose to rely on the biological process domain of the gene ontology to construct a gene semantic similarity network and then use the network to infer disease genes. We show that the constructed network covers about 50% more genes than a typical PPI network. By analyzing the gene semantic similarity network with the PPI network, we show that gene pairs tend to have higher semantic similarity scores if the corresponding proteins are closer to each other in the PPI network. By analyzing the gene semantic similarity network with a phenotype similarity network, we show that semantic similarity scores of genes associated with similar diseases are significantly different from those of genes selected at random, and that genes with higher semantic similarity scores tend to be associated with diseases with higher phenotype similarity scores. We further use the gene semantic similarity network with a random walk with restart model to infer disease genes. Through a series of large-scale leave-one-out cross-validation experiments, we show that the gene semantic similarity network can achieve not only higher coverage but also higher accuracy than the PPI network in the inference of disease genes. CONTACT: ruijiang@tsinghua.edu.cn BioMed Central 2011-12-14 /pmc/articles/PMC3287482/ /pubmed/22784573 http://dx.doi.org/10.1186/1752-0509-5-S2-S2 Text en Copyright ©2011 Jiang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Jiang, Rui Gan, Mingxin He, Peng Constructing a gene semantic similarity network for the inference of disease genes |
title | Constructing a gene semantic similarity network for the inference of disease genes |
title_full | Constructing a gene semantic similarity network for the inference of disease genes |
title_fullStr | Constructing a gene semantic similarity network for the inference of disease genes |
title_full_unstemmed | Constructing a gene semantic similarity network for the inference of disease genes |
title_short | Constructing a gene semantic similarity network for the inference of disease genes |
title_sort | constructing a gene semantic similarity network for the inference of disease genes |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287482/ https://www.ncbi.nlm.nih.gov/pubmed/22784573 http://dx.doi.org/10.1186/1752-0509-5-S2-S2 |
work_keys_str_mv | AT jiangrui constructingagenesemanticsimilaritynetworkfortheinferenceofdiseasegenes AT ganmingxin constructingagenesemanticsimilaritynetworkfortheinferenceofdiseasegenes AT hepeng constructingagenesemanticsimilaritynetworkfortheinferenceofdiseasegenes |