Cargando…

A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations

The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities hav...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Xingli, Gao, Lin, Wei, Chunshui, Yang, Xiaofei, Zhao, Yi, Dong, Anguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3166294/
https://www.ncbi.nlm.nih.gov/pubmed/21912671
http://dx.doi.org/10.1371/journal.pone.0024171
_version_ 1782211144401813504
author Guo, Xingli
Gao, Lin
Wei, Chunshui
Yang, Xiaofei
Zhao, Yi
Dong, Anguo
author_facet Guo, Xingli
Gao, Lin
Wei, Chunshui
Yang, Xiaofei
Zhao, Yi
Dong, Anguo
author_sort Guo, Xingli
collection PubMed
description The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities have shown their power in tackling the issue. In this paper, a novel systematic and global method that integrates two heterogeneous networks for prioritizing candidate disease-causing genes is provided, based on the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein interactions. In this method, the association score function between a query disease and a candidate gene is defined as the weighted sum of all the association scores between similar diseases and neighbouring genes. Moreover, the topological correlation of these two heterogeneous networks can be incorporated into the definition of the score function, and finally an iterative algorithm is designed for this issue. This method was tested with 10-fold cross-validation on all 1,126 diseases that have at least a known causal gene, and it ranked the correct gene as one of the top ten in 622 of all the 1,428 cases, significantly outperforming a state-of-the-art method called PRINCE. The results brought about by this method were applied to study three multi-factorial disorders: breast cancer, Alzheimer disease and diabetes mellitus type 2, and some suggestions of novel causal genes and candidate disease-causing subnetworks were provided for further investigation.
format Online
Article
Text
id pubmed-3166294
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31662942011-09-12 A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations Guo, Xingli Gao, Lin Wei, Chunshui Yang, Xiaofei Zhao, Yi Dong, Anguo PLoS One Research Article The identification of disease-causing genes is a fundamental challenge in human health and of great importance in improving medical care, and provides a better understanding of gene functions. Recent computational approaches based on the interactions among human proteins and disease similarities have shown their power in tackling the issue. In this paper, a novel systematic and global method that integrates two heterogeneous networks for prioritizing candidate disease-causing genes is provided, based on the observation that genes causing the same or similar diseases tend to lie close to one another in a network of protein-protein interactions. In this method, the association score function between a query disease and a candidate gene is defined as the weighted sum of all the association scores between similar diseases and neighbouring genes. Moreover, the topological correlation of these two heterogeneous networks can be incorporated into the definition of the score function, and finally an iterative algorithm is designed for this issue. This method was tested with 10-fold cross-validation on all 1,126 diseases that have at least a known causal gene, and it ranked the correct gene as one of the top ten in 622 of all the 1,428 cases, significantly outperforming a state-of-the-art method called PRINCE. The results brought about by this method were applied to study three multi-factorial disorders: breast cancer, Alzheimer disease and diabetes mellitus type 2, and some suggestions of novel causal genes and candidate disease-causing subnetworks were provided for further investigation. Public Library of Science 2011-09-02 /pmc/articles/PMC3166294/ /pubmed/21912671 http://dx.doi.org/10.1371/journal.pone.0024171 Text en Guo et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Guo, Xingli
Gao, Lin
Wei, Chunshui
Yang, Xiaofei
Zhao, Yi
Dong, Anguo
A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title_full A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title_fullStr A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title_full_unstemmed A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title_short A Computational Method Based on the Integration of Heterogeneous Networks for Predicting Disease-Gene Associations
title_sort computational method based on the integration of heterogeneous networks for predicting disease-gene associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3166294/
https://www.ncbi.nlm.nih.gov/pubmed/21912671
http://dx.doi.org/10.1371/journal.pone.0024171
work_keys_str_mv AT guoxingli acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT gaolin acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT weichunshui acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT yangxiaofei acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT zhaoyi acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT donganguo acomputationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT guoxingli computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT gaolin computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT weichunshui computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT yangxiaofei computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT zhaoyi computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations
AT donganguo computationalmethodbasedontheintegrationofheterogeneousnetworksforpredictingdiseasegeneassociations