Cargando…

Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model

BACKGROUND: Accurately prioritizing candidate disease genes is an important and challenging problem. Various network-based methods have been developed to predict potential disease genes by utilizing the disease similarity network and molecular networks such as protein interaction or gene co-expressi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ni, Jingchao, Koyuturk, Mehmet, Tong, Hanghang, Haines, Jonathan, Xu, Rong, Zhang, Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103411/
https://www.ncbi.nlm.nih.gov/pubmed/27829360
http://dx.doi.org/10.1186/s12859-016-1317-x
_version_ 1782466585214058496
author Ni, Jingchao
Koyuturk, Mehmet
Tong, Hanghang
Haines, Jonathan
Xu, Rong
Zhang, Xiang
author_facet Ni, Jingchao
Koyuturk, Mehmet
Tong, Hanghang
Haines, Jonathan
Xu, Rong
Zhang, Xiang
author_sort Ni, Jingchao
collection PubMed
description BACKGROUND: Accurately prioritizing candidate disease genes is an important and challenging problem. Various network-based methods have been developed to predict potential disease genes by utilizing the disease similarity network and molecular networks such as protein interaction or gene co-expression networks. Although successful, a common limitation of the existing methods is that they assume all diseases share the same molecular network and a single generic molecular network is used to predict candidate genes for all diseases. However, different diseases tend to manifest in different tissues, and the molecular networks in different tissues are usually different. An ideal method should be able to incorporate tissue-specific molecular networks for different diseases. RESULTS: In this paper, we develop a robust and flexible method to integrate tissue-specific molecular networks for disease gene prioritization. Our method allows each disease to have its own tissue-specific network(s). We formulate the problem of candidate gene prioritization as an optimization problem based on network propagation. When there are multiple tissue-specific networks available for a disease, our method can automatically infer the relative importance of each tissue-specific network. Thus it is robust to the noisy and incomplete network data. To solve the optimization problem, we develop fast algorithms which have linear time complexities in the number of nodes in the molecular networks. We also provide rigorous theoretical foundations for our algorithms in terms of their optimality and convergence properties. Extensive experimental results show that our method can significantly improve the accuracy of candidate gene prioritization compared with the state-of-the-art methods. CONCLUSIONS: In our experiments, we compare our methods with 7 popular network-based disease gene prioritization algorithms on diseases from Online Mendelian Inheritance in Man (OMIM) database. The experimental results demonstrate that our methods recover true associations more accurately than other methods in terms of AUC values, and the performance differences are significant (with paired t-test p-values less than 0.05). This validates the importance to integrate tissue-specific molecular networks for studying disease gene prioritization and show the superiority of our network models and ranking algorithms toward this purpose. The source code and datasets are available at http://nijingchao.github.io/CRstar/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1317-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5103411
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-51034112016-11-10 Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model Ni, Jingchao Koyuturk, Mehmet Tong, Hanghang Haines, Jonathan Xu, Rong Zhang, Xiang BMC Bioinformatics Research Article BACKGROUND: Accurately prioritizing candidate disease genes is an important and challenging problem. Various network-based methods have been developed to predict potential disease genes by utilizing the disease similarity network and molecular networks such as protein interaction or gene co-expression networks. Although successful, a common limitation of the existing methods is that they assume all diseases share the same molecular network and a single generic molecular network is used to predict candidate genes for all diseases. However, different diseases tend to manifest in different tissues, and the molecular networks in different tissues are usually different. An ideal method should be able to incorporate tissue-specific molecular networks for different diseases. RESULTS: In this paper, we develop a robust and flexible method to integrate tissue-specific molecular networks for disease gene prioritization. Our method allows each disease to have its own tissue-specific network(s). We formulate the problem of candidate gene prioritization as an optimization problem based on network propagation. When there are multiple tissue-specific networks available for a disease, our method can automatically infer the relative importance of each tissue-specific network. Thus it is robust to the noisy and incomplete network data. To solve the optimization problem, we develop fast algorithms which have linear time complexities in the number of nodes in the molecular networks. We also provide rigorous theoretical foundations for our algorithms in terms of their optimality and convergence properties. Extensive experimental results show that our method can significantly improve the accuracy of candidate gene prioritization compared with the state-of-the-art methods. CONCLUSIONS: In our experiments, we compare our methods with 7 popular network-based disease gene prioritization algorithms on diseases from Online Mendelian Inheritance in Man (OMIM) database. The experimental results demonstrate that our methods recover true associations more accurately than other methods in terms of AUC values, and the performance differences are significant (with paired t-test p-values less than 0.05). This validates the importance to integrate tissue-specific molecular networks for studying disease gene prioritization and show the superiority of our network models and ranking algorithms toward this purpose. The source code and datasets are available at http://nijingchao.github.io/CRstar/. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1317-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-11-10 /pmc/articles/PMC5103411/ /pubmed/27829360 http://dx.doi.org/10.1186/s12859-016-1317-x Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ni, Jingchao
Koyuturk, Mehmet
Tong, Hanghang
Haines, Jonathan
Xu, Rong
Zhang, Xiang
Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title_full Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title_fullStr Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title_full_unstemmed Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title_short Disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
title_sort disease gene prioritization by integrating tissue-specific molecular networks using a robust multi-network model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5103411/
https://www.ncbi.nlm.nih.gov/pubmed/27829360
http://dx.doi.org/10.1186/s12859-016-1317-x
work_keys_str_mv AT nijingchao diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel
AT koyuturkmehmet diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel
AT tonghanghang diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel
AT hainesjonathan diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel
AT xurong diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel
AT zhangxiang diseasegeneprioritizationbyintegratingtissuespecificmolecularnetworksusingarobustmultinetworkmodel