Cargando…
Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454152/ https://www.ncbi.nlm.nih.gov/pubmed/31001321 http://dx.doi.org/10.3389/fgene.2019.00270 |
_version_ | 1783409517030014976 |
---|---|
author | Luo, Ping Xiao, Qianghua Wei, Pi-Jing Liao, Bo Wu, Fang-Xiang |
author_facet | Luo, Ping Xiao, Qianghua Wei, Pi-Jing Liao, Bo Wu, Fang-Xiang |
author_sort | Luo, Ping |
collection | PubMed |
description | Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce the associated cost and time. Most existing methods are disease-specific which can only predict genes associated with a specific disease at a time. Similarities among diseases are not used during the prediction. Meanwhile, most methods predict new disease genes based on known associations, making them unable to predict disease genes for diseases without known associated genes.In this study, a manifold learning-based method is proposed for predicting disease-gene associations by assuming that the geodesic distance between any disease and its associated genes should be shorter than that of other non-associated disease-gene pairs. The model maps the diseases and genes into a lower dimensional manifold based on the known disease-gene associations, disease similarity and gene similarity to predict new associations in terms of the geodesic distance between disease-gene pairs. In the 3-fold cross-validation experiments, our method achieves scores of 0.882 and 0.854 in terms of the area under of the receiver operating characteristic (ROC) curve (AUC) for diseases with more than one known associated genes and diseases with only one known associated gene, respectively. Further de novo studies on Lung Cancer and Bladder Cancer also show that our model is capable of identifying new disease-gene associations. |
format | Online Article Text |
id | pubmed-6454152 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64541522019-04-18 Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning Luo, Ping Xiao, Qianghua Wei, Pi-Jing Liao, Bo Wu, Fang-Xiang Front Genet Genetics Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce the associated cost and time. Most existing methods are disease-specific which can only predict genes associated with a specific disease at a time. Similarities among diseases are not used during the prediction. Meanwhile, most methods predict new disease genes based on known associations, making them unable to predict disease genes for diseases without known associated genes.In this study, a manifold learning-based method is proposed for predicting disease-gene associations by assuming that the geodesic distance between any disease and its associated genes should be shorter than that of other non-associated disease-gene pairs. The model maps the diseases and genes into a lower dimensional manifold based on the known disease-gene associations, disease similarity and gene similarity to predict new associations in terms of the geodesic distance between disease-gene pairs. In the 3-fold cross-validation experiments, our method achieves scores of 0.882 and 0.854 in terms of the area under of the receiver operating characteristic (ROC) curve (AUC) for diseases with more than one known associated genes and diseases with only one known associated gene, respectively. Further de novo studies on Lung Cancer and Bladder Cancer also show that our model is capable of identifying new disease-gene associations. Frontiers Media S.A. 2019-04-02 /pmc/articles/PMC6454152/ /pubmed/31001321 http://dx.doi.org/10.3389/fgene.2019.00270 Text en Copyright © 2019 Luo, Xiao, Wei, Liao and Wu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Luo, Ping Xiao, Qianghua Wei, Pi-Jing Liao, Bo Wu, Fang-Xiang Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title | Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title_full | Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title_fullStr | Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title_full_unstemmed | Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title_short | Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning |
title_sort | identifying disease-gene associations with graph-regularized manifold learning |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454152/ https://www.ncbi.nlm.nih.gov/pubmed/31001321 http://dx.doi.org/10.3389/fgene.2019.00270 |
work_keys_str_mv | AT luoping identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning AT xiaoqianghua identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning AT weipijing identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning AT liaobo identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning AT wufangxiang identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning |