Cargando…

Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning

Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Ping, Xiao, Qianghua, Wei, Pi-Jing, Liao, Bo, Wu, Fang-Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454152/
https://www.ncbi.nlm.nih.gov/pubmed/31001321
http://dx.doi.org/10.3389/fgene.2019.00270
_version_ 1783409517030014976
author Luo, Ping
Xiao, Qianghua
Wei, Pi-Jing
Liao, Bo
Wu, Fang-Xiang
author_facet Luo, Ping
Xiao, Qianghua
Wei, Pi-Jing
Liao, Bo
Wu, Fang-Xiang
author_sort Luo, Ping
collection PubMed
description Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce the associated cost and time. Most existing methods are disease-specific which can only predict genes associated with a specific disease at a time. Similarities among diseases are not used during the prediction. Meanwhile, most methods predict new disease genes based on known associations, making them unable to predict disease genes for diseases without known associated genes.In this study, a manifold learning-based method is proposed for predicting disease-gene associations by assuming that the geodesic distance between any disease and its associated genes should be shorter than that of other non-associated disease-gene pairs. The model maps the diseases and genes into a lower dimensional manifold based on the known disease-gene associations, disease similarity and gene similarity to predict new associations in terms of the geodesic distance between disease-gene pairs. In the 3-fold cross-validation experiments, our method achieves scores of 0.882 and 0.854 in terms of the area under of the receiver operating characteristic (ROC) curve (AUC) for diseases with more than one known associated genes and diseases with only one known associated gene, respectively. Further de novo studies on Lung Cancer and Bladder Cancer also show that our model is capable of identifying new disease-gene associations.
format Online
Article
Text
id pubmed-6454152
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64541522019-04-18 Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning Luo, Ping Xiao, Qianghua Wei, Pi-Jing Liao, Bo Wu, Fang-Xiang Front Genet Genetics Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce the associated cost and time. Most existing methods are disease-specific which can only predict genes associated with a specific disease at a time. Similarities among diseases are not used during the prediction. Meanwhile, most methods predict new disease genes based on known associations, making them unable to predict disease genes for diseases without known associated genes.In this study, a manifold learning-based method is proposed for predicting disease-gene associations by assuming that the geodesic distance between any disease and its associated genes should be shorter than that of other non-associated disease-gene pairs. The model maps the diseases and genes into a lower dimensional manifold based on the known disease-gene associations, disease similarity and gene similarity to predict new associations in terms of the geodesic distance between disease-gene pairs. In the 3-fold cross-validation experiments, our method achieves scores of 0.882 and 0.854 in terms of the area under of the receiver operating characteristic (ROC) curve (AUC) for diseases with more than one known associated genes and diseases with only one known associated gene, respectively. Further de novo studies on Lung Cancer and Bladder Cancer also show that our model is capable of identifying new disease-gene associations. Frontiers Media S.A. 2019-04-02 /pmc/articles/PMC6454152/ /pubmed/31001321 http://dx.doi.org/10.3389/fgene.2019.00270 Text en Copyright © 2019 Luo, Xiao, Wei, Liao and Wu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Luo, Ping
Xiao, Qianghua
Wei, Pi-Jing
Liao, Bo
Wu, Fang-Xiang
Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title_full Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title_fullStr Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title_full_unstemmed Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title_short Identifying Disease-Gene Associations With Graph-Regularized Manifold Learning
title_sort identifying disease-gene associations with graph-regularized manifold learning
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454152/
https://www.ncbi.nlm.nih.gov/pubmed/31001321
http://dx.doi.org/10.3389/fgene.2019.00270
work_keys_str_mv AT luoping identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning
AT xiaoqianghua identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning
AT weipijing identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning
AT liaobo identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning
AT wufangxiang identifyingdiseasegeneassociationswithgraphregularizedmanifoldlearning