Cargando…

A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations

BACKGROUND: In recent years, lncRNAs (long-non-coding RNAs) have been proved to be closely related to the occurrence and development of many serious diseases that are seriously harmful to human health. However, most of the lncRNA-disease associations have not been found yet due to high costs and tim...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jiechen, Li, Xueyong, Feng, Xiang, Wang, Bing, Zhao, Bihai, Wang, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889579/
https://www.ncbi.nlm.nih.gov/pubmed/31795943
http://dx.doi.org/10.1186/s12859-019-3216-4
_version_ 1783475448973361152
author Li, Jiechen
Li, Xueyong
Feng, Xiang
Wang, Bing
Zhao, Bihai
Wang, Lei
author_facet Li, Jiechen
Li, Xueyong
Feng, Xiang
Wang, Bing
Zhao, Bihai
Wang, Lei
author_sort Li, Jiechen
collection PubMed
description BACKGROUND: In recent years, lncRNAs (long-non-coding RNAs) have been proved to be closely related to the occurrence and development of many serious diseases that are seriously harmful to human health. However, most of the lncRNA-disease associations have not been found yet due to high costs and time complexity of traditional bio-experiments. Hence, it is quite urgent and necessary to establish efficient and reasonable computational models to predict potential associations between lncRNAs and diseases. RESULTS: In this manuscript, a novel prediction model called TCSRWRLD is proposed to predict potential lncRNA-disease associations based on improved random walk with restart. In TCSRWRLD, a heterogeneous lncRNA-disease network is constructed first by combining the integrated similarity of lncRNAs and the integrated similarity of diseases. And then, for each lncRNA/disease node in the newly constructed heterogeneous lncRNA-disease network, it will establish a node set called TCS (Target Convergence Set) consisting of top 100 disease/lncRNA nodes with minimum average network distances to these disease/lncRNA nodes having known associations with itself. Finally, an improved random walk with restart is implemented on the heterogeneous lncRNA-disease network to infer potential lncRNA-disease associations. The major contribution of this manuscript lies in the introduction of the concept of TCS, based on which, the velocity of convergence of TCSRWRLD can be quicken effectively, since the walker can stop its random walk while the walking probability vectors obtained by it at the nodes in TCS instead of all nodes in the whole network have reached stable state. And Simulation results show that TCSRWRLD can achieve a reliable AUC of 0.8712 in the Leave-One-Out Cross Validation (LOOCV), which outperforms previous state-of-the-art results apparently. Moreover, case studies of lung cancer and leukemia demonstrate the satisfactory prediction performance of TCSRWRLD as well. CONCLUSIONS: Both comparative results and case studies have demonstrated that TCSRWRLD can achieve excellent performances in prediction of potential lncRNA-disease associations, which imply as well that TCSRWRLD may be a good addition to the research of bioinformatics in the future.
format Online
Article
Text
id pubmed-6889579
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68895792019-12-11 A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations Li, Jiechen Li, Xueyong Feng, Xiang Wang, Bing Zhao, Bihai Wang, Lei BMC Bioinformatics Research Article BACKGROUND: In recent years, lncRNAs (long-non-coding RNAs) have been proved to be closely related to the occurrence and development of many serious diseases that are seriously harmful to human health. However, most of the lncRNA-disease associations have not been found yet due to high costs and time complexity of traditional bio-experiments. Hence, it is quite urgent and necessary to establish efficient and reasonable computational models to predict potential associations between lncRNAs and diseases. RESULTS: In this manuscript, a novel prediction model called TCSRWRLD is proposed to predict potential lncRNA-disease associations based on improved random walk with restart. In TCSRWRLD, a heterogeneous lncRNA-disease network is constructed first by combining the integrated similarity of lncRNAs and the integrated similarity of diseases. And then, for each lncRNA/disease node in the newly constructed heterogeneous lncRNA-disease network, it will establish a node set called TCS (Target Convergence Set) consisting of top 100 disease/lncRNA nodes with minimum average network distances to these disease/lncRNA nodes having known associations with itself. Finally, an improved random walk with restart is implemented on the heterogeneous lncRNA-disease network to infer potential lncRNA-disease associations. The major contribution of this manuscript lies in the introduction of the concept of TCS, based on which, the velocity of convergence of TCSRWRLD can be quicken effectively, since the walker can stop its random walk while the walking probability vectors obtained by it at the nodes in TCS instead of all nodes in the whole network have reached stable state. And Simulation results show that TCSRWRLD can achieve a reliable AUC of 0.8712 in the Leave-One-Out Cross Validation (LOOCV), which outperforms previous state-of-the-art results apparently. Moreover, case studies of lung cancer and leukemia demonstrate the satisfactory prediction performance of TCSRWRLD as well. CONCLUSIONS: Both comparative results and case studies have demonstrated that TCSRWRLD can achieve excellent performances in prediction of potential lncRNA-disease associations, which imply as well that TCSRWRLD may be a good addition to the research of bioinformatics in the future. BioMed Central 2019-12-03 /pmc/articles/PMC6889579/ /pubmed/31795943 http://dx.doi.org/10.1186/s12859-019-3216-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Li, Jiechen
Li, Xueyong
Feng, Xiang
Wang, Bing
Zhao, Bihai
Wang, Lei
A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title_full A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title_fullStr A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title_full_unstemmed A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title_short A novel target convergence set based random walk with restart for prediction of potential LncRNA-disease associations
title_sort novel target convergence set based random walk with restart for prediction of potential lncrna-disease associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889579/
https://www.ncbi.nlm.nih.gov/pubmed/31795943
http://dx.doi.org/10.1186/s12859-019-3216-4
work_keys_str_mv AT lijiechen anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT lixueyong anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT fengxiang anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT wangbing anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT zhaobihai anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT wanglei anoveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT lijiechen noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT lixueyong noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT fengxiang noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT wangbing noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT zhaobihai noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations
AT wanglei noveltargetconvergencesetbasedrandomwalkwithrestartforpredictionofpotentiallncrnadiseaseassociations