Cargando…

A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs

BACKGROUND: Accumulating evidence has demonstrated that long non-coding RNAs (lncRNAs) are closely associated with human diseases, and it is useful for the diagnosis and treatment of diseases to get the relationships between lncRNAs and diseases. Due to the high costs and time complexity of traditio...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Yubin, Xiao, Zheng, Feng, Xiang, Chen, Zhiping, Kuang, Linai, Wang, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7709313/
https://www.ncbi.nlm.nih.gov/pubmed/33267800
http://dx.doi.org/10.1186/s12859-020-03906-7
_version_ 1783617723127824384
author Xiao, Yubin
Xiao, Zheng
Feng, Xiang
Chen, Zhiping
Kuang, Linai
Wang, Lei
author_facet Xiao, Yubin
Xiao, Zheng
Feng, Xiang
Chen, Zhiping
Kuang, Linai
Wang, Lei
author_sort Xiao, Yubin
collection PubMed
description BACKGROUND: Accumulating evidence has demonstrated that long non-coding RNAs (lncRNAs) are closely associated with human diseases, and it is useful for the diagnosis and treatment of diseases to get the relationships between lncRNAs and diseases. Due to the high costs and time complexity of traditional bio-experiments, in recent years, more and more computational methods have been proposed by researchers to infer potential lncRNA-disease associations. However, there exist all kinds of limitations in these state-of-the-art prediction methods as well. RESULTS: In this manuscript, a novel computational model named FVTLDA is proposed to infer potential lncRNA-disease associations. In FVTLDA, its major novelty lies in the integration of direct and indirect features related to lncRNA-disease associations such as the feature vectors of lncRNA-disease pairs and their corresponding association probability fractions, which guarantees that FVTLDA can be utilized to predict diseases without known related-lncRNAs and lncRNAs without known related-diseases. Moreover, FVTLDA neither relies solely on known lncRNA-disease nor requires any negative samples, which guarantee that it can infer potential lncRNA-disease associations more equitably and effectively than traditional state-of-the-art prediction methods. Additionally, to avoid the limitations of single model prediction techniques, we combine FVTLDA with the Multiple Linear Regression (MLR) and the Artificial Neural Network (ANN) for data analysis respectively. Simulation experiment results show that FVTLDA with MLR can achieve reliable AUCs of 0.8909, 0.8936 and 0.8970 in 5-Fold Cross Validation (fivefold CV), 10-Fold Cross Validation (tenfold CV) and Leave-One-Out Cross Validation (LOOCV), separately, while FVTLDA with ANN can achieve reliable AUCs of 0.8766, 0.8830 and 0.8807 in fivefold CV, tenfold CV, and LOOCV respectively. Furthermore, in case studies of gastric cancer, leukemia and lung cancer, experiment results show that there are 8, 8 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with MLR, and 8, 7 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with ANN, having been verified by recent literature. Comparing with the representative prediction model of KATZLDA, comparison results illustrate that FVTLDA with MLR and FVTLDA with ANN can achieve the average case study contrast scores of 0.8429 and 0.8515 respectively, which are both notably higher than the average case study contrast score of 0.6375 achieved by KATZLDA. CONCLUSION: The simulation results show that FVTLDA has good prediction performance, which is a good supplement to future bioinformatics research.
format Online
Article
Text
id pubmed-7709313
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77093132020-12-02 A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs Xiao, Yubin Xiao, Zheng Feng, Xiang Chen, Zhiping Kuang, Linai Wang, Lei BMC Bioinformatics Methodology Article BACKGROUND: Accumulating evidence has demonstrated that long non-coding RNAs (lncRNAs) are closely associated with human diseases, and it is useful for the diagnosis and treatment of diseases to get the relationships between lncRNAs and diseases. Due to the high costs and time complexity of traditional bio-experiments, in recent years, more and more computational methods have been proposed by researchers to infer potential lncRNA-disease associations. However, there exist all kinds of limitations in these state-of-the-art prediction methods as well. RESULTS: In this manuscript, a novel computational model named FVTLDA is proposed to infer potential lncRNA-disease associations. In FVTLDA, its major novelty lies in the integration of direct and indirect features related to lncRNA-disease associations such as the feature vectors of lncRNA-disease pairs and their corresponding association probability fractions, which guarantees that FVTLDA can be utilized to predict diseases without known related-lncRNAs and lncRNAs without known related-diseases. Moreover, FVTLDA neither relies solely on known lncRNA-disease nor requires any negative samples, which guarantee that it can infer potential lncRNA-disease associations more equitably and effectively than traditional state-of-the-art prediction methods. Additionally, to avoid the limitations of single model prediction techniques, we combine FVTLDA with the Multiple Linear Regression (MLR) and the Artificial Neural Network (ANN) for data analysis respectively. Simulation experiment results show that FVTLDA with MLR can achieve reliable AUCs of 0.8909, 0.8936 and 0.8970 in 5-Fold Cross Validation (fivefold CV), 10-Fold Cross Validation (tenfold CV) and Leave-One-Out Cross Validation (LOOCV), separately, while FVTLDA with ANN can achieve reliable AUCs of 0.8766, 0.8830 and 0.8807 in fivefold CV, tenfold CV, and LOOCV respectively. Furthermore, in case studies of gastric cancer, leukemia and lung cancer, experiment results show that there are 8, 8 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with MLR, and 8, 7 and 8 out of top 10 candidate lncRNAs predicted by FVTLDA with ANN, having been verified by recent literature. Comparing with the representative prediction model of KATZLDA, comparison results illustrate that FVTLDA with MLR and FVTLDA with ANN can achieve the average case study contrast scores of 0.8429 and 0.8515 respectively, which are both notably higher than the average case study contrast score of 0.6375 achieved by KATZLDA. CONCLUSION: The simulation results show that FVTLDA has good prediction performance, which is a good supplement to future bioinformatics research. BioMed Central 2020-12-02 /pmc/articles/PMC7709313/ /pubmed/33267800 http://dx.doi.org/10.1186/s12859-020-03906-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Xiao, Yubin
Xiao, Zheng
Feng, Xiang
Chen, Zhiping
Kuang, Linai
Wang, Lei
A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title_full A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title_fullStr A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title_full_unstemmed A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title_short A novel computational model for predicting potential LncRNA-disease associations based on both direct and indirect features of LncRNA-disease pairs
title_sort novel computational model for predicting potential lncrna-disease associations based on both direct and indirect features of lncrna-disease pairs
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7709313/
https://www.ncbi.nlm.nih.gov/pubmed/33267800
http://dx.doi.org/10.1186/s12859-020-03906-7
work_keys_str_mv AT xiaoyubin anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT xiaozheng anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT fengxiang anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT chenzhiping anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT kuanglinai anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT wanglei anovelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT xiaoyubin novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT xiaozheng novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT fengxiang novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT chenzhiping novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT kuanglinai novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs
AT wanglei novelcomputationalmodelforpredictingpotentiallncrnadiseaseassociationsbasedonbothdirectandindirectfeaturesoflncrnadiseasepairs