Cargando…

A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest

Long non-coding RNA (lncRNA) play critical roles in the occurrence and development of various diseases. The determination of the lncRNA-disease associations thus would contribute to provide new insights into the pathogenesis of the disease, the diagnosis, and the gene treatments. Considering that tr...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Zhen-Hao, You, Zhu-Hong, Wang, Yan-Bin, Yi, Hai-Cheng, Chen, Zhan-Heng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6733997/
https://www.ncbi.nlm.nih.gov/pubmed/31494494
http://dx.doi.org/10.1016/j.isci.2019.08.030
_version_ 1783450072707497984
author Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Yi, Hai-Cheng
Chen, Zhan-Heng
author_facet Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Yi, Hai-Cheng
Chen, Zhan-Heng
author_sort Guo, Zhen-Hao
collection PubMed
description Long non-coding RNA (lncRNA) play critical roles in the occurrence and development of various diseases. The determination of the lncRNA-disease associations thus would contribute to provide new insights into the pathogenesis of the disease, the diagnosis, and the gene treatments. Considering that traditional experimental approaches are difficult to detect potential human lncRNA-disease associations from the vast amount of biological data, developing computational method could be of significant value. In this paper, we proposed a novel computational method named LDASR to identify associations between lncRNA and disease by analyzing known lncRNA-disease associations. First, the feature vectors of the lncRNA-disease pairs were obtained by integrating lncRNA Gaussian interaction profile kernel similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. Second, autoencoder neural network was employed to reduce the feature dimension and get the optimal feature subspace from the original feature set. Finally, Rotating Forest was used to carry out prediction of lncRNA-disease association. The proposed method achieves an excellent preference with 0.9502 AUC in leave-one-out cross-validations (LOOCV) and 0.9428 AUC in 5-fold cross-validation, which significantly outperformed previous methods. Moreover, two kinds of case studies on identifying lncRNAs associated with colorectal cancer and glioma further proves the capability of LDASR in identifying novel lncRNA-disease associations. The promising experimental results show that the LDASR can be an excellent addition to the biomedical research in the future.
format Online
Article
Text
id pubmed-6733997
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-67339972019-09-12 A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest Guo, Zhen-Hao You, Zhu-Hong Wang, Yan-Bin Yi, Hai-Cheng Chen, Zhan-Heng iScience Article Long non-coding RNA (lncRNA) play critical roles in the occurrence and development of various diseases. The determination of the lncRNA-disease associations thus would contribute to provide new insights into the pathogenesis of the disease, the diagnosis, and the gene treatments. Considering that traditional experimental approaches are difficult to detect potential human lncRNA-disease associations from the vast amount of biological data, developing computational method could be of significant value. In this paper, we proposed a novel computational method named LDASR to identify associations between lncRNA and disease by analyzing known lncRNA-disease associations. First, the feature vectors of the lncRNA-disease pairs were obtained by integrating lncRNA Gaussian interaction profile kernel similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity. Second, autoencoder neural network was employed to reduce the feature dimension and get the optimal feature subspace from the original feature set. Finally, Rotating Forest was used to carry out prediction of lncRNA-disease association. The proposed method achieves an excellent preference with 0.9502 AUC in leave-one-out cross-validations (LOOCV) and 0.9428 AUC in 5-fold cross-validation, which significantly outperformed previous methods. Moreover, two kinds of case studies on identifying lncRNAs associated with colorectal cancer and glioma further proves the capability of LDASR in identifying novel lncRNA-disease associations. The promising experimental results show that the LDASR can be an excellent addition to the biomedical research in the future. Elsevier 2019-08-23 /pmc/articles/PMC6733997/ /pubmed/31494494 http://dx.doi.org/10.1016/j.isci.2019.08.030 Text en © 2019 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Guo, Zhen-Hao
You, Zhu-Hong
Wang, Yan-Bin
Yi, Hai-Cheng
Chen, Zhan-Heng
A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title_full A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title_fullStr A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title_full_unstemmed A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title_short A Learning-Based Method for LncRNA-Disease Association Identification Combing Similarity Information and Rotation Forest
title_sort learning-based method for lncrna-disease association identification combing similarity information and rotation forest
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6733997/
https://www.ncbi.nlm.nih.gov/pubmed/31494494
http://dx.doi.org/10.1016/j.isci.2019.08.030
work_keys_str_mv AT guozhenhao alearningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT youzhuhong alearningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT wangyanbin alearningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT yihaicheng alearningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT chenzhanheng alearningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT guozhenhao learningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT youzhuhong learningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT wangyanbin learningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT yihaicheng learningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest
AT chenzhanheng learningbasedmethodforlncrnadiseaseassociationidentificationcombingsimilarityinformationandrotationforest