Cargando…
A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/ https://www.ncbi.nlm.nih.gov/pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z |
_version_ | 1783667873016709120 |
---|---|
author | Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin |
author_facet | Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin |
author_sort | Shi, Zhuangwei |
collection | PubMed |
description | BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z. |
format | Online Article Text |
id | pubmed-7983260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79832602021-03-22 A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin BMC Bioinformatics Research Article BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z. BioMed Central 2021-03-21 /pmc/articles/PMC7983260/ /pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title | A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title_full | A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title_fullStr | A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title_full_unstemmed | A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title_short | A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations |
title_sort | representation learning model based on variational inference and graph autoencoder for predicting lncrna-disease associations |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/ https://www.ncbi.nlm.nih.gov/pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z |
work_keys_str_mv | AT shizhuangwei arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT zhanghan arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT jinchen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT quanxiongwen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT yinyanbin arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT shizhuangwei representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT zhanghan representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT jinchen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT quanxiongwen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT yinyanbin representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations |