Cargando…

A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations

BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Zhuangwei, Zhang, Han, Jin, Chen, Quan, Xiongwen, Yin, Yanbin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/
https://www.ncbi.nlm.nih.gov/pubmed/33745450
http://dx.doi.org/10.1186/s12859-021-04073-z
_version_ 1783667873016709120
author Shi, Zhuangwei
Zhang, Han
Jin, Chen
Quan, Xiongwen
Yin, Yanbin
author_facet Shi, Zhuangwei
Zhang, Han
Jin, Chen
Quan, Xiongwen
Yin, Yanbin
author_sort Shi, Zhuangwei
collection PubMed
description BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z.
format Online
Article
Text
id pubmed-7983260
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79832602021-03-22 A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin BMC Bioinformatics Research Article BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z. BioMed Central 2021-03-21 /pmc/articles/PMC7983260/ /pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Shi, Zhuangwei
Zhang, Han
Jin, Chen
Quan, Xiongwen
Yin, Yanbin
A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_full A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_fullStr A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_full_unstemmed A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_short A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_sort representation learning model based on variational inference and graph autoencoder for predicting lncrna-disease associations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/
https://www.ncbi.nlm.nih.gov/pubmed/33745450
http://dx.doi.org/10.1186/s12859-021-04073-z
work_keys_str_mv AT shizhuangwei arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT zhanghan arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT jinchen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT quanxiongwen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT yinyanbin arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT shizhuangwei representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT zhanghan representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT jinchen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT quanxiongwen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations
AT yinyanbin representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations