Cargando…

A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations

BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been...

Descripción completa

Detalles Bibliográficos
Autores principales:	Shi, Zhuangwei, Zhang, Han, Jin, Chen, Quan, Xiongwen, Yin, Yanbin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/ https://www.ncbi.nlm.nih.gov/pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z

_version_	1783667873016709120
author	Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin
author_facet	Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin
author_sort	Shi, Zhuangwei
collection	PubMed
description	BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z.
format	Online Article Text
id	pubmed-7983260
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-79832602021-03-22 A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin BMC Bioinformatics Research Article BACKGROUND: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. RESULTS: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. CONCLUSION: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04073-z. BioMed Central 2021-03-21 /pmc/articles/PMC7983260/ /pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Article Shi, Zhuangwei Zhang, Han Jin, Chen Quan, Xiongwen Yin, Yanbin A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title	A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_full	A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_fullStr	A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_full_unstemmed	A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_short	A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations
title_sort	representation learning model based on variational inference and graph autoencoder for predicting lncrna-disease associations
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7983260/ https://www.ncbi.nlm.nih.gov/pubmed/33745450 http://dx.doi.org/10.1186/s12859-021-04073-z
work_keys_str_mv	AT shizhuangwei arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT zhanghan arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT jinchen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT quanxiongwen arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT yinyanbin arepresentationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT shizhuangwei representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT zhanghan representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT jinchen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT quanxiongwen representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations AT yinyanbin representationlearningmodelbasedonvariationalinferenceandgraphautoencoderforpredictinglncrnadiseaseassociations

A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations

Ejemplares similares