Cargando…

Combined embedding model for MiRNA-disease association prediction

BACKGROUND: Cumulative evidence from biological experiments has confirmed that miRNAs have significant roles to diagnose and treat complex diseases. However, traditional medical experiments have limitations in time-consuming and high cost so that they fail to find the unconfirmed miRNA and disease i...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bailong, Zhu, Xiaoyan, Zhang, Lei, Liang, Zhizheng, Li, Zhengwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7995599/
https://www.ncbi.nlm.nih.gov/pubmed/33765909
http://dx.doi.org/10.1186/s12859-021-04092-w
_version_ 1783669949859889152
author Liu, Bailong
Zhu, Xiaoyan
Zhang, Lei
Liang, Zhizheng
Li, Zhengwei
author_facet Liu, Bailong
Zhu, Xiaoyan
Zhang, Lei
Liang, Zhizheng
Li, Zhengwei
author_sort Liu, Bailong
collection PubMed
description BACKGROUND: Cumulative evidence from biological experiments has confirmed that miRNAs have significant roles to diagnose and treat complex diseases. However, traditional medical experiments have limitations in time-consuming and high cost so that they fail to find the unconfirmed miRNA and disease interactions. Thus, discovering potential miRNA-disease associations will make a contribution to the decrease of the pathogenesis of diseases and benefit disease therapy. Although, existing methods using different computational algorithms have favorable performances to search for the potential miRNA-disease interactions. We still need to do some work to improve experimental results. RESULTS: We present a novel combined embedding model to predict MiRNA-disease associations (CEMDA) in this article. The combined embedding information of miRNA and disease is composed of pair embedding and node embedding. Compared with the previous heterogeneous network methods that are merely node-centric to simply compute the similarity of miRNA and disease, our method fuses pair embedding to pay more attention to capturing the features behind the relative information, which models the fine-grained pairwise relationship better than the previous case when each node only has a single embedding. First, we construct the heterogeneous network from supported miRNA-disease pairs, disease semantic similarity and miRNA functional similarity. Given by the above heterogeneous network, we find all the associated context paths of each confirmed miRNA and disease. Meta-paths are linked by nodes and then input to the gate recurrent unit (GRU) to directly learn more accurate similarity measures between miRNA and disease. Here, the multi-head attention mechanism is used to weight the hidden state of each meta-path, and the similarity information transmission mechanism in a meta-path of miRNA and disease is obtained through multiple network layers. Second, pair embedding of miRNA and disease is fed to the multi-layer perceptron (MLP), which focuses on more important segments in pairwise relationship. Finally, we combine meta-path based node embedding and pair embedding with the cost function to learn and predict miRNA-disease association. The source code and data sets that verify the results of our research are shown at https://github.com/liubailong/CEMDA. CONCLUSIONS: The performance of CEMDA in the leave-one-out cross validation and fivefold cross validation are 93.16% and 92.03%, respectively. It denotes that compared with other methods, CEMDA accomplishes superior performance. Three cases with lung cancers, breast cancers, prostate cancers and pancreatic cancers show that 48,50,50 and 50 out of the top 50 miRNAs, which are confirmed in HDMM V2.0. Thus, this further identifies the feasibility and effectiveness of our method. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04092-w.
format Online
Article
Text
id pubmed-7995599
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79955992021-03-26 Combined embedding model for MiRNA-disease association prediction Liu, Bailong Zhu, Xiaoyan Zhang, Lei Liang, Zhizheng Li, Zhengwei BMC Bioinformatics Research Article BACKGROUND: Cumulative evidence from biological experiments has confirmed that miRNAs have significant roles to diagnose and treat complex diseases. However, traditional medical experiments have limitations in time-consuming and high cost so that they fail to find the unconfirmed miRNA and disease interactions. Thus, discovering potential miRNA-disease associations will make a contribution to the decrease of the pathogenesis of diseases and benefit disease therapy. Although, existing methods using different computational algorithms have favorable performances to search for the potential miRNA-disease interactions. We still need to do some work to improve experimental results. RESULTS: We present a novel combined embedding model to predict MiRNA-disease associations (CEMDA) in this article. The combined embedding information of miRNA and disease is composed of pair embedding and node embedding. Compared with the previous heterogeneous network methods that are merely node-centric to simply compute the similarity of miRNA and disease, our method fuses pair embedding to pay more attention to capturing the features behind the relative information, which models the fine-grained pairwise relationship better than the previous case when each node only has a single embedding. First, we construct the heterogeneous network from supported miRNA-disease pairs, disease semantic similarity and miRNA functional similarity. Given by the above heterogeneous network, we find all the associated context paths of each confirmed miRNA and disease. Meta-paths are linked by nodes and then input to the gate recurrent unit (GRU) to directly learn more accurate similarity measures between miRNA and disease. Here, the multi-head attention mechanism is used to weight the hidden state of each meta-path, and the similarity information transmission mechanism in a meta-path of miRNA and disease is obtained through multiple network layers. Second, pair embedding of miRNA and disease is fed to the multi-layer perceptron (MLP), which focuses on more important segments in pairwise relationship. Finally, we combine meta-path based node embedding and pair embedding with the cost function to learn and predict miRNA-disease association. The source code and data sets that verify the results of our research are shown at https://github.com/liubailong/CEMDA. CONCLUSIONS: The performance of CEMDA in the leave-one-out cross validation and fivefold cross validation are 93.16% and 92.03%, respectively. It denotes that compared with other methods, CEMDA accomplishes superior performance. Three cases with lung cancers, breast cancers, prostate cancers and pancreatic cancers show that 48,50,50 and 50 out of the top 50 miRNAs, which are confirmed in HDMM V2.0. Thus, this further identifies the feasibility and effectiveness of our method. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04092-w. BioMed Central 2021-03-25 /pmc/articles/PMC7995599/ /pubmed/33765909 http://dx.doi.org/10.1186/s12859-021-04092-w Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Liu, Bailong
Zhu, Xiaoyan
Zhang, Lei
Liang, Zhizheng
Li, Zhengwei
Combined embedding model for MiRNA-disease association prediction
title Combined embedding model for MiRNA-disease association prediction
title_full Combined embedding model for MiRNA-disease association prediction
title_fullStr Combined embedding model for MiRNA-disease association prediction
title_full_unstemmed Combined embedding model for MiRNA-disease association prediction
title_short Combined embedding model for MiRNA-disease association prediction
title_sort combined embedding model for mirna-disease association prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7995599/
https://www.ncbi.nlm.nih.gov/pubmed/33765909
http://dx.doi.org/10.1186/s12859-021-04092-w
work_keys_str_mv AT liubailong combinedembeddingmodelformirnadiseaseassociationprediction
AT zhuxiaoyan combinedembeddingmodelformirnadiseaseassociationprediction
AT zhanglei combinedembeddingmodelformirnadiseaseassociationprediction
AT liangzhizheng combinedembeddingmodelformirnadiseaseassociationprediction
AT lizhengwei combinedembeddingmodelformirnadiseaseassociationprediction