Cargando…
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Neverthe...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7430881/ https://www.ncbi.nlm.nih.gov/pubmed/32736513 http://dx.doi.org/10.1186/s12859-020-03699-9 |
_version_ | 1783571500946685952 |
---|---|
author | Fan, Wenwen Shang, Junliang Li, Feng Sun, Yan Yuan, Shasha Liu, Jin-Xing |
author_facet | Fan, Wenwen Shang, Junliang Li, Feng Sun, Yan Yuan, Shasha Liu, Jin-Xing |
author_sort | Fan, Wenwen |
collection | PubMed |
description | BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. RESULTS: In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. CONCLUSIONS: Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM. |
format | Online Article Text |
id | pubmed-7430881 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-74308812020-08-18 IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method Fan, Wenwen Shang, Junliang Li, Feng Sun, Yan Yuan, Shasha Liu, Jin-Xing BMC Bioinformatics Methodology Article BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. RESULTS: In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. CONCLUSIONS: Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM. BioMed Central 2020-07-31 /pmc/articles/PMC7430881/ /pubmed/32736513 http://dx.doi.org/10.1186/s12859-020-03699-9 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Fan, Wenwen Shang, Junliang Li, Feng Sun, Yan Yuan, Shasha Liu, Jin-Xing IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title | IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_full | IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_fullStr | IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_full_unstemmed | IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_short | IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method |
title_sort | idssim: an lncrna functional similarity calculation model based on an improved disease semantic similarity method |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7430881/ https://www.ncbi.nlm.nih.gov/pubmed/32736513 http://dx.doi.org/10.1186/s12859-020-03699-9 |
work_keys_str_mv | AT fanwenwen idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT shangjunliang idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT lifeng idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT sunyan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT yuanshasha idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod AT liujinxing idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod |