Cargando…

IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method

BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Neverthe...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Wenwen, Shang, Junliang, Li, Feng, Sun, Yan, Yuan, Shasha, Liu, Jin-Xing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7430881/
https://www.ncbi.nlm.nih.gov/pubmed/32736513
http://dx.doi.org/10.1186/s12859-020-03699-9
_version_ 1783571500946685952
author Fan, Wenwen
Shang, Junliang
Li, Feng
Sun, Yan
Yuan, Shasha
Liu, Jin-Xing
author_facet Fan, Wenwen
Shang, Junliang
Li, Feng
Sun, Yan
Yuan, Shasha
Liu, Jin-Xing
author_sort Fan, Wenwen
collection PubMed
description BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. RESULTS: In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. CONCLUSIONS: Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM.
format Online
Article
Text
id pubmed-7430881
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74308812020-08-18 IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method Fan, Wenwen Shang, Junliang Li, Feng Sun, Yan Yuan, Shasha Liu, Jin-Xing BMC Bioinformatics Methodology Article BACKGROUND: It has been widely accepted that long non-coding RNAs (lncRNAs) play important roles in the development and progression of human diseases. Many association prediction models have been proposed for predicting lncRNA functions and identifying potential lncRNA-disease associations. Nevertheless, among them, little effort has been attempted to measure lncRNA functional similarity, which is an essential part of association prediction models. RESULTS: In this study, we presented an lncRNA functional similarity calculation model, IDSSIM for short, based on an improved disease semantic similarity method, highlight of which is the introduction of information content contribution factor into the semantic value calculation to take into account both the hierarchical structures of disease directed acyclic graphs and the disease specificities. IDSSIM and three state-of-the-art models, i.e., LNCSIM1, LNCSIM2, and ILNCSIM, were evaluated by applying their disease semantic similarity matrices and the lncRNA functional similarity matrices, as well as corresponding matrices of human lncRNA-disease associations coming from either lncRNADisease database or MNDR database, into an association prediction method WKNKN for lncRNA-disease association prediction. In addition, case studies of breast cancer and adenocarcinoma were also performed to validate the effectiveness of IDSSIM. CONCLUSIONS: Results demonstrated that in terms of ROC curves and AUC values, IDSSIM is superior to compared models, and can improve accuracy of disease semantic similarity effectively, leading to increase the association prediction ability of the IDSSIM-WKNKN model; in terms of case studies, most of potential disease-associated lncRNAs predicted by IDSSIM can be confirmed by databases and literatures, implying that IDSSIM can serve as a promising tool for predicting lncRNA functions, identifying potential lncRNA-disease associations, and pre-screening candidate lncRNAs to perform biological experiments. The IDSSIM code, all experimental data and prediction results are available online at https://github.com/CDMB-lab/IDSSIM. BioMed Central 2020-07-31 /pmc/articles/PMC7430881/ /pubmed/32736513 http://dx.doi.org/10.1186/s12859-020-03699-9 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Fan, Wenwen
Shang, Junliang
Li, Feng
Sun, Yan
Yuan, Shasha
Liu, Jin-Xing
IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_full IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_fullStr IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_full_unstemmed IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_short IDSSIM: an lncRNA functional similarity calculation model based on an improved disease semantic similarity method
title_sort idssim: an lncrna functional similarity calculation model based on an improved disease semantic similarity method
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7430881/
https://www.ncbi.nlm.nih.gov/pubmed/32736513
http://dx.doi.org/10.1186/s12859-020-03699-9
work_keys_str_mv AT fanwenwen idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT shangjunliang idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT lifeng idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT sunyan idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT yuanshasha idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod
AT liujinxing idssimanlncrnafunctionalsimilaritycalculationmodelbasedonanimproveddiseasesemanticsimilaritymethod