Cargando…
Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression
BACKGROUND: In human genomes, long non-coding RNAs (lncRNAs) have attracted more and more attention because their dysfunctions are involved in many diseases. However, the associations between lncRNAs and diseases (LDA) still remain unknown in most cases. While identifying disease-related lncRNAs in...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763297/ https://www.ncbi.nlm.nih.gov/pubmed/29322937 http://dx.doi.org/10.1186/s12920-017-0305-y |
_version_ | 1783291856330686464 |
---|---|
author | Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Long, Yu-Xi Yiu, Siu-Ming |
author_facet | Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Long, Yu-Xi Yiu, Siu-Ming |
author_sort | Shi, Jian-Yu |
collection | PubMed |
description | BACKGROUND: In human genomes, long non-coding RNAs (lncRNAs) have attracted more and more attention because their dysfunctions are involved in many diseases. However, the associations between lncRNAs and diseases (LDA) still remain unknown in most cases. While identifying disease-related lncRNAs in vivo is costly, computational approaches are promising to not only accelerate the possible identification of associations but also provide clues on the underlying mechanism of various lncRNA-caused diseases. Former computational approaches usually only focus on predicting new associations between lncRNAs having known associations with diseases and other lncRNA-associated diseases. They also only work on binary lncRNA-disease associations (whether the pair has an association or not), which cannot reflect and reveal other biological facts, such as the number of proteins involved in LDA or how strong the association is (i.e., the intensity of LDA). RESULTS: To address abovementioned issues, we propose a graph regression-based unified framework (GRUF). In particular, our method can work on lncRNAs, which have no previously known disease association and diseases that have no known association with any lncRNAs. Also, instead of only a binary answer for the association, our method tries to uncover more biological relationship between a pair of lncRNA and disease, which may provide better clues for researchers. We compared GRUF with three state-of-the-art approaches and demonstrated the superiority of GRUF, which achieves 5%~16% improvement in terms of the area under the receiver operating characteristic curve (AUC). GRUF also provides a predicted confidence score for the predicted LDA, which reveals the significant correlation between the score and the number of RNA-Binding Proteins involved in LDAs. Lastly, three out of top-5 LDA candidates generated by GRUF in novel prediction are verified indirectly by medical literature and known biological facts. CONCLUSIONS: The proposed GRUF has two advantages over existing approaches. Firstly, it can be used to work on lncRNAs that have no known disease association and diseases that have no known association with any lncRNAs. Secondly, instead of providing a binary answer (with or without association), GRUF works for both discrete and continued LDA, which help revealing the pathological implications between lncRNAs and diseases. |
format | Online Article Text |
id | pubmed-5763297 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57632972018-01-17 Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Long, Yu-Xi Yiu, Siu-Ming BMC Med Genomics Research BACKGROUND: In human genomes, long non-coding RNAs (lncRNAs) have attracted more and more attention because their dysfunctions are involved in many diseases. However, the associations between lncRNAs and diseases (LDA) still remain unknown in most cases. While identifying disease-related lncRNAs in vivo is costly, computational approaches are promising to not only accelerate the possible identification of associations but also provide clues on the underlying mechanism of various lncRNA-caused diseases. Former computational approaches usually only focus on predicting new associations between lncRNAs having known associations with diseases and other lncRNA-associated diseases. They also only work on binary lncRNA-disease associations (whether the pair has an association or not), which cannot reflect and reveal other biological facts, such as the number of proteins involved in LDA or how strong the association is (i.e., the intensity of LDA). RESULTS: To address abovementioned issues, we propose a graph regression-based unified framework (GRUF). In particular, our method can work on lncRNAs, which have no previously known disease association and diseases that have no known association with any lncRNAs. Also, instead of only a binary answer for the association, our method tries to uncover more biological relationship between a pair of lncRNA and disease, which may provide better clues for researchers. We compared GRUF with three state-of-the-art approaches and demonstrated the superiority of GRUF, which achieves 5%~16% improvement in terms of the area under the receiver operating characteristic curve (AUC). GRUF also provides a predicted confidence score for the predicted LDA, which reveals the significant correlation between the score and the number of RNA-Binding Proteins involved in LDAs. Lastly, three out of top-5 LDA candidates generated by GRUF in novel prediction are verified indirectly by medical literature and known biological facts. CONCLUSIONS: The proposed GRUF has two advantages over existing approaches. Firstly, it can be used to work on lncRNAs that have no known disease association and diseases that have no known association with any lncRNAs. Secondly, instead of providing a binary answer (with or without association), GRUF works for both discrete and continued LDA, which help revealing the pathological implications between lncRNAs and diseases. BioMed Central 2017-12-21 /pmc/articles/PMC5763297/ /pubmed/29322937 http://dx.doi.org/10.1186/s12920-017-0305-y Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shi, Jian-Yu Huang, Hua Zhang, Yan-Ning Long, Yu-Xi Yiu, Siu-Ming Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title | Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title_full | Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title_fullStr | Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title_full_unstemmed | Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title_short | Predicting binary, discrete and continued lncRNA-disease associations via a unified framework based on graph regression |
title_sort | predicting binary, discrete and continued lncrna-disease associations via a unified framework based on graph regression |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763297/ https://www.ncbi.nlm.nih.gov/pubmed/29322937 http://dx.doi.org/10.1186/s12920-017-0305-y |
work_keys_str_mv | AT shijianyu predictingbinarydiscreteandcontinuedlncrnadiseaseassociationsviaaunifiedframeworkbasedongraphregression AT huanghua predictingbinarydiscreteandcontinuedlncrnadiseaseassociationsviaaunifiedframeworkbasedongraphregression AT zhangyanning predictingbinarydiscreteandcontinuedlncrnadiseaseassociationsviaaunifiedframeworkbasedongraphregression AT longyuxi predictingbinarydiscreteandcontinuedlncrnadiseaseassociationsviaaunifiedframeworkbasedongraphregression AT yiusiuming predictingbinarydiscreteandcontinuedlncrnadiseaseassociationsviaaunifiedframeworkbasedongraphregression |