Cargando…
Identifying potential association on gene-disease network via dual hypergraph regularized least squares
BACKGROUND: Identifying potential associations between genes and diseases via biomedical experiments must be the time-consuming and expensive research works. The computational technologies based on machine learning models have been widely utilized to explore genetic information related to complex di...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8351363/ https://www.ncbi.nlm.nih.gov/pubmed/34372777 http://dx.doi.org/10.1186/s12864-021-07864-z |
_version_ | 1783735958999400448 |
---|---|
author | Yang, Hongpeng Ding, Yijie Tang, Jijun Guo, Fei |
author_facet | Yang, Hongpeng Ding, Yijie Tang, Jijun Guo, Fei |
author_sort | Yang, Hongpeng |
collection | PubMed |
description | BACKGROUND: Identifying potential associations between genes and diseases via biomedical experiments must be the time-consuming and expensive research works. The computational technologies based on machine learning models have been widely utilized to explore genetic information related to complex diseases. Importantly, the gene-disease association detection can be defined as the link prediction problem in bipartite network. However, many existing methods do not utilize multiple sources of biological information; Additionally, they do not extract higher-order relationships among genes and diseases. RESULTS: In this study, we propose a novel method called Dual Hypergraph Regularized Least Squares (DHRLS) with Centered Kernel Alignment-based Multiple Kernel Learning (CKA-MKL), in order to detect all potential gene-disease associations. First, we construct multiple kernels based on various biological data sources in gene and disease spaces respectively. After that, we use CAK-MKL to obtain the optimal kernels in the two spaces respectively. To specific, hypergraph can be employed to establish higher-order relationships. Finally, our DHRLS model is solved by the Alternating Least squares algorithm (ALSA), for predicting gene-disease associations. CONCLUSION: Comparing with many outstanding prediction tools, DHRLS achieves best performance on gene-disease associations network under two types of cross validation. To verify robustness, our proposed approach has excellent prediction performance on six real-world networks. Our research work can effectively discover potential disease-associated genes and provide guidance for the follow-up verification methods of complex diseases. |
format | Online Article Text |
id | pubmed-8351363 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-83513632021-08-09 Identifying potential association on gene-disease network via dual hypergraph regularized least squares Yang, Hongpeng Ding, Yijie Tang, Jijun Guo, Fei BMC Genomics Methodology Article BACKGROUND: Identifying potential associations between genes and diseases via biomedical experiments must be the time-consuming and expensive research works. The computational technologies based on machine learning models have been widely utilized to explore genetic information related to complex diseases. Importantly, the gene-disease association detection can be defined as the link prediction problem in bipartite network. However, many existing methods do not utilize multiple sources of biological information; Additionally, they do not extract higher-order relationships among genes and diseases. RESULTS: In this study, we propose a novel method called Dual Hypergraph Regularized Least Squares (DHRLS) with Centered Kernel Alignment-based Multiple Kernel Learning (CKA-MKL), in order to detect all potential gene-disease associations. First, we construct multiple kernels based on various biological data sources in gene and disease spaces respectively. After that, we use CAK-MKL to obtain the optimal kernels in the two spaces respectively. To specific, hypergraph can be employed to establish higher-order relationships. Finally, our DHRLS model is solved by the Alternating Least squares algorithm (ALSA), for predicting gene-disease associations. CONCLUSION: Comparing with many outstanding prediction tools, DHRLS achieves best performance on gene-disease associations network under two types of cross validation. To verify robustness, our proposed approach has excellent prediction performance on six real-world networks. Our research work can effectively discover potential disease-associated genes and provide guidance for the follow-up verification methods of complex diseases. BioMed Central 2021-08-09 /pmc/articles/PMC8351363/ /pubmed/34372777 http://dx.doi.org/10.1186/s12864-021-07864-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Yang, Hongpeng Ding, Yijie Tang, Jijun Guo, Fei Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title | Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title_full | Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title_fullStr | Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title_full_unstemmed | Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title_short | Identifying potential association on gene-disease network via dual hypergraph regularized least squares |
title_sort | identifying potential association on gene-disease network via dual hypergraph regularized least squares |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8351363/ https://www.ncbi.nlm.nih.gov/pubmed/34372777 http://dx.doi.org/10.1186/s12864-021-07864-z |
work_keys_str_mv | AT yanghongpeng identifyingpotentialassociationongenediseasenetworkviadualhypergraphregularizedleastsquares AT dingyijie identifyingpotentialassociationongenediseasenetworkviadualhypergraphregularizedleastsquares AT tangjijun identifyingpotentialassociationongenediseasenetworkviadualhypergraphregularizedleastsquares AT guofei identifyingpotentialassociationongenediseasenetworkviadualhypergraphregularizedleastsquares |