Cargando…

CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network

BACKGROUND: The existing studies show that circRNAs can be used as a biomarker of diseases and play a prominent role in the treatment and diagnosis of diseases. However, the relationships between the vast majority of circRNAs and diseases are still unclear, and more experiments are needed to study t...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Zhihao, Kuang, Zhufang, Deng, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588735/
https://www.ncbi.nlm.nih.gov/pubmed/34772332
http://dx.doi.org/10.1186/s12859-021-04467-z
_version_ 1784598544499343360
author Ma, Zhihao
Kuang, Zhufang
Deng, Lei
author_facet Ma, Zhihao
Kuang, Zhufang
Deng, Lei
author_sort Ma, Zhihao
collection PubMed
description BACKGROUND: The existing studies show that circRNAs can be used as a biomarker of diseases and play a prominent role in the treatment and diagnosis of diseases. However, the relationships between the vast majority of circRNAs and diseases are still unclear, and more experiments are needed to study the mechanism of circRNAs. Nowadays, some scholars use the attributes between circRNAs and diseases to study and predict their associations. Nonetheless, most of the existing experimental methods use less information about the attributes of circRNAs, which has a certain impact on the accuracy of the final prediction results. On the other hand, some scholars also apply experimental methods to predict the associations between circRNAs and diseases. But such methods are usually expensive and time-consuming. Based on the above shortcomings, follow-up research is needed to propose a more efficient calculation-based method to predict the associations between circRNAs and diseases. RESULTS: In this study, a novel algorithm (method) is proposed, which is based on the Graph Convolutional Network (GCN) constructed with Random Walk with Restart (RWR) and Principal Component Analysis (PCA) to predict the associations between circRNAs and diseases (CRPGCN). In the construction of CRPGCN, the RWR algorithm is used to improve the similarity associations of the computed nodes with their neighbours. After that, the PCA method is used to dimensionality reduction and extract features, it makes the connection between circRNAs with higher similarity and diseases closer. Finally, The GCN algorithm is used to learn the features between circRNAs and diseases and calculate the final similarity scores, and the learning datas are constructed from the adjacency matrix, similarity matrix and feature matrix as a heterogeneous adjacency matrix and a heterogeneous feature matrix. CONCLUSIONS: After 2-fold cross-validation, 5-fold cross-validation and 10-fold cross-validation, the area under the ROC curve of the CRPGCN is 0.9490, 0.9720 and 0.9722, respectively. The CRPGCN method has a valuable effect in predict the associations between circRNAs and diseases.
format Online
Article
Text
id pubmed-8588735
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85887352021-11-15 CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network Ma, Zhihao Kuang, Zhufang Deng, Lei BMC Bioinformatics Research BACKGROUND: The existing studies show that circRNAs can be used as a biomarker of diseases and play a prominent role in the treatment and diagnosis of diseases. However, the relationships between the vast majority of circRNAs and diseases are still unclear, and more experiments are needed to study the mechanism of circRNAs. Nowadays, some scholars use the attributes between circRNAs and diseases to study and predict their associations. Nonetheless, most of the existing experimental methods use less information about the attributes of circRNAs, which has a certain impact on the accuracy of the final prediction results. On the other hand, some scholars also apply experimental methods to predict the associations between circRNAs and diseases. But such methods are usually expensive and time-consuming. Based on the above shortcomings, follow-up research is needed to propose a more efficient calculation-based method to predict the associations between circRNAs and diseases. RESULTS: In this study, a novel algorithm (method) is proposed, which is based on the Graph Convolutional Network (GCN) constructed with Random Walk with Restart (RWR) and Principal Component Analysis (PCA) to predict the associations between circRNAs and diseases (CRPGCN). In the construction of CRPGCN, the RWR algorithm is used to improve the similarity associations of the computed nodes with their neighbours. After that, the PCA method is used to dimensionality reduction and extract features, it makes the connection between circRNAs with higher similarity and diseases closer. Finally, The GCN algorithm is used to learn the features between circRNAs and diseases and calculate the final similarity scores, and the learning datas are constructed from the adjacency matrix, similarity matrix and feature matrix as a heterogeneous adjacency matrix and a heterogeneous feature matrix. CONCLUSIONS: After 2-fold cross-validation, 5-fold cross-validation and 10-fold cross-validation, the area under the ROC curve of the CRPGCN is 0.9490, 0.9720 and 0.9722, respectively. The CRPGCN method has a valuable effect in predict the associations between circRNAs and diseases. BioMed Central 2021-11-12 /pmc/articles/PMC8588735/ /pubmed/34772332 http://dx.doi.org/10.1186/s12859-021-04467-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Ma, Zhihao
Kuang, Zhufang
Deng, Lei
CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title_full CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title_fullStr CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title_full_unstemmed CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title_short CRPGCN: predicting circRNA-disease associations using graph convolutional network based on heterogeneous network
title_sort crpgcn: predicting circrna-disease associations using graph convolutional network based on heterogeneous network
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588735/
https://www.ncbi.nlm.nih.gov/pubmed/34772332
http://dx.doi.org/10.1186/s12859-021-04467-z
work_keys_str_mv AT mazhihao crpgcnpredictingcircrnadiseaseassociationsusinggraphconvolutionalnetworkbasedonheterogeneousnetwork
AT kuangzhufang crpgcnpredictingcircrnadiseaseassociationsusinggraphconvolutionalnetworkbasedonheterogeneousnetwork
AT denglei crpgcnpredictingcircrnadiseaseassociationsusinggraphconvolutionalnetworkbasedonheterogeneousnetwork