Cargando…
HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
N(7)-methylguanosine (m(7)G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m(7)G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffecti...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7970120/ https://www.ncbi.nlm.nih.gov/pubmed/33747055 http://dx.doi.org/10.3389/fgene.2021.655284 |
Sumario: | N(7)-methylguanosine (m(7)G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m(7)G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffective for the identification of disease-related m(7)G sites. Thus, a heterogeneous network method based on Convolutional Neural Networks (HN-CNN) has been proposed to predict unknown associations between m(7)G sites and diseases. HN-CNN constructs a heterogeneous network with m(7)G site similarity, disease similarity, and disease-associated m(7)G sites to formulate features for m(7)G site-disease pairs. Next, a convolutional neural network (CNN) obtains multidimensional and irrelevant features prominently. Finally, XGBoost is adopted to predict the association between m(7)G sites and diseases. The performance of HN-CNN is compared with Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), as well as Gradient Boosting Decision Tree (GBDT) through 10-fold cross-validation. The average AUC of HN-CNN is 0.827, which is superior to others. |
---|