Cargando…

HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction

N(7)-methylguanosine (m(7)G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m(7)G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffecti...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lin, Chen, Jin, Ma, Jiani, Liu, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7970120/
https://www.ncbi.nlm.nih.gov/pubmed/33747055
http://dx.doi.org/10.3389/fgene.2021.655284
_version_ 1783666371155984384
author Zhang, Lin
Chen, Jin
Ma, Jiani
Liu, Hui
author_facet Zhang, Lin
Chen, Jin
Ma, Jiani
Liu, Hui
author_sort Zhang, Lin
collection PubMed
description N(7)-methylguanosine (m(7)G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m(7)G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffective for the identification of disease-related m(7)G sites. Thus, a heterogeneous network method based on Convolutional Neural Networks (HN-CNN) has been proposed to predict unknown associations between m(7)G sites and diseases. HN-CNN constructs a heterogeneous network with m(7)G site similarity, disease similarity, and disease-associated m(7)G sites to formulate features for m(7)G site-disease pairs. Next, a convolutional neural network (CNN) obtains multidimensional and irrelevant features prominently. Finally, XGBoost is adopted to predict the association between m(7)G sites and diseases. The performance of HN-CNN is compared with Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), as well as Gradient Boosting Decision Tree (GBDT) through 10-fold cross-validation. The average AUC of HN-CNN is 0.827, which is superior to others.
format Online
Article
Text
id pubmed-7970120
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79701202021-03-19 HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction Zhang, Lin Chen, Jin Ma, Jiani Liu, Hui Front Genet Genetics N(7)-methylguanosine (m(7)G) is a typical positively charged RNA modification, playing a vital role in transcriptional regulation. m(7)G can affect the biological processes of mRNA and tRNA and has associations with multiple diseases including cancers. Wet-lab experiments are cost and time ineffective for the identification of disease-related m(7)G sites. Thus, a heterogeneous network method based on Convolutional Neural Networks (HN-CNN) has been proposed to predict unknown associations between m(7)G sites and diseases. HN-CNN constructs a heterogeneous network with m(7)G site similarity, disease similarity, and disease-associated m(7)G sites to formulate features for m(7)G site-disease pairs. Next, a convolutional neural network (CNN) obtains multidimensional and irrelevant features prominently. Finally, XGBoost is adopted to predict the association between m(7)G sites and diseases. The performance of HN-CNN is compared with Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), as well as Gradient Boosting Decision Tree (GBDT) through 10-fold cross-validation. The average AUC of HN-CNN is 0.827, which is superior to others. Frontiers Media S.A. 2021-03-04 /pmc/articles/PMC7970120/ /pubmed/33747055 http://dx.doi.org/10.3389/fgene.2021.655284 Text en Copyright © 2021 Zhang, Chen, Ma and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Lin
Chen, Jin
Ma, Jiani
Liu, Hui
HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title_full HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title_fullStr HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title_full_unstemmed HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title_short HN-CNN: A Heterogeneous Network Based on Convolutional Neural Network for m(7) G Site Disease Association Prediction
title_sort hn-cnn: a heterogeneous network based on convolutional neural network for m(7) g site disease association prediction
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7970120/
https://www.ncbi.nlm.nih.gov/pubmed/33747055
http://dx.doi.org/10.3389/fgene.2021.655284
work_keys_str_mv AT zhanglin hncnnaheterogeneousnetworkbasedonconvolutionalneuralnetworkform7gsitediseaseassociationprediction
AT chenjin hncnnaheterogeneousnetworkbasedonconvolutionalneuralnetworkform7gsitediseaseassociationprediction
AT majiani hncnnaheterogeneousnetworkbasedonconvolutionalneuralnetworkform7gsitediseaseassociationprediction
AT liuhui hncnnaheterogeneousnetworkbasedonconvolutionalneuralnetworkform7gsitediseaseassociationprediction