Cargando…

EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning

BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and s...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Jianhua, Wei, Zhangying, Sun, Mingwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598967/
https://www.ncbi.nlm.nih.gov/pubmed/37880673
http://dx.doi.org/10.1186/s12859-023-05543-2
_version_ 1785125673084387328
author Jia, Jianhua
Wei, Zhangying
Sun, Mingwei
author_facet Jia, Jianhua
Wei, Zhangying
Sun, Mingwei
author_sort Jia, Jianhua
collection PubMed
description BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and stomach cancer, among others. As a result, N6,2′-O-dimethyladenosine (m(6)Am) site will play a crucial part in the regulation of RNA if it can be correctly identified. RESULTS: This study proposes a novel deep learning-based m(6)Am prediction model, EMDL_m6Am, which employs one-hot encoding to expressthe feature map of the RNA sequence and recognizes m(6)Am sites by integrating different CNN models via stacking. Including DenseNet, Inflated Convolutional Network (DCNN) and Deep Multiscale Residual Network (MSRN), the sensitivity (Sn), specificity (Sp), accuracy (ACC), Mathews correlation coefficient (MCC) and area under the curve (AUC) of our model on the training data set reach 86.62%, 88.94%, 87.78%, 0.7590 and 0.8778, respectively, and the prediction results on the independent test set are as high as 82.25%, 79.72%, 80.98%, 0.6199, and 0.8211. CONCLUSIONS: In conclusion, the experimental results demonstrated that EMDL_m6Am greatly improved the predictive performance of the m(6)Am sites and could provide a valuable reference for the next part of the study. The source code and experimental data are available at: https://github.com/13133989982/EMDL-m6Am.
format Online
Article
Text
id pubmed-10598967
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105989672023-10-26 EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning Jia, Jianhua Wei, Zhangying Sun, Mingwei BMC Bioinformatics Research BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and stomach cancer, among others. As a result, N6,2′-O-dimethyladenosine (m(6)Am) site will play a crucial part in the regulation of RNA if it can be correctly identified. RESULTS: This study proposes a novel deep learning-based m(6)Am prediction model, EMDL_m6Am, which employs one-hot encoding to expressthe feature map of the RNA sequence and recognizes m(6)Am sites by integrating different CNN models via stacking. Including DenseNet, Inflated Convolutional Network (DCNN) and Deep Multiscale Residual Network (MSRN), the sensitivity (Sn), specificity (Sp), accuracy (ACC), Mathews correlation coefficient (MCC) and area under the curve (AUC) of our model on the training data set reach 86.62%, 88.94%, 87.78%, 0.7590 and 0.8778, respectively, and the prediction results on the independent test set are as high as 82.25%, 79.72%, 80.98%, 0.6199, and 0.8211. CONCLUSIONS: In conclusion, the experimental results demonstrated that EMDL_m6Am greatly improved the predictive performance of the m(6)Am sites and could provide a valuable reference for the next part of the study. The source code and experimental data are available at: https://github.com/13133989982/EMDL-m6Am. BioMed Central 2023-10-25 /pmc/articles/PMC10598967/ /pubmed/37880673 http://dx.doi.org/10.1186/s12859-023-05543-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Jia, Jianhua
Wei, Zhangying
Sun, Mingwei
EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title_full EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title_fullStr EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title_full_unstemmed EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title_short EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
title_sort emdl_m6am: identifying n6,2′-o-dimethyladenosine sites based on stacking ensemble deep learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598967/
https://www.ncbi.nlm.nih.gov/pubmed/37880673
http://dx.doi.org/10.1186/s12859-023-05543-2
work_keys_str_mv AT jiajianhua emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning
AT weizhangying emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning
AT sunmingwei emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning