Cargando…
EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning
BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598967/ https://www.ncbi.nlm.nih.gov/pubmed/37880673 http://dx.doi.org/10.1186/s12859-023-05543-2 |
_version_ | 1785125673084387328 |
---|---|
author | Jia, Jianhua Wei, Zhangying Sun, Mingwei |
author_facet | Jia, Jianhua Wei, Zhangying Sun, Mingwei |
author_sort | Jia, Jianhua |
collection | PubMed |
description | BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and stomach cancer, among others. As a result, N6,2′-O-dimethyladenosine (m(6)Am) site will play a crucial part in the regulation of RNA if it can be correctly identified. RESULTS: This study proposes a novel deep learning-based m(6)Am prediction model, EMDL_m6Am, which employs one-hot encoding to expressthe feature map of the RNA sequence and recognizes m(6)Am sites by integrating different CNN models via stacking. Including DenseNet, Inflated Convolutional Network (DCNN) and Deep Multiscale Residual Network (MSRN), the sensitivity (Sn), specificity (Sp), accuracy (ACC), Mathews correlation coefficient (MCC) and area under the curve (AUC) of our model on the training data set reach 86.62%, 88.94%, 87.78%, 0.7590 and 0.8778, respectively, and the prediction results on the independent test set are as high as 82.25%, 79.72%, 80.98%, 0.6199, and 0.8211. CONCLUSIONS: In conclusion, the experimental results demonstrated that EMDL_m6Am greatly improved the predictive performance of the m(6)Am sites and could provide a valuable reference for the next part of the study. The source code and experimental data are available at: https://github.com/13133989982/EMDL-m6Am. |
format | Online Article Text |
id | pubmed-10598967 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105989672023-10-26 EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning Jia, Jianhua Wei, Zhangying Sun, Mingwei BMC Bioinformatics Research BACKGROUND: N6, 2'-O-dimethyladenosine (m(6)Am) is an abundant RNA methylation modification on vertebrate mRNAs and is present in the transcription initiation region of mRNAs. It has recently been experimentally shown to be associated with several human disorders, including obesity genes, and stomach cancer, among others. As a result, N6,2′-O-dimethyladenosine (m(6)Am) site will play a crucial part in the regulation of RNA if it can be correctly identified. RESULTS: This study proposes a novel deep learning-based m(6)Am prediction model, EMDL_m6Am, which employs one-hot encoding to expressthe feature map of the RNA sequence and recognizes m(6)Am sites by integrating different CNN models via stacking. Including DenseNet, Inflated Convolutional Network (DCNN) and Deep Multiscale Residual Network (MSRN), the sensitivity (Sn), specificity (Sp), accuracy (ACC), Mathews correlation coefficient (MCC) and area under the curve (AUC) of our model on the training data set reach 86.62%, 88.94%, 87.78%, 0.7590 and 0.8778, respectively, and the prediction results on the independent test set are as high as 82.25%, 79.72%, 80.98%, 0.6199, and 0.8211. CONCLUSIONS: In conclusion, the experimental results demonstrated that EMDL_m6Am greatly improved the predictive performance of the m(6)Am sites and could provide a valuable reference for the next part of the study. The source code and experimental data are available at: https://github.com/13133989982/EMDL-m6Am. BioMed Central 2023-10-25 /pmc/articles/PMC10598967/ /pubmed/37880673 http://dx.doi.org/10.1186/s12859-023-05543-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Jia, Jianhua Wei, Zhangying Sun, Mingwei EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title | EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title_full | EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title_fullStr | EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title_full_unstemmed | EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title_short | EMDL_m6Am: identifying N6,2′-O-dimethyladenosine sites based on stacking ensemble deep learning |
title_sort | emdl_m6am: identifying n6,2′-o-dimethyladenosine sites based on stacking ensemble deep learning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10598967/ https://www.ncbi.nlm.nih.gov/pubmed/37880673 http://dx.doi.org/10.1186/s12859-023-05543-2 |
work_keys_str_mv | AT jiajianhua emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning AT weizhangying emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning AT sunmingwei emdlm6amidentifyingn62odimethyladenosinesitesbasedonstackingensembledeeplearning |