Cargando…

DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences

N6,2′-O-dimethyladenosine (m(6)Am) is a post-transcriptional modification that may be associated with regulatory roles in the control of cellular functions. Therefore, it is crucial to accurately identify transcriptome-wide m(6)Am sites to understand underlying m(6)Am-dependent mRNA regulation mecha...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Zhengtao, Su, Wei, Lou, Liliang, Qiu, Wangren, Xiao, Xuan, Xu, Zhaochun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570463/
https://www.ncbi.nlm.nih.gov/pubmed/36232325
http://dx.doi.org/10.3390/ijms231911026
_version_ 1784810112960954368
author Luo, Zhengtao
Su, Wei
Lou, Liliang
Qiu, Wangren
Xiao, Xuan
Xu, Zhaochun
author_facet Luo, Zhengtao
Su, Wei
Lou, Liliang
Qiu, Wangren
Xiao, Xuan
Xu, Zhaochun
author_sort Luo, Zhengtao
collection PubMed
description N6,2′-O-dimethyladenosine (m(6)Am) is a post-transcriptional modification that may be associated with regulatory roles in the control of cellular functions. Therefore, it is crucial to accurately identify transcriptome-wide m(6)Am sites to understand underlying m(6)Am-dependent mRNA regulation mechanisms and biological functions. Here, we used three sequence-based feature-encoding schemes, including one-hot, nucleotide chemical property (NCP), and nucleotide density (ND), to represent RNA sequence samples. Additionally, we proposed an ensemble deep learning framework, named DLm6Am, to identify m(6)Am sites. DLm6Am consists of three similar base classifiers, each of which contains a multi-head attention module, an embedding module with two parallel deep learning sub-modules, a convolutional neural network (CNN) and a Bi-directional long short-term memory (BiLSTM), and a prediction module. To demonstrate the superior performance of our model’s architecture, we compared multiple model frameworks with our method by analyzing the training data and independent testing data. Additionally, we compared our model with the existing state-of-the-art computational methods, m6AmPred and MultiRM. The accuracy (ACC) for the DLm6Am model was improved by 6.45% and 8.42% compared to that of m6AmPred and MultiRM on independent testing data, respectively, while the area under receiver operating characteristic curve (AUROC) for the DLm6Am model was increased by 4.28% and 5.75%, respectively. All the results indicate that DLm6Am achieved the best prediction performance in terms of ACC, Matthews correlation coefficient (MCC), AUROC, and the area under precision and recall curves (AUPR). To further assess the generalization performance of our proposed model, we implemented chromosome-level leave-out cross-validation, and found that the obtained AUROC values were greater than 0.83, indicating that our proposed method is robust and can accurately predict m(6)Am sites.
format Online
Article
Text
id pubmed-9570463
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95704632022-10-17 DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences Luo, Zhengtao Su, Wei Lou, Liliang Qiu, Wangren Xiao, Xuan Xu, Zhaochun Int J Mol Sci Article N6,2′-O-dimethyladenosine (m(6)Am) is a post-transcriptional modification that may be associated with regulatory roles in the control of cellular functions. Therefore, it is crucial to accurately identify transcriptome-wide m(6)Am sites to understand underlying m(6)Am-dependent mRNA regulation mechanisms and biological functions. Here, we used three sequence-based feature-encoding schemes, including one-hot, nucleotide chemical property (NCP), and nucleotide density (ND), to represent RNA sequence samples. Additionally, we proposed an ensemble deep learning framework, named DLm6Am, to identify m(6)Am sites. DLm6Am consists of three similar base classifiers, each of which contains a multi-head attention module, an embedding module with two parallel deep learning sub-modules, a convolutional neural network (CNN) and a Bi-directional long short-term memory (BiLSTM), and a prediction module. To demonstrate the superior performance of our model’s architecture, we compared multiple model frameworks with our method by analyzing the training data and independent testing data. Additionally, we compared our model with the existing state-of-the-art computational methods, m6AmPred and MultiRM. The accuracy (ACC) for the DLm6Am model was improved by 6.45% and 8.42% compared to that of m6AmPred and MultiRM on independent testing data, respectively, while the area under receiver operating characteristic curve (AUROC) for the DLm6Am model was increased by 4.28% and 5.75%, respectively. All the results indicate that DLm6Am achieved the best prediction performance in terms of ACC, Matthews correlation coefficient (MCC), AUROC, and the area under precision and recall curves (AUPR). To further assess the generalization performance of our proposed model, we implemented chromosome-level leave-out cross-validation, and found that the obtained AUROC values were greater than 0.83, indicating that our proposed method is robust and can accurately predict m(6)Am sites. MDPI 2022-09-20 /pmc/articles/PMC9570463/ /pubmed/36232325 http://dx.doi.org/10.3390/ijms231911026 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Luo, Zhengtao
Su, Wei
Lou, Liliang
Qiu, Wangren
Xiao, Xuan
Xu, Zhaochun
DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title_full DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title_fullStr DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title_full_unstemmed DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title_short DLm6Am: A Deep-Learning-Based Tool for Identifying N6,2′-O-Dimethyladenosine Sites in RNA Sequences
title_sort dlm6am: a deep-learning-based tool for identifying n6,2′-o-dimethyladenosine sites in rna sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570463/
https://www.ncbi.nlm.nih.gov/pubmed/36232325
http://dx.doi.org/10.3390/ijms231911026
work_keys_str_mv AT luozhengtao dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences
AT suwei dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences
AT louliliang dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences
AT qiuwangren dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences
AT xiaoxuan dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences
AT xuzhaochun dlm6amadeeplearningbasedtoolforidentifyingn62odimethyladenosinesitesinrnasequences