Cargando…

Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model

N6-methyladenosine (m6A), the most common posttranscriptional modification in eukaryotic mRNAs, plays an important role in mRNA splicing, editing, stability, degradation, etc. Since the methylation state is dynamic, methylation sequencing needs to be carried out over different time periods, which br...

Descripción completa

Detalles Bibliográficos
Autores principales: Pian, Cong, Yang, Zhixin, Yang, Yuqian, Zhang, Liangyun, Chen, Yuanyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017269/
https://www.ncbi.nlm.nih.gov/pubmed/33815484
http://dx.doi.org/10.3389/fgene.2021.650803
_version_ 1783674029000884224
author Pian, Cong
Yang, Zhixin
Yang, Yuqian
Zhang, Liangyun
Chen, Yuanyuan
author_facet Pian, Cong
Yang, Zhixin
Yang, Yuqian
Zhang, Liangyun
Chen, Yuanyuan
author_sort Pian, Cong
collection PubMed
description N6-methyladenosine (m6A), the most common posttranscriptional modification in eukaryotic mRNAs, plays an important role in mRNA splicing, editing, stability, degradation, etc. Since the methylation state is dynamic, methylation sequencing needs to be carried out over different time periods, which brings some difficulties to identify the RNA methyladenine sites. Thus, it is necessary to develop a fast and accurate method to identify the RNA N6-methyladenosine sites in the transcriptome. In this study, we use first-order and second-order Markov models to identify RNA N6-methyladenine sites in three species (Saccharomyces cerevisiae, mouse, and Homo sapiens). These two methods can fully consider the correlation between adjacent nucleotides. The results show that the performance of our method is better than that of other existing methods. Furthermore, the codons encoded by three nucleotides have biases in mRNA, and a second-order Markov model can capture this kind of information exactly. This may be the main reason why the performance of the second-order Markov model is better than that of the first-order Markov model in the m6A prediction problem. In addition, we provide a corresponding web tool called MM-m6APred.
format Online
Article
Text
id pubmed-8017269
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80172692021-04-03 Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model Pian, Cong Yang, Zhixin Yang, Yuqian Zhang, Liangyun Chen, Yuanyuan Front Genet Genetics N6-methyladenosine (m6A), the most common posttranscriptional modification in eukaryotic mRNAs, plays an important role in mRNA splicing, editing, stability, degradation, etc. Since the methylation state is dynamic, methylation sequencing needs to be carried out over different time periods, which brings some difficulties to identify the RNA methyladenine sites. Thus, it is necessary to develop a fast and accurate method to identify the RNA N6-methyladenosine sites in the transcriptome. In this study, we use first-order and second-order Markov models to identify RNA N6-methyladenine sites in three species (Saccharomyces cerevisiae, mouse, and Homo sapiens). These two methods can fully consider the correlation between adjacent nucleotides. The results show that the performance of our method is better than that of other existing methods. Furthermore, the codons encoded by three nucleotides have biases in mRNA, and a second-order Markov model can capture this kind of information exactly. This may be the main reason why the performance of the second-order Markov model is better than that of the first-order Markov model in the m6A prediction problem. In addition, we provide a corresponding web tool called MM-m6APred. Frontiers Media S.A. 2021-03-19 /pmc/articles/PMC8017269/ /pubmed/33815484 http://dx.doi.org/10.3389/fgene.2021.650803 Text en Copyright © 2021 Pian, Yang, Yang, Zhang and Chen. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Pian, Cong
Yang, Zhixin
Yang, Yuqian
Zhang, Liangyun
Chen, Yuanyuan
Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title_full Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title_fullStr Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title_full_unstemmed Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title_short Identifying RNA N6-Methyladenine Sites in Three Species Based on a Markov Model
title_sort identifying rna n6-methyladenine sites in three species based on a markov model
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8017269/
https://www.ncbi.nlm.nih.gov/pubmed/33815484
http://dx.doi.org/10.3389/fgene.2021.650803
work_keys_str_mv AT piancong identifyingrnan6methyladeninesitesinthreespeciesbasedonamarkovmodel
AT yangzhixin identifyingrnan6methyladeninesitesinthreespeciesbasedonamarkovmodel
AT yangyuqian identifyingrnan6methyladeninesitesinthreespeciesbasedonamarkovmodel
AT zhangliangyun identifyingrnan6methyladeninesitesinthreespeciesbasedonamarkovmodel
AT chenyuanyuan identifyingrnan6methyladeninesitesinthreespeciesbasedonamarkovmodel