Cargando…

M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning

N6-methyladenosine (m(6)A) modification is the most abundant RNA methylation modification and involves various biological processes, such as RNA splicing and degradation. Recent studies have demonstrated the feasibility of identifying m(6)A peaks using high-throughput sequencing techniques. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Leyi, Chen, Huangrong, Su, Ran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6082921/
https://www.ncbi.nlm.nih.gov/pubmed/30081234
http://dx.doi.org/10.1016/j.omtn.2018.07.004
_version_ 1783345874060967936
author Wei, Leyi
Chen, Huangrong
Su, Ran
author_facet Wei, Leyi
Chen, Huangrong
Su, Ran
author_sort Wei, Leyi
collection PubMed
description N6-methyladenosine (m(6)A) modification is the most abundant RNA methylation modification and involves various biological processes, such as RNA splicing and degradation. Recent studies have demonstrated the feasibility of identifying m(6)A peaks using high-throughput sequencing techniques. However, such techniques cannot accurately identify specific methylated sites, which is important for a better understanding of m(6)A functions. In this study, we develop a novel machine learning-based predictor called M6APred-EL for the identification of m(6)A sites. To predict m(6)A sites accurately within genomic sequences, we trained an ensemble of three support vector machine classifiers that explore the position-specific information and physical chemical information from position-specific k-mer nucleotide propensity, physical-chemical properties, and ring-function-hydrogen-chemical properties. We examined and compared the performance of our predictor with other state-of-the-art methods of benchmarking datasets. Comparative results showed that the proposed M6APred-EL performed more accurately for m(6)A site identification. Moreover, a user-friendly web server that implements the proposed M6APred-EL is well established and is currently available at http://server.malab.cn/M6APred-EL/. It is expected to be a practical and effective tool for the investigation of m(6)A functional mechanisms.
format Online
Article
Text
id pubmed-6082921
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-60829212018-08-10 M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning Wei, Leyi Chen, Huangrong Su, Ran Mol Ther Nucleic Acids Article N6-methyladenosine (m(6)A) modification is the most abundant RNA methylation modification and involves various biological processes, such as RNA splicing and degradation. Recent studies have demonstrated the feasibility of identifying m(6)A peaks using high-throughput sequencing techniques. However, such techniques cannot accurately identify specific methylated sites, which is important for a better understanding of m(6)A functions. In this study, we develop a novel machine learning-based predictor called M6APred-EL for the identification of m(6)A sites. To predict m(6)A sites accurately within genomic sequences, we trained an ensemble of three support vector machine classifiers that explore the position-specific information and physical chemical information from position-specific k-mer nucleotide propensity, physical-chemical properties, and ring-function-hydrogen-chemical properties. We examined and compared the performance of our predictor with other state-of-the-art methods of benchmarking datasets. Comparative results showed that the proposed M6APred-EL performed more accurately for m(6)A site identification. Moreover, a user-friendly web server that implements the proposed M6APred-EL is well established and is currently available at http://server.malab.cn/M6APred-EL/. It is expected to be a practical and effective tool for the investigation of m(6)A functional mechanisms. American Society of Gene & Cell Therapy 2018-07-09 /pmc/articles/PMC6082921/ /pubmed/30081234 http://dx.doi.org/10.1016/j.omtn.2018.07.004 Text en © 2018 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Wei, Leyi
Chen, Huangrong
Su, Ran
M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title_full M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title_fullStr M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title_full_unstemmed M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title_short M6APred-EL: A Sequence-Based Predictor for Identifying N6-methyladenosine Sites Using Ensemble Learning
title_sort m6apred-el: a sequence-based predictor for identifying n6-methyladenosine sites using ensemble learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6082921/
https://www.ncbi.nlm.nih.gov/pubmed/30081234
http://dx.doi.org/10.1016/j.omtn.2018.07.004
work_keys_str_mv AT weileyi m6apredelasequencebasedpredictorforidentifyingn6methyladenosinesitesusingensemblelearning
AT chenhuangrong m6apredelasequencebasedpredictorforidentifyingn6methyladenosinesitesusingensemblelearning
AT suran m6apredelasequencebasedpredictorforidentifyingn6methyladenosinesitesusingensemblelearning