Cargando…

BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach

N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yu, He, Ningning, Chen, Yu, Chen, Zhen, Li, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Ivyspring International Publisher 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6216033/
https://www.ncbi.nlm.nih.gov/pubmed/30416381
http://dx.doi.org/10.7150/ijbs.27819
_version_ 1783368264479408128
author Huang, Yu
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
author_facet Huang, Yu
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
author_sort Huang, Yu
collection PubMed
description N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A sites. The existing m(6)A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m(6)A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the Mammalia dataset that contains over fifty thousand m(6)A sites but inferior for the Saccharomyces cerevisiae dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m(6)A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m(6)A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp.
format Online
Article
Text
id pubmed-6216033
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Ivyspring International Publisher
record_format MEDLINE/PubMed
spelling pubmed-62160332018-11-09 BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach Huang, Yu He, Ningning Chen, Yu Chen, Zhen Li, Lei Int J Biol Sci Research Paper N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A sites. The existing m(6)A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m(6)A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the Mammalia dataset that contains over fifty thousand m(6)A sites but inferior for the Saccharomyces cerevisiae dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m(6)A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m(6)A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp. Ivyspring International Publisher 2018-09-07 /pmc/articles/PMC6216033/ /pubmed/30416381 http://dx.doi.org/10.7150/ijbs.27819 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions.
spellingShingle Research Paper
Huang, Yu
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title_full BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title_fullStr BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title_full_unstemmed BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title_short BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
title_sort bermp: a cross-species classifier for predicting m(6)a sites by integrating a deep learning algorithm and a random forest approach
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6216033/
https://www.ncbi.nlm.nih.gov/pubmed/30416381
http://dx.doi.org/10.7150/ijbs.27819
work_keys_str_mv AT huangyu bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach
AT heningning bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach
AT chenyu bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach
AT chenzhen bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach
AT lilei bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach