Cargando…
BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach
N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Ivyspring International Publisher
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6216033/ https://www.ncbi.nlm.nih.gov/pubmed/30416381 http://dx.doi.org/10.7150/ijbs.27819 |
_version_ | 1783368264479408128 |
---|---|
author | Huang, Yu He, Ningning Chen, Yu Chen, Zhen Li, Lei |
author_facet | Huang, Yu He, Ningning Chen, Yu Chen, Zhen Li, Lei |
author_sort | Huang, Yu |
collection | PubMed |
description | N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A sites. The existing m(6)A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m(6)A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the Mammalia dataset that contains over fifty thousand m(6)A sites but inferior for the Saccharomyces cerevisiae dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m(6)A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m(6)A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp. |
format | Online Article Text |
id | pubmed-6216033 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Ivyspring International Publisher |
record_format | MEDLINE/PubMed |
spelling | pubmed-62160332018-11-09 BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach Huang, Yu He, Ningning Chen, Yu Chen, Zhen Li, Lei Int J Biol Sci Research Paper N(6)-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A sites. The existing m(6)A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m(6)A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the Mammalia dataset that contains over fifty thousand m(6)A sites but inferior for the Saccharomyces cerevisiae dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m(6)A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m(6)A sites with high confidence. This classifier is freely available at http://www.bioinfogo.org/bermp. Ivyspring International Publisher 2018-09-07 /pmc/articles/PMC6216033/ /pubmed/30416381 http://dx.doi.org/10.7150/ijbs.27819 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions. |
spellingShingle | Research Paper Huang, Yu He, Ningning Chen, Yu Chen, Zhen Li, Lei BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title | BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title_full | BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title_fullStr | BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title_full_unstemmed | BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title_short | BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach |
title_sort | bermp: a cross-species classifier for predicting m(6)a sites by integrating a deep learning algorithm and a random forest approach |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6216033/ https://www.ncbi.nlm.nih.gov/pubmed/30416381 http://dx.doi.org/10.7150/ijbs.27819 |
work_keys_str_mv | AT huangyu bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach AT heningning bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach AT chenyu bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach AT chenzhen bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach AT lilei bermpacrossspeciesclassifierforpredictingm6asitesbyintegratingadeeplearningalgorithmandarandomforestapproach |