Cargando…

Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae

INTRODUCTION: N(6)-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computationa...

Descripción completa

Detalles Bibliográficos
Autores principales: Govindaraj, Rajiv G., Subramaniyam, Sathiyamoorthy, Manavalan, Balachandran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bentham Science Publishers 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324895/
https://www.ncbi.nlm.nih.gov/pubmed/32655295
http://dx.doi.org/10.2174/1389202921666200219125625
_version_ 1783552047405072384
author Govindaraj, Rajiv G.
Subramaniyam, Sathiyamoorthy
Manavalan, Balachandran
author_facet Govindaraj, Rajiv G.
Subramaniyam, Sathiyamoorthy
Manavalan, Balachandran
author_sort Govindaraj, Rajiv G.
collection PubMed
description INTRODUCTION: N(6)-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computational methods utilizing machine-learning algorithms have been proposed that accelerate in silico screening of m6A sites, thereby drastically reducing the experimental time and labor costs involved. METHODOLOGY: In this study, we proposed a novel computational predictor termed ERT-m6Apred, for the accurate prediction of m6A sites. To identify the feature encodings with more discriminative capability, we applied a two-step feature selection technique on seven different feature encodings and identified the corresponding optimal feature set. RESULTS: Subsequently, performance comparison of the corresponding optimal feature set-based extremely randomized tree model revealed that Pseudo k-tuple composition encoding, which includes 14 physicochemical properties significantly outperformed other encodings. Moreover, ERT-m6Apred achieved an accuracy of 78.84% during cross-validation analysis, which is comparatively better than recently reported predictors. CONCLUSION: In summary, ERT-m6Apred predicts Saccharomyces cerevisiae m6A sites with higher accuracy, thus facilitating biological hypothesis generation and experimental validations.
format Online
Article
Text
id pubmed-7324895
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Bentham Science Publishers
record_format MEDLINE/PubMed
spelling pubmed-73248952020-07-10 Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae Govindaraj, Rajiv G. Subramaniyam, Sathiyamoorthy Manavalan, Balachandran Curr Genomics Article INTRODUCTION: N(6)-methyladenosine (m6A) is one of the most common post-transcriptional modifications in RNA, which has been related to several biological processes. The accurate prediction of m6A sites from RNA sequences is one of the challenging tasks in computational biology. Several computational methods utilizing machine-learning algorithms have been proposed that accelerate in silico screening of m6A sites, thereby drastically reducing the experimental time and labor costs involved. METHODOLOGY: In this study, we proposed a novel computational predictor termed ERT-m6Apred, for the accurate prediction of m6A sites. To identify the feature encodings with more discriminative capability, we applied a two-step feature selection technique on seven different feature encodings and identified the corresponding optimal feature set. RESULTS: Subsequently, performance comparison of the corresponding optimal feature set-based extremely randomized tree model revealed that Pseudo k-tuple composition encoding, which includes 14 physicochemical properties significantly outperformed other encodings. Moreover, ERT-m6Apred achieved an accuracy of 78.84% during cross-validation analysis, which is comparatively better than recently reported predictors. CONCLUSION: In summary, ERT-m6Apred predicts Saccharomyces cerevisiae m6A sites with higher accuracy, thus facilitating biological hypothesis generation and experimental validations. Bentham Science Publishers 2020-01 2020-01 /pmc/articles/PMC7324895/ /pubmed/32655295 http://dx.doi.org/10.2174/1389202921666200219125625 Text en © 2020 Bentham Science Publishers https://creativecommons.org/licenses/by-nc/4.0/legalcode This is an open access article licensed under the terms of the Creative Commons Attribution-Non-Commercial 4.0 International Public License (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/legalcode), which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
spellingShingle Article
Govindaraj, Rajiv G.
Subramaniyam, Sathiyamoorthy
Manavalan, Balachandran
Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title_full Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title_fullStr Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title_full_unstemmed Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title_short Extremely-randomized-tree-based Prediction of N(6)-Methyladenosine Sites in Saccharomyces cerevisiae
title_sort extremely-randomized-tree-based prediction of n(6)-methyladenosine sites in saccharomyces cerevisiae
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324895/
https://www.ncbi.nlm.nih.gov/pubmed/32655295
http://dx.doi.org/10.2174/1389202921666200219125625
work_keys_str_mv AT govindarajrajivg extremelyrandomizedtreebasedpredictionofn6methyladenosinesitesinsaccharomycescerevisiae
AT subramaniyamsathiyamoorthy extremelyrandomizedtreebasedpredictionofn6methyladenosinesitesinsaccharomycescerevisiae
AT manavalanbalachandran extremelyrandomizedtreebasedpredictionofn6methyladenosinesitesinsaccharomycescerevisiae