Cargando…

Predicting A-to-I RNA Editing by Feature Selection and Random Forest

RNA editing is a post-transcriptional RNA process that provides RNA and protein complexity for regulating gene expression in eukaryotes. It is challenging to predict RNA editing by computational methods. In this study, we developed a novel method to predict RNA editing based on a random forest metho...

Descripción completa

Detalles Bibliográficos
Autores principales: Shu, Yang, Zhang, Ning, Kong, Xiangyin, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4206426/
https://www.ncbi.nlm.nih.gov/pubmed/25338210
http://dx.doi.org/10.1371/journal.pone.0110607
_version_ 1782340822452142080
author Shu, Yang
Zhang, Ning
Kong, Xiangyin
Huang, Tao
Cai, Yu-Dong
author_facet Shu, Yang
Zhang, Ning
Kong, Xiangyin
Huang, Tao
Cai, Yu-Dong
author_sort Shu, Yang
collection PubMed
description RNA editing is a post-transcriptional RNA process that provides RNA and protein complexity for regulating gene expression in eukaryotes. It is challenging to predict RNA editing by computational methods. In this study, we developed a novel method to predict RNA editing based on a random forest method. A careful feature selection procedure was performed based on the Maximum Relevance Minimum Redundancy (mRMR) and Incremental Feature Selection (IFS) algorithms. Eighteen optimal features were selected from the 77 features in our dataset and used to construct a final predictor. The accuracy and MCC (Matthews correlation coefficient) values for the training dataset were 0.866 and 0.742, respectively; for the testing dataset, the accuracy and MCC were 0.876 and 0.576, respectively. The performance was higher using 18 features than all 77, suggesting that a small feature set was sufficient to achieve accurate prediction. Analysis of the 18 features was performed and may shed light on the mechanism and dominant factors of RNA editing, providing a basis for future experimental validation.
format Online
Article
Text
id pubmed-4206426
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42064262014-10-27 Predicting A-to-I RNA Editing by Feature Selection and Random Forest Shu, Yang Zhang, Ning Kong, Xiangyin Huang, Tao Cai, Yu-Dong PLoS One Research Article RNA editing is a post-transcriptional RNA process that provides RNA and protein complexity for regulating gene expression in eukaryotes. It is challenging to predict RNA editing by computational methods. In this study, we developed a novel method to predict RNA editing based on a random forest method. A careful feature selection procedure was performed based on the Maximum Relevance Minimum Redundancy (mRMR) and Incremental Feature Selection (IFS) algorithms. Eighteen optimal features were selected from the 77 features in our dataset and used to construct a final predictor. The accuracy and MCC (Matthews correlation coefficient) values for the training dataset were 0.866 and 0.742, respectively; for the testing dataset, the accuracy and MCC were 0.876 and 0.576, respectively. The performance was higher using 18 features than all 77, suggesting that a small feature set was sufficient to achieve accurate prediction. Analysis of the 18 features was performed and may shed light on the mechanism and dominant factors of RNA editing, providing a basis for future experimental validation. Public Library of Science 2014-10-22 /pmc/articles/PMC4206426/ /pubmed/25338210 http://dx.doi.org/10.1371/journal.pone.0110607 Text en © 2014 Shu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Shu, Yang
Zhang, Ning
Kong, Xiangyin
Huang, Tao
Cai, Yu-Dong
Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title_full Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title_fullStr Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title_full_unstemmed Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title_short Predicting A-to-I RNA Editing by Feature Selection and Random Forest
title_sort predicting a-to-i rna editing by feature selection and random forest
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4206426/
https://www.ncbi.nlm.nih.gov/pubmed/25338210
http://dx.doi.org/10.1371/journal.pone.0110607
work_keys_str_mv AT shuyang predictingatoirnaeditingbyfeatureselectionandrandomforest
AT zhangning predictingatoirnaeditingbyfeatureselectionandrandomforest
AT kongxiangyin predictingatoirnaeditingbyfeatureselectionandrandomforest
AT huangtao predictingatoirnaeditingbyfeatureselectionandrandomforest
AT caiyudong predictingatoirnaeditingbyfeatureselectionandrandomforest