Cargando…

Research on folding diversity in statistical learning methods for RNA secondary structure prediction

How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between f...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yu, Xie, ZhaoYang, Li, YiZhou, Zhu, Min, Chen, Yi-Ping Phoebe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Ivyspring International Publisher 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036747/
https://www.ncbi.nlm.nih.gov/pubmed/29989089
http://dx.doi.org/10.7150/ijbs.24595
_version_ 1783338213734088704
author Zhu, Yu
Xie, ZhaoYang
Li, YiZhou
Zhu, Min
Chen, Yi-Ping Phoebe
author_facet Zhu, Yu
Xie, ZhaoYang
Li, YiZhou
Zhu, Min
Chen, Yi-Ping Phoebe
author_sort Zhu, Yu
collection PubMed
description How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between folding diversity and prediction accuracy, and puts forward a new method to improve the prediction accuracy of RNA secondary structure. Our research investigates the following: 1. The folding feature based on stochastic context-free grammar is proposed. By using dimension reduction and clustering techniques, some public data sets are analyzed. The results show that there is significant folding diversity among different RNA families. 2. To assign folding rules to RNAs without structural information, a classification method based on production probability is proposed. The experimental results show that the classification method proposed in this paper can effectively classify the RNAs of unknown structure. 3. Based on the existing prediction methods of statistical learning models, an RNA secondary structure prediction framework is proposed, namely “Cluster - Training - Parameter Selection - Prediction”. The results show that, with information on folding diversity, prediction accuracy can be significantly improved.
format Online
Article
Text
id pubmed-6036747
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Ivyspring International Publisher
record_format MEDLINE/PubMed
spelling pubmed-60367472018-07-09 Research on folding diversity in statistical learning methods for RNA secondary structure prediction Zhu, Yu Xie, ZhaoYang Li, YiZhou Zhu, Min Chen, Yi-Ping Phoebe Int J Biol Sci Research Paper How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between folding diversity and prediction accuracy, and puts forward a new method to improve the prediction accuracy of RNA secondary structure. Our research investigates the following: 1. The folding feature based on stochastic context-free grammar is proposed. By using dimension reduction and clustering techniques, some public data sets are analyzed. The results show that there is significant folding diversity among different RNA families. 2. To assign folding rules to RNAs without structural information, a classification method based on production probability is proposed. The experimental results show that the classification method proposed in this paper can effectively classify the RNAs of unknown structure. 3. Based on the existing prediction methods of statistical learning models, an RNA secondary structure prediction framework is proposed, namely “Cluster - Training - Parameter Selection - Prediction”. The results show that, with information on folding diversity, prediction accuracy can be significantly improved. Ivyspring International Publisher 2018-05-22 /pmc/articles/PMC6036747/ /pubmed/29989089 http://dx.doi.org/10.7150/ijbs.24595 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions.
spellingShingle Research Paper
Zhu, Yu
Xie, ZhaoYang
Li, YiZhou
Zhu, Min
Chen, Yi-Ping Phoebe
Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title_full Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title_fullStr Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title_full_unstemmed Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title_short Research on folding diversity in statistical learning methods for RNA secondary structure prediction
title_sort research on folding diversity in statistical learning methods for rna secondary structure prediction
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036747/
https://www.ncbi.nlm.nih.gov/pubmed/29989089
http://dx.doi.org/10.7150/ijbs.24595
work_keys_str_mv AT zhuyu researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction
AT xiezhaoyang researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction
AT liyizhou researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction
AT zhumin researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction
AT chenyipingphoebe researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction