Cargando…
Research on folding diversity in statistical learning methods for RNA secondary structure prediction
How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between f...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Ivyspring International Publisher
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036747/ https://www.ncbi.nlm.nih.gov/pubmed/29989089 http://dx.doi.org/10.7150/ijbs.24595 |
_version_ | 1783338213734088704 |
---|---|
author | Zhu, Yu Xie, ZhaoYang Li, YiZhou Zhu, Min Chen, Yi-Ping Phoebe |
author_facet | Zhu, Yu Xie, ZhaoYang Li, YiZhou Zhu, Min Chen, Yi-Ping Phoebe |
author_sort | Zhu, Yu |
collection | PubMed |
description | How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between folding diversity and prediction accuracy, and puts forward a new method to improve the prediction accuracy of RNA secondary structure. Our research investigates the following: 1. The folding feature based on stochastic context-free grammar is proposed. By using dimension reduction and clustering techniques, some public data sets are analyzed. The results show that there is significant folding diversity among different RNA families. 2. To assign folding rules to RNAs without structural information, a classification method based on production probability is proposed. The experimental results show that the classification method proposed in this paper can effectively classify the RNAs of unknown structure. 3. Based on the existing prediction methods of statistical learning models, an RNA secondary structure prediction framework is proposed, namely “Cluster - Training - Parameter Selection - Prediction”. The results show that, with information on folding diversity, prediction accuracy can be significantly improved. |
format | Online Article Text |
id | pubmed-6036747 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Ivyspring International Publisher |
record_format | MEDLINE/PubMed |
spelling | pubmed-60367472018-07-09 Research on folding diversity in statistical learning methods for RNA secondary structure prediction Zhu, Yu Xie, ZhaoYang Li, YiZhou Zhu, Min Chen, Yi-Ping Phoebe Int J Biol Sci Research Paper How to improve the prediction accuracy of RNA secondary structure is currently a hot topic. The existing prediction methods for a single sequence do not fully consider the folding diversity which may occur among RNAs with different functions or sources. This paper explores the relationship between folding diversity and prediction accuracy, and puts forward a new method to improve the prediction accuracy of RNA secondary structure. Our research investigates the following: 1. The folding feature based on stochastic context-free grammar is proposed. By using dimension reduction and clustering techniques, some public data sets are analyzed. The results show that there is significant folding diversity among different RNA families. 2. To assign folding rules to RNAs without structural information, a classification method based on production probability is proposed. The experimental results show that the classification method proposed in this paper can effectively classify the RNAs of unknown structure. 3. Based on the existing prediction methods of statistical learning models, an RNA secondary structure prediction framework is proposed, namely “Cluster - Training - Parameter Selection - Prediction”. The results show that, with information on folding diversity, prediction accuracy can be significantly improved. Ivyspring International Publisher 2018-05-22 /pmc/articles/PMC6036747/ /pubmed/29989089 http://dx.doi.org/10.7150/ijbs.24595 Text en © Ivyspring International Publisher This is an open access article distributed under the terms of the Creative Commons Attribution (CC BY-NC) license (https://creativecommons.org/licenses/by-nc/4.0/). See http://ivyspring.com/terms for full terms and conditions. |
spellingShingle | Research Paper Zhu, Yu Xie, ZhaoYang Li, YiZhou Zhu, Min Chen, Yi-Ping Phoebe Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title | Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title_full | Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title_fullStr | Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title_full_unstemmed | Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title_short | Research on folding diversity in statistical learning methods for RNA secondary structure prediction |
title_sort | research on folding diversity in statistical learning methods for rna secondary structure prediction |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6036747/ https://www.ncbi.nlm.nih.gov/pubmed/29989089 http://dx.doi.org/10.7150/ijbs.24595 |
work_keys_str_mv | AT zhuyu researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction AT xiezhaoyang researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction AT liyizhou researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction AT zhumin researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction AT chenyipingphoebe researchonfoldingdiversityinstatisticallearningmethodsforrnasecondarystructureprediction |