Cargando…
Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is op...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551937/ https://www.ncbi.nlm.nih.gov/pubmed/26170232 http://dx.doi.org/10.1093/nar/gkv706 |
_version_ | 1782387650502590464 |
---|---|
author | Wu, Yang Shi, Binbin Ding, Xinqiang Liu, Tong Hu, Xihao Yip, Kevin Y. Yang, Zheng Rong Mathews, David H. Lu, Zhi John |
author_facet | Wu, Yang Shi, Binbin Ding, Xinqiang Liu, Tong Hu, Xihao Yip, Kevin Y. Yang, Zheng Rong Mathews, David H. Lu, Zhi John |
author_sort | Wu, Yang |
collection | PubMed |
description | Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than SeqFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data. |
format | Online Article Text |
id | pubmed-4551937 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45519372015-08-28 Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data Wu, Yang Shi, Binbin Ding, Xinqiang Liu, Tong Hu, Xihao Yip, Kevin Y. Yang, Zheng Rong Mathews, David H. Lu, Zhi John Nucleic Acids Res Computational Biology Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than SeqFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data. Oxford University Press 2015-09-03 2015-07-13 /pmc/articles/PMC4551937/ /pubmed/26170232 http://dx.doi.org/10.1093/nar/gkv706 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Computational Biology Wu, Yang Shi, Binbin Ding, Xinqiang Liu, Tong Hu, Xihao Yip, Kevin Y. Yang, Zheng Rong Mathews, David H. Lu, Zhi John Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title | Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title_full | Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title_fullStr | Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title_full_unstemmed | Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title_short | Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data |
title_sort | improved prediction of rna secondary structure by integrating the free energy model with restraints derived from experimental probing data |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551937/ https://www.ncbi.nlm.nih.gov/pubmed/26170232 http://dx.doi.org/10.1093/nar/gkv706 |
work_keys_str_mv | AT wuyang improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT shibinbin improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT dingxinqiang improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT liutong improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT huxihao improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT yipkeviny improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT yangzhengrong improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT mathewsdavidh improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata AT luzhijohn improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata |