Cargando…

Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data

Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is op...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Yang, Shi, Binbin, Ding, Xinqiang, Liu, Tong, Hu, Xihao, Yip, Kevin Y., Yang, Zheng Rong, Mathews, David H., Lu, Zhi John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551937/
https://www.ncbi.nlm.nih.gov/pubmed/26170232
http://dx.doi.org/10.1093/nar/gkv706
_version_ 1782387650502590464
author Wu, Yang
Shi, Binbin
Ding, Xinqiang
Liu, Tong
Hu, Xihao
Yip, Kevin Y.
Yang, Zheng Rong
Mathews, David H.
Lu, Zhi John
author_facet Wu, Yang
Shi, Binbin
Ding, Xinqiang
Liu, Tong
Hu, Xihao
Yip, Kevin Y.
Yang, Zheng Rong
Mathews, David H.
Lu, Zhi John
author_sort Wu, Yang
collection PubMed
description Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than SeqFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data.
format Online
Article
Text
id pubmed-4551937
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45519372015-08-28 Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data Wu, Yang Shi, Binbin Ding, Xinqiang Liu, Tong Hu, Xihao Yip, Kevin Y. Yang, Zheng Rong Mathews, David H. Lu, Zhi John Nucleic Acids Res Computational Biology Recently, several experimental techniques have emerged for probing RNA structures based on high-throughput sequencing. However, most secondary structure prediction tools that incorporate probing data are designed and optimized for particular types of experiments. For example, RNAstructure-Fold is optimized for SHAPE data, while SeqFold is optimized for PARS data. Here, we report a new RNA secondary structure prediction method, restrained MaxExpect (RME), which can incorporate multiple types of experimental probing data and is based on a free energy model and an MEA (maximizing expected accuracy) algorithm. We first demonstrated that RME substantially improved secondary structure prediction with perfect restraints (base pair information of known structures). Next, we collected structure-probing data from diverse experiments (e.g. SHAPE, PARS and DMS-seq) and transformed them into a unified set of pairing probabilities with a posterior probabilistic model. By using the probability scores as restraints in RME, we compared its secondary structure prediction performance with two other well-known tools, RNAstructure-Fold (based on a free energy minimization algorithm) and SeqFold (based on a sampling algorithm). For SHAPE data, RME and RNAstructure-Fold performed better than SeqFold, because they markedly altered the energy model with the experimental restraints. For high-throughput data (e.g. PARS and DMS-seq) with lower probing efficiency, the secondary structure prediction performances of the tested tools were comparable, with performance improvements for only a portion of the tested RNAs. However, when the effects of tertiary structure and protein interactions were removed, RME showed the highest prediction accuracy in the DMS-accessible regions by incorporating in vivo DMS-seq data. Oxford University Press 2015-09-03 2015-07-13 /pmc/articles/PMC4551937/ /pubmed/26170232 http://dx.doi.org/10.1093/nar/gkv706 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Wu, Yang
Shi, Binbin
Ding, Xinqiang
Liu, Tong
Hu, Xihao
Yip, Kevin Y.
Yang, Zheng Rong
Mathews, David H.
Lu, Zhi John
Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title_full Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title_fullStr Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title_full_unstemmed Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title_short Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data
title_sort improved prediction of rna secondary structure by integrating the free energy model with restraints derived from experimental probing data
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551937/
https://www.ncbi.nlm.nih.gov/pubmed/26170232
http://dx.doi.org/10.1093/nar/gkv706
work_keys_str_mv AT wuyang improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT shibinbin improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT dingxinqiang improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT liutong improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT huxihao improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT yipkeviny improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT yangzhengrong improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT mathewsdavidh improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata
AT luzhijohn improvedpredictionofrnasecondarystructurebyintegratingthefreeenergymodelwithrestraintsderivedfromexperimentalprobingdata