Cargando…

Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs

Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining th...

Descripción completa

Detalles Bibliográficos
Autores principales: Sloma, Michael F., Mathews, David H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5690697/
https://www.ncbi.nlm.nih.gov/pubmed/29107980
http://dx.doi.org/10.1371/journal.pcbi.1005827
_version_ 1783279660472205312
author Sloma, Michael F.
Mathews, David H.
author_facet Sloma, Michael F.
Mathews, David H.
author_sort Sloma, Michael F.
collection PubMed
description Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.
format Online
Article
Text
id pubmed-5690697
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56906972017-11-29 Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs Sloma, Michael F. Mathews, David H. PLoS Comput Biol Research Article Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package. Public Library of Science 2017-11-06 /pmc/articles/PMC5690697/ /pubmed/29107980 http://dx.doi.org/10.1371/journal.pcbi.1005827 Text en © 2017 Sloma, Mathews http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sloma, Michael F.
Mathews, David H.
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title_full Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title_fullStr Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title_full_unstemmed Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title_short Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs
title_sort base pair probability estimates improve the prediction accuracy of rna non-canonical base pairs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5690697/
https://www.ncbi.nlm.nih.gov/pubmed/29107980
http://dx.doi.org/10.1371/journal.pcbi.1005827
work_keys_str_mv AT slomamichaelf basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs
AT mathewsdavidh basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs