Cargando…

Improving RNA Branching Predictions: Advances and Limitations

Minimum free energy prediction of RNA secondary structures is based on the Nearest Neighbor Thermodynamics Model. While such predictions are typically good, the accuracy can vary widely even for short sequences, and the branching thermodynamics are an important factor in this variance. Recently, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Poznanović, Svetlana, Wood, Carson, Cloer, Michael, Heitsch, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064352/
https://www.ncbi.nlm.nih.gov/pubmed/33805944
http://dx.doi.org/10.3390/genes12040469
_version_ 1783682116766138368
author Poznanović, Svetlana
Wood, Carson
Cloer, Michael
Heitsch, Christine
author_facet Poznanović, Svetlana
Wood, Carson
Cloer, Michael
Heitsch, Christine
author_sort Poznanović, Svetlana
collection PubMed
description Minimum free energy prediction of RNA secondary structures is based on the Nearest Neighbor Thermodynamics Model. While such predictions are typically good, the accuracy can vary widely even for short sequences, and the branching thermodynamics are an important factor in this variance. Recently, the simplest model for multiloop energetics—a linear function of the number of branches and unpaired nucleotides—was found to be the best. Subsequently, a parametric analysis demonstrated that per family accuracy can be improved by changing the weightings in this linear function. However, the extent of improvement was not known due to the ad hoc method used to find the new parameters. Here we develop a branch-and-bound algorithm that finds the set of optimal parameters with the highest average accuracy for a given set of sequences. Our analysis shows that the previous ad hoc parameters are nearly optimal for tRNA and 5S rRNA sequences on both training and testing sets. Moreover, cross-family improvement is possible but more difficult because competing parameter regions favor different families. The results also indicate that restricting the unpaired nucleotide penalty to small values is warranted. This reduction makes analyzing longer sequences using the present techniques more feasible.
format Online
Article
Text
id pubmed-8064352
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80643522021-04-24 Improving RNA Branching Predictions: Advances and Limitations Poznanović, Svetlana Wood, Carson Cloer, Michael Heitsch, Christine Genes (Basel) Article Minimum free energy prediction of RNA secondary structures is based on the Nearest Neighbor Thermodynamics Model. While such predictions are typically good, the accuracy can vary widely even for short sequences, and the branching thermodynamics are an important factor in this variance. Recently, the simplest model for multiloop energetics—a linear function of the number of branches and unpaired nucleotides—was found to be the best. Subsequently, a parametric analysis demonstrated that per family accuracy can be improved by changing the weightings in this linear function. However, the extent of improvement was not known due to the ad hoc method used to find the new parameters. Here we develop a branch-and-bound algorithm that finds the set of optimal parameters with the highest average accuracy for a given set of sequences. Our analysis shows that the previous ad hoc parameters are nearly optimal for tRNA and 5S rRNA sequences on both training and testing sets. Moreover, cross-family improvement is possible but more difficult because competing parameter regions favor different families. The results also indicate that restricting the unpaired nucleotide penalty to small values is warranted. This reduction makes analyzing longer sequences using the present techniques more feasible. MDPI 2021-03-25 /pmc/articles/PMC8064352/ /pubmed/33805944 http://dx.doi.org/10.3390/genes12040469 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Poznanović, Svetlana
Wood, Carson
Cloer, Michael
Heitsch, Christine
Improving RNA Branching Predictions: Advances and Limitations
title Improving RNA Branching Predictions: Advances and Limitations
title_full Improving RNA Branching Predictions: Advances and Limitations
title_fullStr Improving RNA Branching Predictions: Advances and Limitations
title_full_unstemmed Improving RNA Branching Predictions: Advances and Limitations
title_short Improving RNA Branching Predictions: Advances and Limitations
title_sort improving rna branching predictions: advances and limitations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8064352/
https://www.ncbi.nlm.nih.gov/pubmed/33805944
http://dx.doi.org/10.3390/genes12040469
work_keys_str_mv AT poznanovicsvetlana improvingrnabranchingpredictionsadvancesandlimitations
AT woodcarson improvingrnabranchingpredictionsadvancesandlimitations
AT cloermichael improvingrnabranchingpredictionsadvancesandlimitations
AT heitschchristine improvingrnabranchingpredictionsadvancesandlimitations