Cargando…

Quantifying variances in comparative RNA secondary structure prediction

BACKGROUND: With the advancement of next-generation sequencing and transcriptomics technologies, regulatory effects involving RNA, in particular RNA structural changes are being detected. These results often rely on RNA secondary structure predictions. However, current approaches to RNA secondary st...

Descripción completa

Detalles Bibliográficos
Autores principales: Anderson, James WJ, Novák, Ádám, Sükösd, Zsuzsanna, Golden, Michael, Arunapuram, Preeti, Edvardsson, Ingolfur, Hein, Jotun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667108/
https://www.ncbi.nlm.nih.gov/pubmed/23634662
http://dx.doi.org/10.1186/1471-2105-14-149
_version_ 1782271446485041152
author Anderson, James WJ
Novák, Ádám
Sükösd, Zsuzsanna
Golden, Michael
Arunapuram, Preeti
Edvardsson, Ingolfur
Hein, Jotun
author_facet Anderson, James WJ
Novák, Ádám
Sükösd, Zsuzsanna
Golden, Michael
Arunapuram, Preeti
Edvardsson, Ingolfur
Hein, Jotun
author_sort Anderson, James WJ
collection PubMed
description BACKGROUND: With the advancement of next-generation sequencing and transcriptomics technologies, regulatory effects involving RNA, in particular RNA structural changes are being detected. These results often rely on RNA secondary structure predictions. However, current approaches to RNA secondary structure modelling produce predictions with a high variance in predictive accuracy, and we have little quantifiable knowledge about the reasons for these variances. RESULTS: In this paper we explore a number of factors which can contribute to poor RNA secondary structure prediction quality. We establish a quantified relationship between alignment quality and loss of accuracy. Furthermore, we define two new measures to quantify uncertainty in alignment-based structure predictions. One of the measures improves on the “reliability score” reported by PPfold, and considers alignment uncertainty as well as base-pair probabilities. The other measure considers the information entropy for SCFGs over a space of input alignments. CONCLUSIONS: Our predictive accuracy improves on the PPfold reliability score. We can successfully characterize many of the underlying reasons for and variances in poor prediction. However, there is still variability unaccounted for, which we therefore suggest comes from the RNA secondary structure predictive model itself.
format Online
Article
Text
id pubmed-3667108
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36671082013-06-05 Quantifying variances in comparative RNA secondary structure prediction Anderson, James WJ Novák, Ádám Sükösd, Zsuzsanna Golden, Michael Arunapuram, Preeti Edvardsson, Ingolfur Hein, Jotun BMC Bioinformatics Research Article BACKGROUND: With the advancement of next-generation sequencing and transcriptomics technologies, regulatory effects involving RNA, in particular RNA structural changes are being detected. These results often rely on RNA secondary structure predictions. However, current approaches to RNA secondary structure modelling produce predictions with a high variance in predictive accuracy, and we have little quantifiable knowledge about the reasons for these variances. RESULTS: In this paper we explore a number of factors which can contribute to poor RNA secondary structure prediction quality. We establish a quantified relationship between alignment quality and loss of accuracy. Furthermore, we define two new measures to quantify uncertainty in alignment-based structure predictions. One of the measures improves on the “reliability score” reported by PPfold, and considers alignment uncertainty as well as base-pair probabilities. The other measure considers the information entropy for SCFGs over a space of input alignments. CONCLUSIONS: Our predictive accuracy improves on the PPfold reliability score. We can successfully characterize many of the underlying reasons for and variances in poor prediction. However, there is still variability unaccounted for, which we therefore suggest comes from the RNA secondary structure predictive model itself. BioMed Central 2013-05-01 /pmc/articles/PMC3667108/ /pubmed/23634662 http://dx.doi.org/10.1186/1471-2105-14-149 Text en Copyright © 2013 Anderson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Anderson, James WJ
Novák, Ádám
Sükösd, Zsuzsanna
Golden, Michael
Arunapuram, Preeti
Edvardsson, Ingolfur
Hein, Jotun
Quantifying variances in comparative RNA secondary structure prediction
title Quantifying variances in comparative RNA secondary structure prediction
title_full Quantifying variances in comparative RNA secondary structure prediction
title_fullStr Quantifying variances in comparative RNA secondary structure prediction
title_full_unstemmed Quantifying variances in comparative RNA secondary structure prediction
title_short Quantifying variances in comparative RNA secondary structure prediction
title_sort quantifying variances in comparative rna secondary structure prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3667108/
https://www.ncbi.nlm.nih.gov/pubmed/23634662
http://dx.doi.org/10.1186/1471-2105-14-149
work_keys_str_mv AT andersonjameswj quantifyingvariancesincomparativernasecondarystructureprediction
AT novakadam quantifyingvariancesincomparativernasecondarystructureprediction
AT sukosdzsuzsanna quantifyingvariancesincomparativernasecondarystructureprediction
AT goldenmichael quantifyingvariancesincomparativernasecondarystructureprediction
AT arunapurampreeti quantifyingvariancesincomparativernasecondarystructureprediction
AT edvardssoningolfur quantifyingvariancesincomparativernasecondarystructureprediction
AT heinjotun quantifyingvariancesincomparativernasecondarystructureprediction