Cargando…

On the Normalization of the Minimum Free Energy of RNAs by Sequence Length

The minimum free energy (MFE) of ribonucleic acids (RNAs) increases at an apparent linear rate with sequence length. Simple indices, obtained by dividing the MFE by the number of nucleotides, have been used for a direct comparison of the folding stability of RNAs of various sizes. Although this norm...

Descripción completa

Detalles Bibliográficos
Autor principal: Trotta, Edoardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236180/
https://www.ncbi.nlm.nih.gov/pubmed/25405875
http://dx.doi.org/10.1371/journal.pone.0113380
_version_ 1782345121853865984
author Trotta, Edoardo
author_facet Trotta, Edoardo
author_sort Trotta, Edoardo
collection PubMed
description The minimum free energy (MFE) of ribonucleic acids (RNAs) increases at an apparent linear rate with sequence length. Simple indices, obtained by dividing the MFE by the number of nucleotides, have been used for a direct comparison of the folding stability of RNAs of various sizes. Although this normalization procedure has been used in several studies, the relationship between normalized MFE and length has not yet been investigated in detail. Here, we demonstrate that the variation of MFE with sequence length is not linear and is significantly biased by the mathematical formula used for the normalization procedure. For this reason, the normalized MFEs strongly decrease as hyperbolic functions of length and produce unreliable results when applied for the comparison of sequences with different sizes. We also propose a simple modification of the normalization formula that corrects the bias enabling the use of the normalized MFE for RNAs longer than 40 nt. Using the new corrected normalized index, we analyzed the folding free energies of different human RNA families showing that most of them present an average MFE density more negative than expected for a typical genomic sequence. Furthermore, we found that a well-defined and restricted range of MFE density characterizes each RNA family, suggesting the use of our corrected normalized index to improve RNA prediction algorithms. Finally, in coding and functional human RNAs the MFE density appears scarcely correlated with sequence length, consistent with a negligible role of thermodynamic stability demands in determining RNA size.
format Online
Article
Text
id pubmed-4236180
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42361802014-11-21 On the Normalization of the Minimum Free Energy of RNAs by Sequence Length Trotta, Edoardo PLoS One Research Article The minimum free energy (MFE) of ribonucleic acids (RNAs) increases at an apparent linear rate with sequence length. Simple indices, obtained by dividing the MFE by the number of nucleotides, have been used for a direct comparison of the folding stability of RNAs of various sizes. Although this normalization procedure has been used in several studies, the relationship between normalized MFE and length has not yet been investigated in detail. Here, we demonstrate that the variation of MFE with sequence length is not linear and is significantly biased by the mathematical formula used for the normalization procedure. For this reason, the normalized MFEs strongly decrease as hyperbolic functions of length and produce unreliable results when applied for the comparison of sequences with different sizes. We also propose a simple modification of the normalization formula that corrects the bias enabling the use of the normalized MFE for RNAs longer than 40 nt. Using the new corrected normalized index, we analyzed the folding free energies of different human RNA families showing that most of them present an average MFE density more negative than expected for a typical genomic sequence. Furthermore, we found that a well-defined and restricted range of MFE density characterizes each RNA family, suggesting the use of our corrected normalized index to improve RNA prediction algorithms. Finally, in coding and functional human RNAs the MFE density appears scarcely correlated with sequence length, consistent with a negligible role of thermodynamic stability demands in determining RNA size. Public Library of Science 2014-11-18 /pmc/articles/PMC4236180/ /pubmed/25405875 http://dx.doi.org/10.1371/journal.pone.0113380 Text en © 2014 Edoardo Trotta http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Trotta, Edoardo
On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title_full On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title_fullStr On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title_full_unstemmed On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title_short On the Normalization of the Minimum Free Energy of RNAs by Sequence Length
title_sort on the normalization of the minimum free energy of rnas by sequence length
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236180/
https://www.ncbi.nlm.nih.gov/pubmed/25405875
http://dx.doi.org/10.1371/journal.pone.0113380
work_keys_str_mv AT trottaedoardo onthenormalizationoftheminimumfreeenergyofrnasbysequencelength