Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments
Computational methods for determining the secondary structure of RNA sequences from given alignments are currently either based on thermodynamic folding, compensatory base pair substitutions or both. However, there is currently no approach that combines both sources of information in a single optimi...
Main authors: | Seemann, Stefan E.; Gorodkin, Jan; Backofen, Rolf |
---|---|
Format: | Text |
Language: | English |
Published: | Oxford University Press, 2008 |
Subjects: | Computational Biology |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2582601/ https://www.ncbi.nlm.nih.gov/pubmed/18836192 http://dx.doi.org/10.1093/nar/gkn544 |
_version_ | 1782160680693006336 |
---|---|
author | Seemann, Stefan E. Gorodkin, Jan Backofen, Rolf |
author_facet | Seemann, Stefan E. Gorodkin, Jan Backofen, Rolf |
author_sort | Seemann, Stefan E. |
collection | PubMed |
description | Computational methods for determining the secondary structure of RNA sequences from given alignments are currently either based on thermodynamic folding, compensatory base pair substitutions or both. However, there is currently no approach that combines both sources of information in a single optimization problem. Here, we present a model that formally integrates both the energy-based and evolution-based approaches to predict the folding of multiple aligned RNA sequences. We have implemented an extended version of Pfold that identifies base pairs that have high probabilities of being conserved and of being energetically favorable. The consensus structure is predicted using a maximum expected accuracy scoring scheme to smoothen the effect of incorrectly predicted base pairs. Parameter tuning revealed that the probability of base pairing has a higher impact on the RNA structure prediction than the corresponding probability of being single stranded. Furthermore, we found that structurally conserved RNA motifs are mostly supported by folding energies. Other problems (e.g. RNA-folding kinetics) may also benefit from employing the principles of the model we introduce. Our implementation, PETfold, was tested on a set of 46 well-curated Rfam families and its performance compared favorably to that of Pfold and RNAalifold. |
format | Text |
id | pubmed-2582601 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-25826012008-11-13 Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments Seemann, Stefan E. Gorodkin, Jan Backofen, Rolf Nucleic Acids Res Computational Biology Computational methods for determining the secondary structure of RNA sequences from given alignments are currently either based on thermodynamic folding, compensatory base pair substitutions or both. However, there is currently no approach that combines both sources of information in a single optimization problem. Here, we present a model that formally integrates both the energy-based and evolution-based approaches to predict the folding of multiple aligned RNA sequences. We have implemented an extended version of Pfold that identifies base pairs that have high probabilities of being conserved and of being energetically favorable. The consensus structure is predicted using a maximum expected accuracy scoring scheme to smoothen the effect of incorrectly predicted base pairs. Parameter tuning revealed that the probability of base pairing has a higher impact on the RNA structure prediction than the corresponding probability of being single stranded. Furthermore, we found that structurally conserved RNA motifs are mostly supported by folding energies. Other problems (e.g. RNA-folding kinetics) may also benefit from employing the principles of the model we introduce. Our implementation, PETfold, was tested on a set of 46 well-curated Rfam families and its performance compared favorably to that of Pfold and RNAalifold. 
Oxford University Press 2008-11 2008-10-04 /pmc/articles/PMC2582601/ /pubmed/18836192 http://dx.doi.org/10.1093/nar/gkn544 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Seemann, Stefan E. Gorodkin, Jan Backofen, Rolf Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title | Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title_full | Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title_fullStr | Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title_full_unstemmed | Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title_short | Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments |
title_sort | unifying evolutionary and thermodynamic information for rna folding of multiple alignments |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2582601/ https://www.ncbi.nlm.nih.gov/pubmed/18836192 http://dx.doi.org/10.1093/nar/gkn544 |
work_keys_str_mv | AT seemannstefane unifyingevolutionaryandthermodynamicinformationforrnafoldingofmultiplealignments AT gorodkinjan unifyingevolutionaryandthermodynamicinformationforrnafoldingofmultiplealignments AT backofenrolf unifyingevolutionaryandthermodynamicinformationforrnafoldingofmultiplealignments |