Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments


Bibliographic Details
Main authors: Seemann, Stefan E.; Gorodkin, Jan; Backofen, Rolf
Format: Text
Language: English
Published: Oxford University Press 2008
Subjects: Computational Biology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2582601/
https://www.ncbi.nlm.nih.gov/pubmed/18836192
http://dx.doi.org/10.1093/nar/gkn544
Collection: PubMed
Record ID: pubmed-2582601
Institution: National Center for Biotechnology Information
Record format: MEDLINE/PubMed
Description: Computational methods for determining the secondary structure of RNA sequences from given alignments are currently either based on thermodynamic folding, compensatory base pair substitutions or both. However, there is currently no approach that combines both sources of information in a single optimization problem. Here, we present a model that formally integrates both the energy-based and evolution-based approaches to predict the folding of multiple aligned RNA sequences. We have implemented an extended version of Pfold that identifies base pairs that have high probabilities of being conserved and of being energetically favorable. The consensus structure is predicted using a maximum expected accuracy scoring scheme to smoothen the effect of incorrectly predicted base pairs. Parameter tuning revealed that the probability of base pairing has a higher impact on the RNA structure prediction than the corresponding probability of being single stranded. Furthermore, we found that structurally conserved RNA motifs are mostly supported by folding energies. Other problems (e.g. RNA-folding kinetics) may also benefit from employing the principles of the model we introduce. Our implementation, PETfold, was tested on a set of 46 well-curated Rfam families and its performance compared favorably to that of Pfold and RNAalifold.
Journal: Nucleic Acids Res
Dates in record: 2008-11-13; 2008-11; 2008-10-04
License: © 2008 The Author(s). This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.