Cargando…
Accelerated probabilistic inference of RNA structure evolution
BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is face...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1090553/ https://www.ncbi.nlm.nih.gov/pubmed/15790387 http://dx.doi.org/10.1186/1471-2105-6-73 |
_version_ | 1782123883397120000 |
---|---|
author | Holmes, Ian |
author_facet | Holmes, Ian |
author_sort | Holmes, Ian |
collection | PubMed |
description | BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. RESULTS: We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. CONCLUSION: A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License. |
format | Text |
id | pubmed-1090553 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-10905532005-05-07 Accelerated probabilistic inference of RNA structure evolution Holmes, Ian BMC Bioinformatics Methodology Article BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. RESULTS: We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. CONCLUSION: A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License. BioMed Central 2005-03-24 /pmc/articles/PMC1090553/ /pubmed/15790387 http://dx.doi.org/10.1186/1471-2105-6-73 Text en Copyright © 2005 Holmes; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Holmes, Ian Accelerated probabilistic inference of RNA structure evolution |
title | Accelerated probabilistic inference of RNA structure evolution |
title_full | Accelerated probabilistic inference of RNA structure evolution |
title_fullStr | Accelerated probabilistic inference of RNA structure evolution |
title_full_unstemmed | Accelerated probabilistic inference of RNA structure evolution |
title_short | Accelerated probabilistic inference of RNA structure evolution |
title_sort | accelerated probabilistic inference of rna structure evolution |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1090553/ https://www.ncbi.nlm.nih.gov/pubmed/15790387 http://dx.doi.org/10.1186/1471-2105-6-73 |
work_keys_str_mv | AT holmesian acceleratedprobabilisticinferenceofrnastructureevolution |