Cargando…

Accelerated probabilistic inference of RNA structure evolution

BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is face...

Descripción completa

Detalles Bibliográficos
Autor principal: Holmes, Ian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1090553/
https://www.ncbi.nlm.nih.gov/pubmed/15790387
http://dx.doi.org/10.1186/1471-2105-6-73
_version_ 1782123883397120000
author Holmes, Ian
author_facet Holmes, Ian
author_sort Holmes, Ian
collection PubMed
description BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. RESULTS: We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. CONCLUSION: A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License.
format Text
id pubmed-1090553
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-10905532005-05-07 Accelerated probabilistic inference of RNA structure evolution Holmes, Ian BMC Bioinformatics Methodology Article BACKGROUND: Pairwise stochastic context-free grammars (Pair SCFGs) are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. RESULTS: We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs) is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. CONCLUSION: A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License. BioMed Central 2005-03-24 /pmc/articles/PMC1090553/ /pubmed/15790387 http://dx.doi.org/10.1186/1471-2105-6-73 Text en Copyright © 2005 Holmes; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Holmes, Ian
Accelerated probabilistic inference of RNA structure evolution
title Accelerated probabilistic inference of RNA structure evolution
title_full Accelerated probabilistic inference of RNA structure evolution
title_fullStr Accelerated probabilistic inference of RNA structure evolution
title_full_unstemmed Accelerated probabilistic inference of RNA structure evolution
title_short Accelerated probabilistic inference of RNA structure evolution
title_sort accelerated probabilistic inference of rna structure evolution
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1090553/
https://www.ncbi.nlm.nih.gov/pubmed/15790387
http://dx.doi.org/10.1186/1471-2105-6-73
work_keys_str_mv AT holmesian acceleratedprobabilisticinferenceofrnastructureevolution