Cargando…

Fast filtering for RNA homology search

Motivation: Homology search for RNAs can use secondary structure information to increase power by modeling base pairs, as in covariance models, but the resulting computational costs are high. Typical acceleration strategies rely on at least one filtering stage using sequence-only search. Results: He...

Descripción completa

Detalles Bibliográficos
Autores principales: Kolbe, Diana L., Eddy, Sean R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3208395/
https://www.ncbi.nlm.nih.gov/pubmed/21965818
http://dx.doi.org/10.1093/bioinformatics/btr545
Descripción
Sumario:Motivation: Homology search for RNAs can use secondary structure information to increase power by modeling base pairs, as in covariance models, but the resulting computational costs are high. Typical acceleration strategies rely on at least one filtering stage using sequence-only search. Results: Here we present the multi-segment CYK (MSCYK) filter, which implements a heuristic of ungapped structural alignment for RNA homology search. Compared to gapped alignment, this approximation has lower computation time requirements (O(N(4)) reduced to O(N(3))), and space requirements (O(N(3)) reduced to O(N(2))). A vector-parallel implementation of this method gives up to 100-fold speed-up; vector-parallel implementations of standard gapped alignment at two levels of precision give 3- and 6-fold speed-ups. These approaches are combined to create a filtering pipeline that scores RNA secondary structure at all stages, with results that are synergistic with existing methods. Availability: http://selab.janelia.org/publications.html#KolbeEddy11 Contact: eddys@janelia.hhmi.org