Cargando…

Query-Dependent Banding (QDB) for Faster RNA Similarity Searches

When searching sequence databases for RNAs, it is desirable to score both primary sequence and RNA secondary structure similarity. Covariance models (CMs) are probabilistic models well-suited for RNA similarity search applications. However, the computational complexity of CM dynamic programming alig...

Descripción completa

Detalles Bibliográficos
Autores principales: Nawrocki, Eric P, Eddy, Sean R
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847999/
https://www.ncbi.nlm.nih.gov/pubmed/17397253
http://dx.doi.org/10.1371/journal.pcbi.0030056
_version_ 1782132932545085440
author Nawrocki, Eric P
Eddy, Sean R
author_facet Nawrocki, Eric P
Eddy, Sean R
author_sort Nawrocki, Eric P
collection PubMed
description When searching sequence databases for RNAs, it is desirable to score both primary sequence and RNA secondary structure similarity. Covariance models (CMs) are probabilistic models well-suited for RNA similarity search applications. However, the computational complexity of CM dynamic programming alignment algorithms has limited their practical application. Here we describe an acceleration method called query-dependent banding (QDB), which uses the probabilistic query CM to precalculate regions of the dynamic programming lattice that have negligible probability, independently of the target database. We have implemented QDB in the freely available Infernal software package. QDB reduces the average case time complexity of CM alignment from LN (2.4) to LN (1.3) for a query RNA of N residues and a target database of L residues, resulting in a 4-fold speedup for typical RNA queries. Combined with other improvements to Infernal, including informative mixture Dirichlet priors on model parameters, benchmarks also show increased sensitivity and specificity resulting from improved parameterization.
format Text
id pubmed-1847999
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-18479992007-04-06 Query-Dependent Banding (QDB) for Faster RNA Similarity Searches Nawrocki, Eric P Eddy, Sean R PLoS Comput Biol Research Article When searching sequence databases for RNAs, it is desirable to score both primary sequence and RNA secondary structure similarity. Covariance models (CMs) are probabilistic models well-suited for RNA similarity search applications. However, the computational complexity of CM dynamic programming alignment algorithms has limited their practical application. Here we describe an acceleration method called query-dependent banding (QDB), which uses the probabilistic query CM to precalculate regions of the dynamic programming lattice that have negligible probability, independently of the target database. We have implemented QDB in the freely available Infernal software package. QDB reduces the average case time complexity of CM alignment from LN (2.4) to LN (1.3) for a query RNA of N residues and a target database of L residues, resulting in a 4-fold speedup for typical RNA queries. Combined with other improvements to Infernal, including informative mixture Dirichlet priors on model parameters, benchmarks also show increased sensitivity and specificity resulting from improved parameterization. Public Library of Science 2007-03 2007-03-30 /pmc/articles/PMC1847999/ /pubmed/17397253 http://dx.doi.org/10.1371/journal.pcbi.0030056 Text en © 2007 Nawrocki and Eddy. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Nawrocki, Eric P
Eddy, Sean R
Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title_full Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title_fullStr Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title_full_unstemmed Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title_short Query-Dependent Banding (QDB) for Faster RNA Similarity Searches
title_sort query-dependent banding (qdb) for faster rna similarity searches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1847999/
https://www.ncbi.nlm.nih.gov/pubmed/17397253
http://dx.doi.org/10.1371/journal.pcbi.0030056
work_keys_str_mv AT nawrockiericp querydependentbandingqdbforfasterrnasimilaritysearches
AT eddyseanr querydependentbandingqdbforfasterrnasimilaritysearches