Cargando…

Computing expectation values for RNA motifs using discrete convolutions

BACKGROUND: Computational biologists use Expectation values (E-values) to estimate the number of solutions that can be expected by chance during a database scan. Here we focus on computing Expectation values for RNA motifs defined by single-strand and helix lod-score profiles with variable helix spa...

Descripción completa

Detalles Bibliográficos
Autores principales: Lambert, André, Legendre, Matthieu, Fontaine, Jean-Fred, Gautheret, Daniel
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1168889/
https://www.ncbi.nlm.nih.gov/pubmed/15892887
http://dx.doi.org/10.1186/1471-2105-6-118
_version_ 1782124437092433920
author Lambert, André
Legendre, Matthieu
Fontaine, Jean-Fred
Gautheret, Daniel
author_facet Lambert, André
Legendre, Matthieu
Fontaine, Jean-Fred
Gautheret, Daniel
author_sort Lambert, André
collection PubMed
description BACKGROUND: Computational biologists use Expectation values (E-values) to estimate the number of solutions that can be expected by chance during a database scan. Here we focus on computing Expectation values for RNA motifs defined by single-strand and helix lod-score profiles with variable helix spans. Such E-values cannot be computed assuming a normal score distribution and their estimation previously required lengthy simulations. RESULTS: We introduce discrete convolutions as an accurate and fast mean to estimate score distributions of lod-score profiles. This method provides excellent score estimations for all single-strand or helical elements tested and also applies to the combination of elements into larger, complex, motifs. Further, the estimated distributions remain accurate even when pseudocounts are introduced into the lod-score profiles. Estimated score distributions are then easily converted into E-values. CONCLUSION: A good agreement was observed between computed E-values and simulations for a number of complete RNA motifs. This method is now implemented into the ERPIN software, but it can be applied as well to any search procedure based on ungapped profiles with statistically independent columns.
format Text
id pubmed-1168889
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11688892005-07-02 Computing expectation values for RNA motifs using discrete convolutions Lambert, André Legendre, Matthieu Fontaine, Jean-Fred Gautheret, Daniel BMC Bioinformatics Methodology Article BACKGROUND: Computational biologists use Expectation values (E-values) to estimate the number of solutions that can be expected by chance during a database scan. Here we focus on computing Expectation values for RNA motifs defined by single-strand and helix lod-score profiles with variable helix spans. Such E-values cannot be computed assuming a normal score distribution and their estimation previously required lengthy simulations. RESULTS: We introduce discrete convolutions as an accurate and fast mean to estimate score distributions of lod-score profiles. This method provides excellent score estimations for all single-strand or helical elements tested and also applies to the combination of elements into larger, complex, motifs. Further, the estimated distributions remain accurate even when pseudocounts are introduced into the lod-score profiles. Estimated score distributions are then easily converted into E-values. CONCLUSION: A good agreement was observed between computed E-values and simulations for a number of complete RNA motifs. This method is now implemented into the ERPIN software, but it can be applied as well to any search procedure based on ungapped profiles with statistically independent columns. BioMed Central 2005-05-13 /pmc/articles/PMC1168889/ /pubmed/15892887 http://dx.doi.org/10.1186/1471-2105-6-118 Text en Copyright © 2005 Lambert et al; licensee BioMed Central Ltd.
spellingShingle Methodology Article
Lambert, André
Legendre, Matthieu
Fontaine, Jean-Fred
Gautheret, Daniel
Computing expectation values for RNA motifs using discrete convolutions
title Computing expectation values for RNA motifs using discrete convolutions
title_full Computing expectation values for RNA motifs using discrete convolutions
title_fullStr Computing expectation values for RNA motifs using discrete convolutions
title_full_unstemmed Computing expectation values for RNA motifs using discrete convolutions
title_short Computing expectation values for RNA motifs using discrete convolutions
title_sort computing expectation values for rna motifs using discrete convolutions
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1168889/
https://www.ncbi.nlm.nih.gov/pubmed/15892887
http://dx.doi.org/10.1186/1471-2105-6-118
work_keys_str_mv AT lambertandre computingexpectationvaluesforrnamotifsusingdiscreteconvolutions
AT legendrematthieu computingexpectationvaluesforrnamotifsusingdiscreteconvolutions
AT fontainejeanfred computingexpectationvaluesforrnamotifsusingdiscreteconvolutions
AT gautheretdaniel computingexpectationvaluesforrnamotifsusingdiscreteconvolutions