Cargando…

Effective ambiguity checking in biosequence analysis

BACKGROUND: Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence...

Descripción completa

Detalles Bibliográficos
Autores principales: Reeder, Janina, Steffen, Peter, Giegerich, Robert
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1215473/
https://www.ncbi.nlm.nih.gov/pubmed/15967024
http://dx.doi.org/10.1186/1471-2105-6-153
_version_ 1782124953640894464
author Reeder, Janina
Steffen, Peter
Giegerich, Robert
author_facet Reeder, Janina
Steffen, Peter
Giegerich, Robert
author_sort Reeder, Janina
collection PubMed
description BACKGROUND: Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence of ambiguity. As this problem inherits undecidability (as we show here) from the namely problem for context free languages, there is no complete algorithmic solution to the problem of ambiguity checking. RESULTS: We explain frequently observed sources of ambiguity, and show how to avoid them. We suggest four testing procedures that may help to detect ambiguity when present, including a just-in-time test that permits to work safely with a potentially ambiguous grammar. We introduce, for the special case of stochastic context free grammars and RNA structure modeling, an automated partial procedure for proving non-ambiguity. It is used to demonstrate non-ambiguity for several relevant grammars. CONCLUSION: Our mechanical proof procedure and our testing methods provide a powerful arsenal of methods to ensure non-ambiguity.
format Text
id pubmed-1215473
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12154732005-09-17 Effective ambiguity checking in biosequence analysis Reeder, Janina Steffen, Peter Giegerich, Robert BMC Bioinformatics Research Article BACKGROUND: Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence of ambiguity. As this problem inherits undecidability (as we show here) from the namely problem for context free languages, there is no complete algorithmic solution to the problem of ambiguity checking. RESULTS: We explain frequently observed sources of ambiguity, and show how to avoid them. We suggest four testing procedures that may help to detect ambiguity when present, including a just-in-time test that permits to work safely with a potentially ambiguous grammar. We introduce, for the special case of stochastic context free grammars and RNA structure modeling, an automated partial procedure for proving non-ambiguity. It is used to demonstrate non-ambiguity for several relevant grammars. CONCLUSION: Our mechanical proof procedure and our testing methods provide a powerful arsenal of methods to ensure non-ambiguity. BioMed Central 2005-06-20 /pmc/articles/PMC1215473/ /pubmed/15967024 http://dx.doi.org/10.1186/1471-2105-6-153 Text en Copyright © 2005 Reeder et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Reeder, Janina
Steffen, Peter
Giegerich, Robert
Effective ambiguity checking in biosequence analysis
title Effective ambiguity checking in biosequence analysis
title_full Effective ambiguity checking in biosequence analysis
title_fullStr Effective ambiguity checking in biosequence analysis
title_full_unstemmed Effective ambiguity checking in biosequence analysis
title_short Effective ambiguity checking in biosequence analysis
title_sort effective ambiguity checking in biosequence analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1215473/
https://www.ncbi.nlm.nih.gov/pubmed/15967024
http://dx.doi.org/10.1186/1471-2105-6-153
work_keys_str_mv AT reederjanina effectiveambiguitycheckinginbiosequenceanalysis
AT steffenpeter effectiveambiguitycheckinginbiosequenceanalysis
AT giegerichrobert effectiveambiguitycheckinginbiosequenceanalysis