Cargando…

Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics

The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general re...

Descripción completa

Detalles Bibliográficos
Autor principal: Nuel, Grégory
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1479348/
https://www.ncbi.nlm.nih.gov/pubmed/16722531
http://dx.doi.org/10.1186/1748-7188-1-5
_version_ 1782128182503145472
author Nuel, Grégory
author_facet Nuel, Grégory
author_sort Nuel, Grégory
collection PubMed
description The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general recursive algorithms allowing to compute in a numerically stable manner exact Cumulative Distribution Function (CDF) or complementary CDF (CCDF). These algorithms are then applied in two particular cases: the local score of one sequence and pattern statistics. In both cases, asymptotic developments are derived. For the local score, our new approach allows for the very first time to compute exact p-values for a practical study (finding hydrophobic segments in a protein database) where only approximations were available before. In this study, the asymptotic approximations appear to be completely unreliable for 99.5% of the considered sequences. Concerning the pattern statistics, the new FMCI algorithms dramatically outperform the previous ones as they are more reliable, easier to implement, faster and with lower memory requirements.
format Text
id pubmed-1479348
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14793482006-06-15 Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics Nuel, Grégory Algorithms Mol Biol Research The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general recursive algorithms allowing to compute in a numerically stable manner exact Cumulative Distribution Function (CDF) or complementary CDF (CCDF). These algorithms are then applied in two particular cases: the local score of one sequence and pattern statistics. In both cases, asymptotic developments are derived. For the local score, our new approach allows for the very first time to compute exact p-values for a practical study (finding hydrophobic segments in a protein database) where only approximations were available before. In this study, the asymptotic approximations appear to be completely unreliable for 99.5% of the considered sequences. Concerning the pattern statistics, the new FMCI algorithms dramatically outperform the previous ones as they are more reliable, easier to implement, faster and with lower memory requirements. BioMed Central 2006-04-07 /pmc/articles/PMC1479348/ /pubmed/16722531 http://dx.doi.org/10.1186/1748-7188-1-5 Text en Copyright © 2006 Nuel; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Nuel, Grégory
Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title_full Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title_fullStr Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title_full_unstemmed Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title_short Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
title_sort effective p-value computations using finite markov chain imbedding (fmci): application to local score and to pattern statistics
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1479348/
https://www.ncbi.nlm.nih.gov/pubmed/16722531
http://dx.doi.org/10.1186/1748-7188-1-5
work_keys_str_mv AT nuelgregory effectivepvaluecomputationsusingfinitemarkovchainimbeddingfmciapplicationtolocalscoreandtopatternstatistics