Cargando…
Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics
The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general re...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1479348/ https://www.ncbi.nlm.nih.gov/pubmed/16722531 http://dx.doi.org/10.1186/1748-7188-1-5 |
_version_ | 1782128182503145472 |
---|---|
author | Nuel, Grégory |
author_facet | Nuel, Grégory |
author_sort | Nuel, Grégory |
collection | PubMed |
description | The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general recursive algorithms allowing to compute in a numerically stable manner exact Cumulative Distribution Function (CDF) or complementary CDF (CCDF). These algorithms are then applied in two particular cases: the local score of one sequence and pattern statistics. In both cases, asymptotic developments are derived. For the local score, our new approach allows for the very first time to compute exact p-values for a practical study (finding hydrophobic segments in a protein database) where only approximations were available before. In this study, the asymptotic approximations appear to be completely unreliable for 99.5% of the considered sequences. Concerning the pattern statistics, the new FMCI algorithms dramatically outperform the previous ones as they are more reliable, easier to implement, faster and with lower memory requirements. |
format | Text |
id | pubmed-1479348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14793482006-06-15 Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics Nuel, Grégory Algorithms Mol Biol Research The technique of Finite Markov Chain Imbedding (FMCI) is a classical approach to complex combinatorial problems related to sequences. In order to get efficient algorithms, it is known that such approaches need to be first rewritten using recursive relations. We propose here to give here a general recursive algorithms allowing to compute in a numerically stable manner exact Cumulative Distribution Function (CDF) or complementary CDF (CCDF). These algorithms are then applied in two particular cases: the local score of one sequence and pattern statistics. In both cases, asymptotic developments are derived. For the local score, our new approach allows for the very first time to compute exact p-values for a practical study (finding hydrophobic segments in a protein database) where only approximations were available before. In this study, the asymptotic approximations appear to be completely unreliable for 99.5% of the considered sequences. Concerning the pattern statistics, the new FMCI algorithms dramatically outperform the previous ones as they are more reliable, easier to implement, faster and with lower memory requirements. BioMed Central 2006-04-07 /pmc/articles/PMC1479348/ /pubmed/16722531 http://dx.doi.org/10.1186/1748-7188-1-5 Text en Copyright © 2006 Nuel; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Nuel, Grégory Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title | Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title_full | Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title_fullStr | Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title_full_unstemmed | Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title_short | Effective p-value computations using Finite Markov Chain Imbedding (FMCI): application to local score and to pattern statistics |
title_sort | effective p-value computations using finite markov chain imbedding (fmci): application to local score and to pattern statistics |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1479348/ https://www.ncbi.nlm.nih.gov/pubmed/16722531 http://dx.doi.org/10.1186/1748-7188-1-5 |
work_keys_str_mv | AT nuelgregory effectivepvaluecomputationsusingfinitemarkovchainimbeddingfmciapplicationtolocalscoreandtopatternstatistics |