Cargando…
New decoding algorithms for Hidden Markov Models using distance measures on labellings
BACKGROUND: Existing hidden Markov model decoding algorithms do not focus on approximately identifying the sequence feature boundaries. RESULTS: We give a set of algorithms to compute the conditional probability of all labellings "near" a reference labelling λ for a sequence y for a variet...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3009513/ https://www.ncbi.nlm.nih.gov/pubmed/20122214 http://dx.doi.org/10.1186/1471-2105-11-S1-S40 |
Sumario: | BACKGROUND: Existing hidden Markov model decoding algorithms do not focus on approximately identifying the sequence feature boundaries. RESULTS: We give a set of algorithms to compute the conditional probability of all labellings "near" a reference labelling λ for a sequence y for a variety of definitions of "near". In addition, we give optimization algorithms to find the best labelling for a sequence in the robust sense of having all of its feature boundaries nearly correct. Natural problems in this domain are NP-hard to optimize. For membrane proteins, our algorithms find the approximate topology of such proteins with comparable success to existing programs, while being substantially more accurate in estimating the positions of transmembrane helix boundaries. CONCLUSION: More robust HMM decoding may allow for better analysis of sequence features, in reasonable runtimes. |
---|