Cargando…

On calculating the probability of a set of orthologous sequences

Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose M...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Junfeng, Chen, Liang, Zhao, Hongyu, Moore, Dirk F, Lin, Yong, Shih, Weichung Joe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove Medical Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3169941/
https://www.ncbi.nlm.nih.gov/pubmed/21918614
Descripción
Sumario:Probabilistic DNA sequence models have been intensively applied to genome research. Within the evolutionary biology framework, this article investigates the feasibility for rigorously estimating the probability of a set of orthologous DNA sequences which evolve from a common progenitor. We propose Monte Carlo integration algorithms to sample the unknown ancestral and/or root sequences a posteriori conditional on a reference sequence and apply pairwise Needleman–Wunsch alignment between the sampled and nonreference species sequences to estimate the probability. We test our algorithms on both simulated and real sequences and compare calculated probabilities from Monte Carlo integration to those induced by single multiple alignment.