Cargando…

DNA motif alignment by evolving a population of Markov chains

BACKGROUND: Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem th...

Descripción completa

Detalles Bibliográficos
Autor principal:	Bi, Chengpeng
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2009
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648755/ https://www.ncbi.nlm.nih.gov/pubmed/19208112 http://dx.doi.org/10.1186/1471-2105-10-S1-S13

_version_	1782164980600143872
author	Bi, Chengpeng
author_facet	Bi, Chengpeng
author_sort	Bi, Chengpeng
collection	PubMed
description	BACKGROUND: Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. RESULTS: This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. CONCLUSION: Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.
format	Text
id	pubmed-2648755
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-26487552009-03-03 DNA motif alignment by evolving a population of Markov chains Bi, Chengpeng BMC Bioinformatics Research BACKGROUND: Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. RESULTS: This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. CONCLUSION: Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences. BioMed Central 2009-01-30 /pmc/articles/PMC2648755/ /pubmed/19208112 http://dx.doi.org/10.1186/1471-2105-10-S1-S13 Text en Copyright © 2009 Bi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Bi, Chengpeng DNA motif alignment by evolving a population of Markov chains
title	DNA motif alignment by evolving a population of Markov chains
title_full	DNA motif alignment by evolving a population of Markov chains
title_fullStr	DNA motif alignment by evolving a population of Markov chains
title_full_unstemmed	DNA motif alignment by evolving a population of Markov chains
title_short	DNA motif alignment by evolving a population of Markov chains
title_sort	dna motif alignment by evolving a population of markov chains
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648755/ https://www.ncbi.nlm.nih.gov/pubmed/19208112 http://dx.doi.org/10.1186/1471-2105-10-S1-S13
work_keys_str_mv	AT bichengpeng dnamotifalignmentbyevolvingapopulationofmarkovchains

DNA motif alignment by evolving a population of Markov chains

Ejemplares similares