Cargando…

GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge

BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior info...

Descripción completa

Detalles Bibliográficos
Autores principales: Carvalho, Alexandra M, Oliveira, Arlindo L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112114/
https://www.ncbi.nlm.nih.gov/pubmed/21513505
http://dx.doi.org/10.1186/1748-7188-6-13
_version_ 1782205700946001920
author Carvalho, Alexandra M
Oliveira, Arlindo L
author_facet Carvalho, Alexandra M
Oliveira, Arlindo L
author_sort Carvalho, Alexandra M
collection PubMed
description BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied. RESULTS: We extend RISOTTO, a combinatorial algorithm for motif discovery, by post-processing its output with a greedy procedure that uses prior information. PSP's from different sources are combined into a scoring criterion that guides the greedy search procedure. The resulting method, called GRISOTTO, was evaluated over 156 yeast TF ChIP-chip sequence-sets commonly used to benchmark prior-based motif discovery algorithms. Results show that GRISOTTO is at least as accurate as other twelve state-of-the-art approaches for the same task, even without combining priors. Furthermore, by considering combined priors, GRISOTTO is considerably more accurate than the state-of-the-art approaches for the same task. We also show that PSP's improve GRISOTTO ability to retrieve motifs from mouse ChiP-seq data, indicating that the proposed algorithm can be applied to data from a different technology and for a higher eukaryote. CONCLUSIONS: The conclusions of this work are twofold. First, post-processing the output of combinatorial algorithms by incorporating prior information leads to a very efficient and effective motif discovery method. Second, combining priors from different sources is even more beneficial than considering them separately.
format Online
Article
Text
id pubmed-3112114
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31121142011-06-11 GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge Carvalho, Alexandra M Oliveira, Arlindo L Algorithms Mol Biol Research BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied. RESULTS: We extend RISOTTO, a combinatorial algorithm for motif discovery, by post-processing its output with a greedy procedure that uses prior information. PSP's from different sources are combined into a scoring criterion that guides the greedy search procedure. The resulting method, called GRISOTTO, was evaluated over 156 yeast TF ChIP-chip sequence-sets commonly used to benchmark prior-based motif discovery algorithms. Results show that GRISOTTO is at least as accurate as other twelve state-of-the-art approaches for the same task, even without combining priors. Furthermore, by considering combined priors, GRISOTTO is considerably more accurate than the state-of-the-art approaches for the same task. We also show that PSP's improve GRISOTTO ability to retrieve motifs from mouse ChiP-seq data, indicating that the proposed algorithm can be applied to data from a different technology and for a higher eukaryote. CONCLUSIONS: The conclusions of this work are twofold. First, post-processing the output of combinatorial algorithms by incorporating prior information leads to a very efficient and effective motif discovery method. Second, combining priors from different sources is even more beneficial than considering them separately. BioMed Central 2011-04-22 /pmc/articles/PMC3112114/ /pubmed/21513505 http://dx.doi.org/10.1186/1748-7188-6-13 Text en Copyright ©2011 Carvalho and Oliveira; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Carvalho, Alexandra M
Oliveira, Arlindo L
GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title_full GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title_fullStr GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title_full_unstemmed GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title_short GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
title_sort grisotto: a greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112114/
https://www.ncbi.nlm.nih.gov/pubmed/21513505
http://dx.doi.org/10.1186/1748-7188-6-13
work_keys_str_mv AT carvalhoalexandram grisottoagreedyapproachtoimprovecombinatorialalgorithmsformotifdiscoverywithpriorknowledge
AT oliveiraarlindol grisottoagreedyapproachtoimprovecombinatorialalgorithmsformotifdiscoverywithpriorknowledge