Cargando…
GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge
BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior info...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112114/ https://www.ncbi.nlm.nih.gov/pubmed/21513505 http://dx.doi.org/10.1186/1748-7188-6-13 |
_version_ | 1782205700946001920 |
---|---|
author | Carvalho, Alexandra M Oliveira, Arlindo L |
author_facet | Carvalho, Alexandra M Oliveira, Arlindo L |
author_sort | Carvalho, Alexandra M |
collection | PubMed |
description | BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied. RESULTS: We extend RISOTTO, a combinatorial algorithm for motif discovery, by post-processing its output with a greedy procedure that uses prior information. PSP's from different sources are combined into a scoring criterion that guides the greedy search procedure. The resulting method, called GRISOTTO, was evaluated over 156 yeast TF ChIP-chip sequence-sets commonly used to benchmark prior-based motif discovery algorithms. Results show that GRISOTTO is at least as accurate as other twelve state-of-the-art approaches for the same task, even without combining priors. Furthermore, by considering combined priors, GRISOTTO is considerably more accurate than the state-of-the-art approaches for the same task. We also show that PSP's improve GRISOTTO ability to retrieve motifs from mouse ChiP-seq data, indicating that the proposed algorithm can be applied to data from a different technology and for a higher eukaryote. CONCLUSIONS: The conclusions of this work are twofold. First, post-processing the output of combinatorial algorithms by incorporating prior information leads to a very efficient and effective motif discovery method. Second, combining priors from different sources is even more beneficial than considering them separately. |
format | Online Article Text |
id | pubmed-3112114 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31121142011-06-11 GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge Carvalho, Alexandra M Oliveira, Arlindo L Algorithms Mol Biol Research BACKGROUND: Position-specific priors (PSP) have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied. RESULTS: We extend RISOTTO, a combinatorial algorithm for motif discovery, by post-processing its output with a greedy procedure that uses prior information. PSP's from different sources are combined into a scoring criterion that guides the greedy search procedure. The resulting method, called GRISOTTO, was evaluated over 156 yeast TF ChIP-chip sequence-sets commonly used to benchmark prior-based motif discovery algorithms. Results show that GRISOTTO is at least as accurate as other twelve state-of-the-art approaches for the same task, even without combining priors. Furthermore, by considering combined priors, GRISOTTO is considerably more accurate than the state-of-the-art approaches for the same task. We also show that PSP's improve GRISOTTO ability to retrieve motifs from mouse ChiP-seq data, indicating that the proposed algorithm can be applied to data from a different technology and for a higher eukaryote. CONCLUSIONS: The conclusions of this work are twofold. First, post-processing the output of combinatorial algorithms by incorporating prior information leads to a very efficient and effective motif discovery method. Second, combining priors from different sources is even more beneficial than considering them separately. BioMed Central 2011-04-22 /pmc/articles/PMC3112114/ /pubmed/21513505 http://dx.doi.org/10.1186/1748-7188-6-13 Text en Copyright ©2011 Carvalho and Oliveira; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Carvalho, Alexandra M Oliveira, Arlindo L GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title | GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title_full | GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title_fullStr | GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title_full_unstemmed | GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title_short | GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
title_sort | grisotto: a greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3112114/ https://www.ncbi.nlm.nih.gov/pubmed/21513505 http://dx.doi.org/10.1186/1748-7188-6-13 |
work_keys_str_mv | AT carvalhoalexandram grisottoagreedyapproachtoimprovecombinatorialalgorithmsformotifdiscoverywithpriorknowledge AT oliveiraarlindol grisottoagreedyapproachtoimprovecombinatorialalgorithmsformotifdiscoverywithpriorknowledge |