Cargando…

The value of position-specific priors in motif discovery using MEME

BACKGROUND: Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types–including sequence conservation, nucleosome positioning, and negative examples–can be converted into a prior over the...

Descripción completa

Detalles Bibliográficos
Autores principales: Bailey, Timothy L, Bodén, Mikael, Whitington, Tom, Machanick, Philip
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2868008/
https://www.ncbi.nlm.nih.gov/pubmed/20380693
http://dx.doi.org/10.1186/1471-2105-11-179
_version_ 1782181025488568320
author Bailey, Timothy L
Bodén, Mikael
Whitington, Tom
Machanick, Philip
author_facet Bailey, Timothy L
Bodén, Mikael
Whitington, Tom
Machanick, Philip
author_sort Bailey, Timothy L
collection PubMed
description BACKGROUND: Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types–including sequence conservation, nucleosome positioning, and negative examples–can be converted into a prior over the location of motif sites, which then guides the sequence motif discovery algorithm. This approach has been shown to confer many of the benefits of conservation-based and discriminative motif discovery approaches on Gibbs sampler-based motif discovery methods, but has not previously been studied with methods based on expectation maximization (EM). RESULTS: We extend the popular EM-based MEME algorithm to utilize position-specific priors and demonstrate their effectiveness for discovering transcription factor (TF) motifs in yeast and mouse DNA sequences. Utilizing a discriminative, conservation-based prior dramatically improves MEME's ability to discover motifs in 156 yeast TF ChIP-chip datasets, more than doubling the number of datasets where it finds the correct motif. On these datasets, MEME using the prior has a higher success rate than eight other conservation-based motif discovery approaches. We also show that the same type of prior improves the accuracy of motifs discovered by MEME in mouse TF ChIP-seq data, and that the motifs tend to be of slightly higher quality those found by a Gibbs sampling algorithm using the same prior. CONCLUSIONS: We conclude that using position-specific priors can substantially increase the power of EM-based motif discovery algorithms such as MEME algorithm.
format Text
id pubmed-2868008
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28680082010-05-12 The value of position-specific priors in motif discovery using MEME Bailey, Timothy L Bodén, Mikael Whitington, Tom Machanick, Philip BMC Bioinformatics Research article BACKGROUND: Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types–including sequence conservation, nucleosome positioning, and negative examples–can be converted into a prior over the location of motif sites, which then guides the sequence motif discovery algorithm. This approach has been shown to confer many of the benefits of conservation-based and discriminative motif discovery approaches on Gibbs sampler-based motif discovery methods, but has not previously been studied with methods based on expectation maximization (EM). RESULTS: We extend the popular EM-based MEME algorithm to utilize position-specific priors and demonstrate their effectiveness for discovering transcription factor (TF) motifs in yeast and mouse DNA sequences. Utilizing a discriminative, conservation-based prior dramatically improves MEME's ability to discover motifs in 156 yeast TF ChIP-chip datasets, more than doubling the number of datasets where it finds the correct motif. On these datasets, MEME using the prior has a higher success rate than eight other conservation-based motif discovery approaches. We also show that the same type of prior improves the accuracy of motifs discovered by MEME in mouse TF ChIP-seq data, and that the motifs tend to be of slightly higher quality those found by a Gibbs sampling algorithm using the same prior. CONCLUSIONS: We conclude that using position-specific priors can substantially increase the power of EM-based motif discovery algorithms such as MEME algorithm. BioMed Central 2010-04-09 /pmc/articles/PMC2868008/ /pubmed/20380693 http://dx.doi.org/10.1186/1471-2105-11-179 Text en Copyright ©2010 Bailey et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Bailey, Timothy L
Bodén, Mikael
Whitington, Tom
Machanick, Philip
The value of position-specific priors in motif discovery using MEME
title The value of position-specific priors in motif discovery using MEME
title_full The value of position-specific priors in motif discovery using MEME
title_fullStr The value of position-specific priors in motif discovery using MEME
title_full_unstemmed The value of position-specific priors in motif discovery using MEME
title_short The value of position-specific priors in motif discovery using MEME
title_sort value of position-specific priors in motif discovery using meme
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2868008/
https://www.ncbi.nlm.nih.gov/pubmed/20380693
http://dx.doi.org/10.1186/1471-2105-11-179
work_keys_str_mv AT baileytimothyl thevalueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT bodenmikael thevalueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT whitingtontom thevalueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT machanickphilip thevalueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT baileytimothyl valueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT bodenmikael valueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT whitingtontom valueofpositionspecificpriorsinmotifdiscoveryusingmeme
AT machanickphilip valueofpositionspecificpriorsinmotifdiscoveryusingmeme