Cargando…

c-REDUCE: Incorporating sequence conservation to detect motifs that correlate with expression

BACKGROUND: Computational methods for characterizing novel transcription factor binding sites search for sequence patterns or "motifs" that appear repeatedly in genomic regions of interest. Correlation-based motif finding strategies are used to identify motifs that correlate with expressio...

Descripción completa

Detalles Bibliográficos
Autores principales: Kechris, Katerina, Li, Hao
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2626603/
https://www.ncbi.nlm.nih.gov/pubmed/19040743
http://dx.doi.org/10.1186/1471-2105-9-506
Descripción
Sumario:BACKGROUND: Computational methods for characterizing novel transcription factor binding sites search for sequence patterns or "motifs" that appear repeatedly in genomic regions of interest. Correlation-based motif finding strategies are used to identify motifs that correlate with expression data and do not rely on promoter sequences from a pre-determined set of genes. RESULTS: In this work, we describe a method for predicting motifs that combines the correlation-based strategy with phylogenetic footprinting, where motifs are identified by evaluating orthologous sequence regions from multiple species. Our method, c-REDUCE, can account for variability at a motif position inferred from evolutionary information. c-REDUCE has been tested on ChIP-chip data for yeast transcription factors and on gene expression data in Drosophila. CONCLUSION: Our results indicate that utilizing sequence conservation information in addition to correlation-based methods improves the identification of known motifs.