Inferring direct DNA binding from ChIP-seq

Genome-wide binding data from transcription factor ChIP-seq experiments is the best source of information for inferring the relative DNA-binding affinity of these proteins in vivo. However, standard motif enrichment analysis and motif discovery approaches sometimes fail to correctly identify the bin...

Bibliographic Details
Main authors: Bailey, Timothy L., Machanick, Philip
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3458523/
https://www.ncbi.nlm.nih.gov/pubmed/22610855
http://dx.doi.org/10.1093/nar/gks433
author Bailey, Timothy L.
Machanick, Philip
collection PubMed
description Genome-wide binding data from transcription factor ChIP-seq experiments is the best source of information for inferring the relative DNA-binding affinity of these proteins in vivo. However, standard motif enrichment analysis and motif discovery approaches sometimes fail to correctly identify the binding motif for the ChIP-ed factor. To overcome this problem, we propose ‘central motif enrichment analysis’ (CMEA), which is based on the observation that the positional distribution of binding sites matching the direct-binding motif tends to be unimodal, well centered and maximal in the precise center of the ChIP-seq peak regions. We describe a novel visualization and statistical analysis tool—CentriMo—that identifies the region of maximum central enrichment in a set of ChIP-seq peak regions and displays the positional distributions of predicted sites. Using CentriMo for motif enrichment analysis, we provide evidence that one transcription factor (Nanog) has different binding affinity in vivo than in vitro, that another binds DNA cooperatively (E2f1), and confirm the in vivo affinity of NFIC, rescuing a difficult ChIP-seq data set. In another data set, CentriMo strongly suggests that there is no evidence of direct DNA binding by the ChIP-ed factor (Smad1). CentriMo is now part of the MEME Suite software package available at http://meme.nbcr.net. All data and output files presented here are available at: http://research.imb.uq.edu.au/t.bailey/sd/Bailey2011a.
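The central enrichment idea in the description above can be illustrated with a short sketch. It is not the CentriMo implementation: it assumes one predicted site per peak region, expressed as an offset from the region center, and it assumes a binomial test against a uniform positional null, which the abstract does not specify. The names `binomial_tail`, `best_central_window`, `site_offsets`, `region_width` and `step` are hypothetical.

```python
import math


def binomial_tail(n, k, p):
    """Return P(X >= k) for X ~ Binomial(n, p), summing terms in log space."""
    if k <= 0 or p >= 1.0:
        return 1.0
    total = 0.0
    for i in range(k, n + 1):
        log_term = (
            math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
            + i * math.log(p) + (n - i) * math.log1p(-p)
        )
        total += math.exp(log_term)
    return min(total, 1.0)


def best_central_window(site_offsets, region_width, step=2):
    """Scan centered windows and return (width, p_value, count) of the most enriched one.

    site_offsets: offset of the predicted site from the region center,
                  one value per peak region (assumption of this sketch).
    region_width: common width of all peak regions.
    """
    n = len(site_offsets)
    best = None
    for width in range(step, region_width + 1, step):
        # Count regions whose site falls inside the centered window.
        count = sum(1 for x in site_offsets if abs(x) <= width / 2)
        # Assumed null: a site is equally likely anywhere in the region.
        p_inside = width / region_width
        p_value = binomial_tail(n, count, p_inside)
        if best is None or p_value < best[1]:
            best = (width, p_value, count)
    return best


if __name__ == "__main__":
    offsets = [0, 1, -1, 2, -2, 0, 1, -3, 40, -35]
    width, p_value, count = best_central_window(offsets, region_width=100)
    print(f"best window width={width}, sites inside={count}, p={p_value:.3g}")
```

A motif whose sites cluster tightly around the peak center yields a narrow best window with a small p-value, whereas sites spread evenly across the regions yield no window that stands out. Scanning many window widths is a form of multiple testing, so a real analysis would need to correct the reported p-value accordingly.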
format Online
Article
Text
id pubmed-3458523
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3458523 2012-09-27 Inferring direct DNA binding from ChIP-seq Bailey, Timothy L. Machanick, Philip Nucleic Acids Res Methods Online Oxford University Press 2012-09 2012-05-18 /pmc/articles/PMC3458523/ /pubmed/22610855 http://dx.doi.org/10.1093/nar/gks433 Text en © The Author(s) 2012. Published by Oxford University Press.
http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Inferring direct DNA binding from ChIP-seq
topic Methods Online