Cargando…

Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments

Recent advances in technology have led to a dramatic increase in the number of available transcription factor ChIP-seq and ChIP-chip data sets. Understanding the motif content of these data sets is an important step in understanding the underlying mechanisms of regulation. Here we provide a systemat...

Descripción completa

Detalles Bibliográficos
Autores principales: Kheradpour, Pouya, Kellis, Manolis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3950668/
https://www.ncbi.nlm.nih.gov/pubmed/24335146
http://dx.doi.org/10.1093/nar/gkt1249
_version_ 1782307027878412288
author Kheradpour, Pouya
Kellis, Manolis
author_facet Kheradpour, Pouya
Kellis, Manolis
author_sort Kheradpour, Pouya
collection PubMed
description Recent advances in technology have led to a dramatic increase in the number of available transcription factor ChIP-seq and ChIP-chip data sets. Understanding the motif content of these data sets is an important step in understanding the underlying mechanisms of regulation. Here we provide a systematic motif analysis for 427 human ChIP-seq data sets using motifs curated from the literature and also discovered de novo using five established motif discovery tools. We use a systematic pipeline for calculating motif enrichment in each data set, providing a principled way for choosing between motif variants found in the literature and for flagging potentially problematic data sets. Our analysis confirms the known specificity of 41 of the 56 analyzed factor groups and reveals motifs of potential cofactors. We also use cell type-specific binding to find factors active in specific conditions. The resource we provide is accessible both for browsing a small number of factors and for performing large-scale systematic analyses. We provide motif matrices, instances and enrichments in each of the ENCODE data sets. The motifs discovered here have been used in parallel studies to validate the specificity of antibodies, understand cooperativity between data sets and measure the variation of motif binding across individuals and species.
format Online
Article
Text
id pubmed-3950668
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39506682014-03-12 Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments Kheradpour, Pouya Kellis, Manolis Nucleic Acids Res Recent advances in technology have led to a dramatic increase in the number of available transcription factor ChIP-seq and ChIP-chip data sets. Understanding the motif content of these data sets is an important step in understanding the underlying mechanisms of regulation. Here we provide a systematic motif analysis for 427 human ChIP-seq data sets using motifs curated from the literature and also discovered de novo using five established motif discovery tools. We use a systematic pipeline for calculating motif enrichment in each data set, providing a principled way for choosing between motif variants found in the literature and for flagging potentially problematic data sets. Our analysis confirms the known specificity of 41 of the 56 analyzed factor groups and reveals motifs of potential cofactors. We also use cell type-specific binding to find factors active in specific conditions. The resource we provide is accessible both for browsing a small number of factors and for performing large-scale systematic analyses. We provide motif matrices, instances and enrichments in each of the ENCODE data sets. The motifs discovered here have been used in parallel studies to validate the specificity of antibodies, understand cooperativity between data sets and measure the variation of motif binding across individuals and species. Oxford University Press 2014-03 2013-12-12 /pmc/articles/PMC3950668/ /pubmed/24335146 http://dx.doi.org/10.1093/nar/gkt1249 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Kheradpour, Pouya
Kellis, Manolis
Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title_full Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title_fullStr Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title_full_unstemmed Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title_short Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments
title_sort systematic discovery and characterization of regulatory motifs in encode tf binding experiments
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3950668/
https://www.ncbi.nlm.nih.gov/pubmed/24335146
http://dx.doi.org/10.1093/nar/gkt1249
work_keys_str_mv AT kheradpourpouya systematicdiscoveryandcharacterizationofregulatorymotifsinencodetfbindingexperiments
AT kellismanolis systematicdiscoveryandcharacterizationofregulatorymotifsinencodetfbindingexperiments