Cargando…

Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors

Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE C...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jie, Zhuang, Jiali, Iyer, Sowmya, Lin, XinYing, Whitfield, Troy W., Greven, Melissa C., Pierce, Brian G., Dong, Xianjun, Kundaje, Anshul, Cheng, Yong, Rando, Oliver J., Birney, Ewan, Myers, Richard M., Noble, William S., Snyder, Michael, Weng, Zhiping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3431495/
https://www.ncbi.nlm.nih.gov/pubmed/22955990
http://dx.doi.org/10.1101/gr.139105.112
_version_ 1782242093005012992
author Wang, Jie
Zhuang, Jiali
Iyer, Sowmya
Lin, XinYing
Whitfield, Troy W.
Greven, Melissa C.
Pierce, Brian G.
Dong, Xianjun
Kundaje, Anshul
Cheng, Yong
Rando, Oliver J.
Birney, Ewan
Myers, Richard M.
Noble, William S.
Snyder, Michael
Weng, Zhiping
author_facet Wang, Jie
Zhuang, Jiali
Iyer, Sowmya
Lin, XinYing
Whitfield, Troy W.
Greven, Melissa C.
Pierce, Brian G.
Dong, Xianjun
Kundaje, Anshul
Cheng, Yong
Rando, Oliver J.
Birney, Ewan
Myers, Richard M.
Noble, William S.
Snyder, Michael
Weng, Zhiping
author_sort Wang, Jie
collection PubMed
description Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are highly conserved evolutionarily and show distinct footprints upon DNase I digestion. We frequently detected secondary motifs in addition to the canonical motifs of the TFs, indicating tethered binding and cobinding between multiple TFs. We observed significant position and orientation preferences between many cobinding TFs. Genes specifically expressed in a cell line are often associated with a greater occurrence of nearby TF binding in that cell line. We observed cell-line–specific secondary motifs that mediate the binding of the histone deacetylase HDAC2 and the enhancer-binding protein EP300. TF binding sites are located in GC-rich, nucleosome-depleted, and DNase I sensitive regions, flanked by well-positioned nucleosomes, and many of these features show cell type specificity. The GC-richness may be beneficial for regulating TF binding because, when unoccupied by a TF, these regions are occupied by nucleosomes in vivo. We present the results of our analysis in a TF-centric web repository Factorbook (http://factorbook.org) and will continually update this repository as more ENCODE data are generated.
format Online
Article
Text
id pubmed-3431495
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-34314952012-09-08 Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors Wang, Jie Zhuang, Jiali Iyer, Sowmya Lin, XinYing Whitfield, Troy W. Greven, Melissa C. Pierce, Brian G. Dong, Xianjun Kundaje, Anshul Cheng, Yong Rando, Oliver J. Birney, Ewan Myers, Richard M. Noble, William S. Snyder, Michael Weng, Zhiping Genome Res Resource Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We identified highly enriched sequence motifs in most data sets, revealing new motifs and validating known ones. The motif sites (TF binding sites) are highly conserved evolutionarily and show distinct footprints upon DNase I digestion. We frequently detected secondary motifs in addition to the canonical motifs of the TFs, indicating tethered binding and cobinding between multiple TFs. We observed significant position and orientation preferences between many cobinding TFs. Genes specifically expressed in a cell line are often associated with a greater occurrence of nearby TF binding in that cell line. We observed cell-line–specific secondary motifs that mediate the binding of the histone deacetylase HDAC2 and the enhancer-binding protein EP300. TF binding sites are located in GC-rich, nucleosome-depleted, and DNase I sensitive regions, flanked by well-positioned nucleosomes, and many of these features show cell type specificity. The GC-richness may be beneficial for regulating TF binding because, when unoccupied by a TF, these regions are occupied by nucleosomes in vivo. We present the results of our analysis in a TF-centric web repository Factorbook (http://factorbook.org) and will continually update this repository as more ENCODE data are generated. Cold Spring Harbor Laboratory Press 2012-09 /pmc/articles/PMC3431495/ /pubmed/22955990 http://dx.doi.org/10.1101/gr.139105.112 Text en © 2012, Published by Cold Spring Harbor Laboratory Press This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 3.0 Unported License), as described at http://creativecommons.org/licenses/by-nc/3.0/.
spellingShingle Resource
Wang, Jie
Zhuang, Jiali
Iyer, Sowmya
Lin, XinYing
Whitfield, Troy W.
Greven, Melissa C.
Pierce, Brian G.
Dong, Xianjun
Kundaje, Anshul
Cheng, Yong
Rando, Oliver J.
Birney, Ewan
Myers, Richard M.
Noble, William S.
Snyder, Michael
Weng, Zhiping
Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title_full Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title_fullStr Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title_full_unstemmed Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title_short Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
title_sort sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3431495/
https://www.ncbi.nlm.nih.gov/pubmed/22955990
http://dx.doi.org/10.1101/gr.139105.112
work_keys_str_mv AT wangjie sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT zhuangjiali sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT iyersowmya sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT linxinying sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT whitfieldtroyw sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT grevenmelissac sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT piercebriang sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT dongxianjun sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT kundajeanshul sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT chengyong sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT randooliverj sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT birneyewan sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT myersrichardm sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT noblewilliams sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT snydermichael sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors
AT wengzhiping sequencefeaturesandchromatinstructurearoundthegenomicregionsboundby119humantranscriptionfactors