Cargando…
A systematic genome-wide account of binding sites for the model transcription factor Gcn4
Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-dep...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8805717/ https://www.ncbi.nlm.nih.gov/pubmed/34916251 http://dx.doi.org/10.1101/gr.276080.121 |
_version_ | 1784643286284107776 |
---|---|
author | Coey, Christopher T. Clark, David J. |
author_facet | Coey, Christopher T. Clark, David J. |
author_sort | Coey, Christopher T. |
collection | PubMed |
description | Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-depleted regulatory regions are bound. We compared the binding of the yeast transcription factor Gcn4 in vivo using published ChIP-seq data (546 sites) and in vitro, using a modified SELEX method (“G-SELEX”), which utilizes short genomic DNA fragments to quantify binding at all sites. We confirm that Gcn4 binds strongly to an AP-1-like sequence (TGACTCA) and weakly to half-sites. However, Gcn4 binds only some of the 1078 exact matches to this sequence, even in vitro. We show that there are only 166 copies of the high-affinity RTGACTCAY site (exact match) in the yeast genome, all occupied in vivo, largely independently of whether they are located in nucleosome-depleted or nucleosomal regions. Generally, RTGACTCAR/YTGACTCAY sites are bound much more weakly and YTGACTCAR sites are unbound, with biological implications for determining induction levels. We conclude that, to a first approximation, Gcn4 binding can be predicted using the high-affinity site, without reference to chromatin structure. We propose that transcription factor binding sites should be defined more precisely using quantitative data, allowing more accurate genome-wide prediction of binding sites and greater insight into gene regulation. |
format | Online Article Text |
id | pubmed-8805717 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88057172022-08-01 A systematic genome-wide account of binding sites for the model transcription factor Gcn4 Coey, Christopher T. Clark, David J. Genome Res Method Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-depleted regulatory regions are bound. We compared the binding of the yeast transcription factor Gcn4 in vivo using published ChIP-seq data (546 sites) and in vitro, using a modified SELEX method (“G-SELEX”), which utilizes short genomic DNA fragments to quantify binding at all sites. We confirm that Gcn4 binds strongly to an AP-1-like sequence (TGACTCA) and weakly to half-sites. However, Gcn4 binds only some of the 1078 exact matches to this sequence, even in vitro. We show that there are only 166 copies of the high-affinity RTGACTCAY site (exact match) in the yeast genome, all occupied in vivo, largely independently of whether they are located in nucleosome-depleted or nucleosomal regions. Generally, RTGACTCAR/YTGACTCAY sites are bound much more weakly and YTGACTCAR sites are unbound, with biological implications for determining induction levels. We conclude that, to a first approximation, Gcn4 binding can be predicted using the high-affinity site, without reference to chromatin structure. We propose that transcription factor binding sites should be defined more precisely using quantitative data, allowing more accurate genome-wide prediction of binding sites and greater insight into gene regulation. Cold Spring Harbor Laboratory Press 2022-02 /pmc/articles/PMC8805717/ /pubmed/34916251 http://dx.doi.org/10.1101/gr.276080.121 Text en Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This is a work of the US Government. |
spellingShingle | Method Coey, Christopher T. Clark, David J. A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title | A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title_full | A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title_fullStr | A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title_full_unstemmed | A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title_short | A systematic genome-wide account of binding sites for the model transcription factor Gcn4 |
title_sort | systematic genome-wide account of binding sites for the model transcription factor gcn4 |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8805717/ https://www.ncbi.nlm.nih.gov/pubmed/34916251 http://dx.doi.org/10.1101/gr.276080.121 |
work_keys_str_mv | AT coeychristophert asystematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4 AT clarkdavidj asystematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4 AT coeychristophert systematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4 AT clarkdavidj systematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4 |