Cargando…

A systematic genome-wide account of binding sites for the model transcription factor Gcn4

Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-dep...

Descripción completa

Detalles Bibliográficos
Autores principales: Coey, Christopher T., Clark, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8805717/
https://www.ncbi.nlm.nih.gov/pubmed/34916251
http://dx.doi.org/10.1101/gr.276080.121
_version_ 1784643286284107776
author Coey, Christopher T.
Clark, David J.
author_facet Coey, Christopher T.
Clark, David J.
author_sort Coey, Christopher T.
collection PubMed
description Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-depleted regulatory regions are bound. We compared the binding of the yeast transcription factor Gcn4 in vivo using published ChIP-seq data (546 sites) and in vitro, using a modified SELEX method (“G-SELEX”), which utilizes short genomic DNA fragments to quantify binding at all sites. We confirm that Gcn4 binds strongly to an AP-1-like sequence (TGACTCA) and weakly to half-sites. However, Gcn4 binds only some of the 1078 exact matches to this sequence, even in vitro. We show that there are only 166 copies of the high-affinity RTGACTCAY site (exact match) in the yeast genome, all occupied in vivo, largely independently of whether they are located in nucleosome-depleted or nucleosomal regions. Generally, RTGACTCAR/YTGACTCAY sites are bound much more weakly and YTGACTCAR sites are unbound, with biological implications for determining induction levels. We conclude that, to a first approximation, Gcn4 binding can be predicted using the high-affinity site, without reference to chromatin structure. We propose that transcription factor binding sites should be defined more precisely using quantitative data, allowing more accurate genome-wide prediction of binding sites and greater insight into gene regulation.
format Online
Article
Text
id pubmed-8805717
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-88057172022-08-01 A systematic genome-wide account of binding sites for the model transcription factor Gcn4 Coey, Christopher T. Clark, David J. Genome Res Method Sequence-specific DNA-binding transcription factors are central to gene regulation. They are often associated with consensus binding sites that predict far more genomic sites than are bound in vivo. One explanation is that most sites are blocked by nucleosomes, such that only sites in nucleosome-depleted regulatory regions are bound. We compared the binding of the yeast transcription factor Gcn4 in vivo using published ChIP-seq data (546 sites) and in vitro, using a modified SELEX method (“G-SELEX”), which utilizes short genomic DNA fragments to quantify binding at all sites. We confirm that Gcn4 binds strongly to an AP-1-like sequence (TGACTCA) and weakly to half-sites. However, Gcn4 binds only some of the 1078 exact matches to this sequence, even in vitro. We show that there are only 166 copies of the high-affinity RTGACTCAY site (exact match) in the yeast genome, all occupied in vivo, largely independently of whether they are located in nucleosome-depleted or nucleosomal regions. Generally, RTGACTCAR/YTGACTCAY sites are bound much more weakly and YTGACTCAR sites are unbound, with biological implications for determining induction levels. We conclude that, to a first approximation, Gcn4 binding can be predicted using the high-affinity site, without reference to chromatin structure. We propose that transcription factor binding sites should be defined more precisely using quantitative data, allowing more accurate genome-wide prediction of binding sites and greater insight into gene regulation. Cold Spring Harbor Laboratory Press 2022-02 /pmc/articles/PMC8805717/ /pubmed/34916251 http://dx.doi.org/10.1101/gr.276080.121 Text en Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This is a work of the US Government.
spellingShingle Method
Coey, Christopher T.
Clark, David J.
A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title_full A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title_fullStr A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title_full_unstemmed A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title_short A systematic genome-wide account of binding sites for the model transcription factor Gcn4
title_sort systematic genome-wide account of binding sites for the model transcription factor gcn4
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8805717/
https://www.ncbi.nlm.nih.gov/pubmed/34916251
http://dx.doi.org/10.1101/gr.276080.121
work_keys_str_mv AT coeychristophert asystematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4
AT clarkdavidj asystematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4
AT coeychristophert systematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4
AT clarkdavidj systematicgenomewideaccountofbindingsitesforthemodeltranscriptionfactorgcn4