Cargando…

Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding

Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF–DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo...

Descripción completa

Detalles Bibliográficos
Autores principales: Le, Daniel D., Shimko, Tyler C., Aditham, Arjun K., Keys, Allison M., Longwell, Scott A., Orenstein, Yaron, Fordyce, Polly M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910820/
https://www.ncbi.nlm.nih.gov/pubmed/29588420
http://dx.doi.org/10.1073/pnas.1715888115
_version_ 1783316124960555008
author Le, Daniel D.
Shimko, Tyler C.
Aditham, Arjun K.
Keys, Allison M.
Longwell, Scott A.
Orenstein, Yaron
Fordyce, Polly M.
author_facet Le, Daniel D.
Shimko, Tyler C.
Aditham, Arjun K.
Keys, Allison M.
Longwell, Scott A.
Orenstein, Yaron
Fordyce, Polly M.
author_sort Le, Daniel D.
collection PubMed
description Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF–DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo and shed light on how transcriptional networks are rewired throughout evolution. Here, we present a sequencing-based TF binding assay and analysis pipeline (BET-seq, for Binding Energy Topography by sequencing) capable of providing quantitative estimates of binding energies for more than one million DNA sequences in parallel at high energetic resolution. Using this platform, we measured the binding energies associated with all possible combinations of 10 nucleotides flanking the known consensus DNA target interacting with two model yeast TFs, Pho4 and Cbf1. A large fraction of these flanking mutations change overall binding energies by an amount equal to or greater than consensus site mutations, suggesting that current definitions of TF binding sites may be too restrictive. By systematically comparing estimates of binding energies output by deep neural networks (NNs) and biophysical models trained on these data, we establish that dinucleotide (DN) specificities are sufficient to explain essentially all variance in observed binding behavior, with Cbf1 binding exhibiting significantly more nonadditivity than Pho4. NN-derived binding energies agree with orthogonal biochemical measurements and reveal that dynamically occupied sites in vivo are both energetically and mutationally distant from the highest affinity sites.
format Online
Article
Text
id pubmed-5910820
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-59108202018-04-25 Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding Le, Daniel D. Shimko, Tyler C. Aditham, Arjun K. Keys, Allison M. Longwell, Scott A. Orenstein, Yaron Fordyce, Polly M. Proc Natl Acad Sci U S A PNAS Plus Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF–DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo and shed light on how transcriptional networks are rewired throughout evolution. Here, we present a sequencing-based TF binding assay and analysis pipeline (BET-seq, for Binding Energy Topography by sequencing) capable of providing quantitative estimates of binding energies for more than one million DNA sequences in parallel at high energetic resolution. Using this platform, we measured the binding energies associated with all possible combinations of 10 nucleotides flanking the known consensus DNA target interacting with two model yeast TFs, Pho4 and Cbf1. A large fraction of these flanking mutations change overall binding energies by an amount equal to or greater than consensus site mutations, suggesting that current definitions of TF binding sites may be too restrictive. By systematically comparing estimates of binding energies output by deep neural networks (NNs) and biophysical models trained on these data, we establish that dinucleotide (DN) specificities are sufficient to explain essentially all variance in observed binding behavior, with Cbf1 binding exhibiting significantly more nonadditivity than Pho4. NN-derived binding energies agree with orthogonal biochemical measurements and reveal that dynamically occupied sites in vivo are both energetically and mutationally distant from the highest affinity sites. National Academy of Sciences 2018-04-17 2018-03-27 /pmc/articles/PMC5910820/ /pubmed/29588420 http://dx.doi.org/10.1073/pnas.1715888115 Text en Copyright © 2018 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle PNAS Plus
Le, Daniel D.
Shimko, Tyler C.
Aditham, Arjun K.
Keys, Allison M.
Longwell, Scott A.
Orenstein, Yaron
Fordyce, Polly M.
Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title_full Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title_fullStr Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title_full_unstemmed Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title_short Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
title_sort comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding
topic PNAS Plus
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910820/
https://www.ncbi.nlm.nih.gov/pubmed/29588420
http://dx.doi.org/10.1073/pnas.1715888115
work_keys_str_mv AT ledanield comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT shimkotylerc comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT adithamarjunk comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT keysallisonm comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT longwellscotta comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT orensteinyaron comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding
AT fordycepollym comprehensivehighresolutionbindingenergylandscapesrevealcontextdependenciesoftranscriptionfactorbinding