Cargando…

Accurate and sensitive quantification of protein-DNA binding affinity

Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum l...

Descripción completa

Detalles Bibliográficos
Autores principales: Rastogi, Chaitanya, Rube, H. Tomas, Kribelbauer, Judith F., Crocker, Justin, Loker, Ryan E., Martini, Gabriella D., Laptenko, Oleg, Freed-Pastor, William A., Prives, Carol, Stern, David L., Mann, Richard S., Bussemaker, Harmen J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5910815/
https://www.ncbi.nlm.nih.gov/pubmed/29610332
http://dx.doi.org/10.1073/pnas.1714376115
Descripción
Sumario:Transcription factors (TFs) control gene expression by binding to genomic DNA in a sequence-specific manner. Mutations in TF binding sites are increasingly found to be associated with human disease, yet we currently lack robust methods to predict these sites. Here, we developed a versatile maximum likelihood framework named No Read Left Behind (NRLB) that infers a biophysical model of protein-DNA recognition across the full affinity range from a library of in vitro selected DNA binding sites. NRLB predicts human Max homodimer binding in near-perfect agreement with existing low-throughput measurements. It can capture the specificity of the p53 tetramer and distinguish multiple binding modes within a single sample. Additionally, we confirm that newly identified low-affinity enhancer binding sites are functional in vivo, and that their contribution to gene expression matches their predicted affinity. Our results establish a powerful paradigm for identifying protein binding sites and interpreting gene regulatory sequences in eukaryotic genomes.