Cargando…

A unified approach for quantifying and interpreting DNA shape readout by transcription factors

Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA...

Descripción completa

Detalles Bibliográficos
Autores principales: Rube, H Tomas, Rastogi, Chaitanya, Kribelbauer, Judith F, Bussemaker, Harmen J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5822049/
https://www.ncbi.nlm.nih.gov/pubmed/29472273
http://dx.doi.org/10.15252/msb.20177902
_version_ 1783301615646670848
author Rube, H Tomas
Rastogi, Chaitanya
Kribelbauer, Judith F
Bussemaker, Harmen J
author_facet Rube, H Tomas
Rastogi, Chaitanya
Kribelbauer, Judith F
Bussemaker, Harmen J
author_sort Rube, H Tomas
collection PubMed
description Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA sequence dependence of shape and TF binding, respectively, which simplifies and enhances analysis of shape readout. First, we demonstrate that linear models based on mononucleotide features alone account for 60–70% of the variance in minor groove width, roll, helix twist, and propeller twist. This explains why simple scoring matrices that ignore all dependencies between nucleotide positions can partially account for DNA shape readout by a TF. Adding dinucleotide features as sequence‐to‐shape predictors to our model, we can almost perfectly explain the shape parameters. Building on this observation, we developed a post hoc analysis method that can be used to analyze any mechanism‐agnostic protein–DNA binding model in terms of shape readout. Our insights provide an alternative strategy for using DNA shape information to enhance our understanding of how cis‐regulatory codes are interpreted by the cellular machinery.
format Online
Article
Text
id pubmed-5822049
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-58220492018-02-26 A unified approach for quantifying and interpreting DNA shape readout by transcription factors Rube, H Tomas Rastogi, Chaitanya Kribelbauer, Judith F Bussemaker, Harmen J Mol Syst Biol Methods Transcription factors (TFs) interpret DNA sequence by probing the chemical and structural properties of the nucleotide polymer. DNA shape is thought to enable a parsimonious representation of dependencies between nucleotide positions. Here, we propose a unified mathematical representation of the DNA sequence dependence of shape and TF binding, respectively, which simplifies and enhances analysis of shape readout. First, we demonstrate that linear models based on mononucleotide features alone account for 60–70% of the variance in minor groove width, roll, helix twist, and propeller twist. This explains why simple scoring matrices that ignore all dependencies between nucleotide positions can partially account for DNA shape readout by a TF. Adding dinucleotide features as sequence‐to‐shape predictors to our model, we can almost perfectly explain the shape parameters. Building on this observation, we developed a post hoc analysis method that can be used to analyze any mechanism‐agnostic protein–DNA binding model in terms of shape readout. Our insights provide an alternative strategy for using DNA shape information to enhance our understanding of how cis‐regulatory codes are interpreted by the cellular machinery. John Wiley and Sons Inc. 2018-02-22 /pmc/articles/PMC5822049/ /pubmed/29472273 http://dx.doi.org/10.15252/msb.20177902 Text en © 2018 The Authors. Published under the terms of the CC BY 4.0 license This is an open access article under the terms of the Creative Commons Attribution 4.0 (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Rube, H Tomas
Rastogi, Chaitanya
Kribelbauer, Judith F
Bussemaker, Harmen J
A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title_full A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title_fullStr A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title_full_unstemmed A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title_short A unified approach for quantifying and interpreting DNA shape readout by transcription factors
title_sort unified approach for quantifying and interpreting dna shape readout by transcription factors
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5822049/
https://www.ncbi.nlm.nih.gov/pubmed/29472273
http://dx.doi.org/10.15252/msb.20177902
work_keys_str_mv AT rubehtomas aunifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT rastogichaitanya aunifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT kribelbauerjudithf aunifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT bussemakerharmenj aunifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT rubehtomas unifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT rastogichaitanya unifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT kribelbauerjudithf unifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors
AT bussemakerharmenj unifiedapproachforquantifyingandinterpretingdnashapereadoutbytranscriptionfactors