Cargando…

Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models

Transcription factors (TFs) achieve DNA‐binding specificity through contacts with functional groups of bases (base readout) and readout of structural properties of the double helix (shape readout). Currently, it remains unclear whether DNA shape readout is utilized by only a few selected TF families...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Lin, Orenstein, Yaron, Jolma, Arttu, Yin, Yimeng, Taipale, Jussi, Shamir, Ron, Rohs, Remo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327724/
https://www.ncbi.nlm.nih.gov/pubmed/28167566
http://dx.doi.org/10.15252/msb.20167238
_version_ 1782510793868181504
author Yang, Lin
Orenstein, Yaron
Jolma, Arttu
Yin, Yimeng
Taipale, Jussi
Shamir, Ron
Rohs, Remo
author_facet Yang, Lin
Orenstein, Yaron
Jolma, Arttu
Yin, Yimeng
Taipale, Jussi
Shamir, Ron
Rohs, Remo
author_sort Yang, Lin
collection PubMed
description Transcription factors (TFs) achieve DNA‐binding specificity through contacts with functional groups of bases (base readout) and readout of structural properties of the double helix (shape readout). Currently, it remains unclear whether DNA shape readout is utilized by only a few selected TF families, or whether this mechanism is used extensively by most TF families. We resequenced data from previously published HT‐SELEX experiments, the most extensive mammalian TF–DNA binding data available to date. Using these data, we demonstrated the contributions of DNA shape readout across diverse TF families and its importance in core motif‐flanking regions. Statistical machine‐learning models combined with feature‐selection techniques helped to reveal the nucleotide position‐dependent DNA shape readout in TF‐binding sites and the TF family‐specific position dependence. Based on these results, we proposed novel DNA shape logos to visualize the DNA shape preferences of TFs. Overall, this work suggests a way of obtaining mechanistic insights into TF–DNA binding without relying on experimentally solved all‐atom structures.
format Online
Article
Text
id pubmed-5327724
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-53277242017-03-01 Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models Yang, Lin Orenstein, Yaron Jolma, Arttu Yin, Yimeng Taipale, Jussi Shamir, Ron Rohs, Remo Mol Syst Biol Articles Transcription factors (TFs) achieve DNA‐binding specificity through contacts with functional groups of bases (base readout) and readout of structural properties of the double helix (shape readout). Currently, it remains unclear whether DNA shape readout is utilized by only a few selected TF families, or whether this mechanism is used extensively by most TF families. We resequenced data from previously published HT‐SELEX experiments, the most extensive mammalian TF–DNA binding data available to date. Using these data, we demonstrated the contributions of DNA shape readout across diverse TF families and its importance in core motif‐flanking regions. Statistical machine‐learning models combined with feature‐selection techniques helped to reveal the nucleotide position‐dependent DNA shape readout in TF‐binding sites and the TF family‐specific position dependence. Based on these results, we proposed novel DNA shape logos to visualize the DNA shape preferences of TFs. Overall, this work suggests a way of obtaining mechanistic insights into TF–DNA binding without relying on experimentally solved all‐atom structures. John Wiley and Sons Inc. 2017-02-06 /pmc/articles/PMC5327724/ /pubmed/28167566 http://dx.doi.org/10.15252/msb.20167238 Text en © 2017 The Authors. Published under the terms of the CC BY 4.0 license This is an open access article under the terms of the Creative Commons Attribution 4.0 (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Yang, Lin
Orenstein, Yaron
Jolma, Arttu
Yin, Yimeng
Taipale, Jussi
Shamir, Ron
Rohs, Remo
Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title_full Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title_fullStr Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title_full_unstemmed Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title_short Transcription factor family‐specific DNA shape readout revealed by quantitative specificity models
title_sort transcription factor family‐specific dna shape readout revealed by quantitative specificity models
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5327724/
https://www.ncbi.nlm.nih.gov/pubmed/28167566
http://dx.doi.org/10.15252/msb.20167238
work_keys_str_mv AT yanglin transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT orensteinyaron transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT jolmaarttu transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT yinyimeng transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT taipalejussi transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT shamirron transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels
AT rohsremo transcriptionfactorfamilyspecificdnashapereadoutrevealedbyquantitativespecificitymodels