Cargando…

An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding

BACKGROUND: Transcription factor (TF) binding specificity is determined via a complex interplay between the transcription factor’s DNA binding preference and cell type-specific chromatin environments. The chromatin features that correlate with transcription factor binding in a given cell type have b...

Descripción completa

Detalles Bibliográficos
Autores principales: Srivastava, Divyanshi, Aydin, Begüm, Mazzoni, Esteban O., Mahony, Shaun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7788824/
https://www.ncbi.nlm.nih.gov/pubmed/33413545
http://dx.doi.org/10.1186/s13059-020-02218-6
_version_ 1783633107369328640
author Srivastava, Divyanshi
Aydin, Begüm
Mazzoni, Esteban O.
Mahony, Shaun
author_facet Srivastava, Divyanshi
Aydin, Begüm
Mazzoni, Esteban O.
Mahony, Shaun
author_sort Srivastava, Divyanshi
collection PubMed
description BACKGROUND: Transcription factor (TF) binding specificity is determined via a complex interplay between the transcription factor’s DNA binding preference and cell type-specific chromatin environments. The chromatin features that correlate with transcription factor binding in a given cell type have been well characterized. For instance, the binding sites for a majority of transcription factors display concurrent chromatin accessibility. However, concurrent chromatin features reflect the binding activities of the transcription factor itself and thus provide limited insight into how genome-wide TF-DNA binding patterns became established in the first place. To understand the determinants of transcription factor binding specificity, we therefore need to examine how newly activated transcription factors interact with sequence and preexisting chromatin landscapes. RESULTS: Here, we investigate the sequence and preexisting chromatin predictors of TF-DNA binding by examining the genome-wide occupancy of transcription factors that have been induced in well-characterized chromatin environments. We develop Bichrom, a bimodal neural network that jointly models sequence and preexisting chromatin data to interpret the genome-wide binding patterns of induced transcription factors. We find that the preexisting chromatin landscape is a differential global predictor of TF-DNA binding; incorporating preexisting chromatin features improves our ability to explain the binding specificity of some transcription factors substantially, but not others. Furthermore, by analyzing site-level predictors, we show that transcription factor binding in previously inaccessible chromatin tends to correspond to the presence of more favorable cognate DNA sequences. CONCLUSIONS: Bichrom thus provides a framework for modeling, interpreting, and visualizing the joint sequence and chromatin landscapes that determine TF-DNA binding dynamics.
format Online
Article
Text
id pubmed-7788824
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77888242021-01-07 An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding Srivastava, Divyanshi Aydin, Begüm Mazzoni, Esteban O. Mahony, Shaun Genome Biol Research BACKGROUND: Transcription factor (TF) binding specificity is determined via a complex interplay between the transcription factor’s DNA binding preference and cell type-specific chromatin environments. The chromatin features that correlate with transcription factor binding in a given cell type have been well characterized. For instance, the binding sites for a majority of transcription factors display concurrent chromatin accessibility. However, concurrent chromatin features reflect the binding activities of the transcription factor itself and thus provide limited insight into how genome-wide TF-DNA binding patterns became established in the first place. To understand the determinants of transcription factor binding specificity, we therefore need to examine how newly activated transcription factors interact with sequence and preexisting chromatin landscapes. RESULTS: Here, we investigate the sequence and preexisting chromatin predictors of TF-DNA binding by examining the genome-wide occupancy of transcription factors that have been induced in well-characterized chromatin environments. We develop Bichrom, a bimodal neural network that jointly models sequence and preexisting chromatin data to interpret the genome-wide binding patterns of induced transcription factors. We find that the preexisting chromatin landscape is a differential global predictor of TF-DNA binding; incorporating preexisting chromatin features improves our ability to explain the binding specificity of some transcription factors substantially, but not others. Furthermore, by analyzing site-level predictors, we show that transcription factor binding in previously inaccessible chromatin tends to correspond to the presence of more favorable cognate DNA sequences. CONCLUSIONS: Bichrom thus provides a framework for modeling, interpreting, and visualizing the joint sequence and chromatin landscapes that determine TF-DNA binding dynamics. BioMed Central 2021-01-07 /pmc/articles/PMC7788824/ /pubmed/33413545 http://dx.doi.org/10.1186/s13059-020-02218-6 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Srivastava, Divyanshi
Aydin, Begüm
Mazzoni, Esteban O.
Mahony, Shaun
An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title_full An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title_fullStr An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title_full_unstemmed An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title_short An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
title_sort interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7788824/
https://www.ncbi.nlm.nih.gov/pubmed/33413545
http://dx.doi.org/10.1186/s13059-020-02218-6
work_keys_str_mv AT srivastavadivyanshi aninterpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT aydinbegum aninterpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT mazzoniestebano aninterpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT mahonyshaun aninterpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT srivastavadivyanshi interpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT aydinbegum interpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT mazzoniestebano interpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding
AT mahonyshaun interpretablebimodalneuralnetworkcharacterizesthesequenceandpreexistingchromatinpredictorsofinducedtranscriptionfactorbinding