Cargando…
Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a varie...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9004084/ https://www.ncbi.nlm.nih.gov/pubmed/35410161 http://dx.doi.org/10.1186/s12864-022-08486-9 |
_version_ | 1784686215078871040 |
---|---|
author | Kaplow, Irene M. Banerjee, Abhimanyu Foo, Chuan Sheng |
author_facet | Kaplow, Irene M. Banerjee, Abhimanyu Foo, Chuan Sheng |
author_sort | Kaplow, Irene M. |
collection | PubMed |
description | BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a variety of roles in transcriptional regulation, most notably anchoring DNA loops. Previous studies found that CTCF ZFs 3–7 bind CTCF’s core motif and ZFs 9–11 bind a specific upstream motif, but the motifs of ZFs 1–2 have yet to be identified. RESULTS: We developed a new approach to identifying the binding motifs of individual DBDs of a TF through analyzing chromatin immunoprecipitation sequencing (ChIP-seq) experiments in which a single DBD is mutated: we train a deep convolutional neural network to predict whether wild-type TF binding sites are preserved in the mutant TF dataset and interpret the model. We applied this approach to mouse CTCF ChIP-seq data and identified the known binding preferences of CTCF ZFs 3–11 as well as a putative GAG binding motif for ZF 1. We analyzed other CTCF datasets to provide additional evidence that ZF 1 is associated with binding at the motif we identified, and we found that the presence of the motif for ZF 1 is associated with CTCF ChIP-seq peak strength. CONCLUSIONS: Our approach can be applied to any TF for which in vivo binding data from both the wild-type and mutated versions of the TF are available, and our findings provide new potential insights binding preferences of CTCF’s DBDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08486-9. |
format | Online Article Text |
id | pubmed-9004084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-90040842022-04-13 Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 Kaplow, Irene M. Banerjee, Abhimanyu Foo, Chuan Sheng BMC Genomics Research BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a variety of roles in transcriptional regulation, most notably anchoring DNA loops. Previous studies found that CTCF ZFs 3–7 bind CTCF’s core motif and ZFs 9–11 bind a specific upstream motif, but the motifs of ZFs 1–2 have yet to be identified. RESULTS: We developed a new approach to identifying the binding motifs of individual DBDs of a TF through analyzing chromatin immunoprecipitation sequencing (ChIP-seq) experiments in which a single DBD is mutated: we train a deep convolutional neural network to predict whether wild-type TF binding sites are preserved in the mutant TF dataset and interpret the model. We applied this approach to mouse CTCF ChIP-seq data and identified the known binding preferences of CTCF ZFs 3–11 as well as a putative GAG binding motif for ZF 1. We analyzed other CTCF datasets to provide additional evidence that ZF 1 is associated with binding at the motif we identified, and we found that the presence of the motif for ZF 1 is associated with CTCF ChIP-seq peak strength. CONCLUSIONS: Our approach can be applied to any TF for which in vivo binding data from both the wild-type and mutated versions of the TF are available, and our findings provide new potential insights binding preferences of CTCF’s DBDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08486-9. BioMed Central 2022-04-12 /pmc/articles/PMC9004084/ /pubmed/35410161 http://dx.doi.org/10.1186/s12864-022-08486-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Kaplow, Irene M. Banerjee, Abhimanyu Foo, Chuan Sheng Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title | Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title_full | Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title_fullStr | Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title_full_unstemmed | Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title_short | Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 |
title_sort | neural network modeling of differential binding between wild-type and mutant ctcf reveals putative binding preferences for zinc fingers 1–2 |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9004084/ https://www.ncbi.nlm.nih.gov/pubmed/35410161 http://dx.doi.org/10.1186/s12864-022-08486-9 |
work_keys_str_mv | AT kaplowirenem neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12 AT banerjeeabhimanyu neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12 AT foochuansheng neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12 |