Cargando…

Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2

BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a varie...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaplow, Irene M., Banerjee, Abhimanyu, Foo, Chuan Sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9004084/
https://www.ncbi.nlm.nih.gov/pubmed/35410161
http://dx.doi.org/10.1186/s12864-022-08486-9
_version_ 1784686215078871040
author Kaplow, Irene M.
Banerjee, Abhimanyu
Foo, Chuan Sheng
author_facet Kaplow, Irene M.
Banerjee, Abhimanyu
Foo, Chuan Sheng
author_sort Kaplow, Irene M.
collection PubMed
description BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a variety of roles in transcriptional regulation, most notably anchoring DNA loops. Previous studies found that CTCF ZFs 3–7 bind CTCF’s core motif and ZFs 9–11 bind a specific upstream motif, but the motifs of ZFs 1–2 have yet to be identified. RESULTS: We developed a new approach to identifying the binding motifs of individual DBDs of a TF through analyzing chromatin immunoprecipitation sequencing (ChIP-seq) experiments in which a single DBD is mutated: we train a deep convolutional neural network to predict whether wild-type TF binding sites are preserved in the mutant TF dataset and interpret the model. We applied this approach to mouse CTCF ChIP-seq data and identified the known binding preferences of CTCF ZFs 3–11 as well as a putative GAG binding motif for ZF 1. We analyzed other CTCF datasets to provide additional evidence that ZF 1 is associated with binding at the motif we identified, and we found that the presence of the motif for ZF 1 is associated with CTCF ChIP-seq peak strength. CONCLUSIONS: Our approach can be applied to any TF for which in vivo binding data from both the wild-type and mutated versions of the TF are available, and our findings provide new potential insights binding preferences of CTCF’s DBDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08486-9.
format Online
Article
Text
id pubmed-9004084
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90040842022-04-13 Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2 Kaplow, Irene M. Banerjee, Abhimanyu Foo, Chuan Sheng BMC Genomics Research BACKGROUND: Many transcription factors (TFs), such as multi zinc-finger (ZF) TFs, have multiple DNA binding domains (DBDs), and deciphering the DNA binding motifs of individual DBDs is a major challenge. One example of such a TF is CCCTC-binding factor (CTCF), a TF with eleven ZFs that plays a variety of roles in transcriptional regulation, most notably anchoring DNA loops. Previous studies found that CTCF ZFs 3–7 bind CTCF’s core motif and ZFs 9–11 bind a specific upstream motif, but the motifs of ZFs 1–2 have yet to be identified. RESULTS: We developed a new approach to identifying the binding motifs of individual DBDs of a TF through analyzing chromatin immunoprecipitation sequencing (ChIP-seq) experiments in which a single DBD is mutated: we train a deep convolutional neural network to predict whether wild-type TF binding sites are preserved in the mutant TF dataset and interpret the model. We applied this approach to mouse CTCF ChIP-seq data and identified the known binding preferences of CTCF ZFs 3–11 as well as a putative GAG binding motif for ZF 1. We analyzed other CTCF datasets to provide additional evidence that ZF 1 is associated with binding at the motif we identified, and we found that the presence of the motif for ZF 1 is associated with CTCF ChIP-seq peak strength. CONCLUSIONS: Our approach can be applied to any TF for which in vivo binding data from both the wild-type and mutated versions of the TF are available, and our findings provide new potential insights binding preferences of CTCF’s DBDs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-022-08486-9. BioMed Central 2022-04-12 /pmc/articles/PMC9004084/ /pubmed/35410161 http://dx.doi.org/10.1186/s12864-022-08486-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Kaplow, Irene M.
Banerjee, Abhimanyu
Foo, Chuan Sheng
Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title_full Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title_fullStr Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title_full_unstemmed Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title_short Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1–2
title_sort neural network modeling of differential binding between wild-type and mutant ctcf reveals putative binding preferences for zinc fingers 1–2
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9004084/
https://www.ncbi.nlm.nih.gov/pubmed/35410161
http://dx.doi.org/10.1186/s12864-022-08486-9
work_keys_str_mv AT kaplowirenem neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12
AT banerjeeabhimanyu neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12
AT foochuansheng neuralnetworkmodelingofdifferentialbindingbetweenwildtypeandmutantctcfrevealsputativebindingpreferencesforzincfingers12