Cargando…

Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments

MOTIVATION: High-throughput chromatin immunoprecipitation (ChIP) sequencing-based assays capture genomic regions associated with the profiled transcription factor (TF). ChIP-exo is a modified protocol, which uses lambda exonuclease to digest DNA close to the TF-DNA complex, in order to improve on th...

Descripción completa

Detalles Bibliográficos
Autores principales: Biswas, Anushua, Narlikar, Leelavati
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275329/
https://www.ncbi.nlm.nih.gov/pubmed/34252930
http://dx.doi.org/10.1093/bioinformatics/btab274
_version_ 1783721691001651200
author Biswas, Anushua
Narlikar, Leelavati
author_facet Biswas, Anushua
Narlikar, Leelavati
author_sort Biswas, Anushua
collection PubMed
description MOTIVATION: High-throughput chromatin immunoprecipitation (ChIP) sequencing-based assays capture genomic regions associated with the profiled transcription factor (TF). ChIP-exo is a modified protocol, which uses lambda exonuclease to digest DNA close to the TF-DNA complex, in order to improve on the positional resolution of the TF-DNA contact. Because the digestion occurs in the 5 [Formula: see text] –3 [Formula: see text] orientation, the protocol produces directional footprints close to the complex, on both sides of the double stranded DNA. Like all ChIP-based methods, ChIP-exo reports a mixture of different regions associated with the TF: those bound directly to the TF as well as via intermediaries. However, the distribution of footprints are likely to be indicative of the complex forming at the DNA. RESULTS: We present ExoDiversity, which uses a model-based framework to learn a joint distribution over footprints and motifs, thus resolving the mixture of ChIP-exo footprints into diverse binding modes. It uses no prior motif or TF information and automatically learns the number of different modes from the data. We show its application on a wide range of TFs and organisms/cell-types. Because its goal is to explain the complete set of reported regions, it is able to identify co-factor TF motifs that appear in a small fraction of the dataset. Further, ExoDiversity discovers small nucleotide variations within and outside canonical motifs, which co-occur with variations in footprints, suggesting that the TF-DNA structural configuration at those regions is likely to be different. Finally, we show that detected modes have specific DNA shape features and conservation signals, giving insights into the structure and function of the putative TF-DNA complexes. AVAILABILITY AND IMPLEMENTATION: The code for ExoDiversity is available on https://github.com/NarlikarLab/exoDIVERSITY. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8275329
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-82753292021-07-13 Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments Biswas, Anushua Narlikar, Leelavati Bioinformatics Regulatory and Functional Genomics MOTIVATION: High-throughput chromatin immunoprecipitation (ChIP) sequencing-based assays capture genomic regions associated with the profiled transcription factor (TF). ChIP-exo is a modified protocol, which uses lambda exonuclease to digest DNA close to the TF-DNA complex, in order to improve on the positional resolution of the TF-DNA contact. Because the digestion occurs in the 5 [Formula: see text] –3 [Formula: see text] orientation, the protocol produces directional footprints close to the complex, on both sides of the double stranded DNA. Like all ChIP-based methods, ChIP-exo reports a mixture of different regions associated with the TF: those bound directly to the TF as well as via intermediaries. However, the distribution of footprints are likely to be indicative of the complex forming at the DNA. RESULTS: We present ExoDiversity, which uses a model-based framework to learn a joint distribution over footprints and motifs, thus resolving the mixture of ChIP-exo footprints into diverse binding modes. It uses no prior motif or TF information and automatically learns the number of different modes from the data. We show its application on a wide range of TFs and organisms/cell-types. Because its goal is to explain the complete set of reported regions, it is able to identify co-factor TF motifs that appear in a small fraction of the dataset. Further, ExoDiversity discovers small nucleotide variations within and outside canonical motifs, which co-occur with variations in footprints, suggesting that the TF-DNA structural configuration at those regions is likely to be different. Finally, we show that detected modes have specific DNA shape features and conservation signals, giving insights into the structure and function of the putative TF-DNA complexes. AVAILABILITY AND IMPLEMENTATION: The code for ExoDiversity is available on https://github.com/NarlikarLab/exoDIVERSITY. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-12 /pmc/articles/PMC8275329/ /pubmed/34252930 http://dx.doi.org/10.1093/bioinformatics/btab274 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regulatory and Functional Genomics
Biswas, Anushua
Narlikar, Leelavati
Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title_full Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title_fullStr Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title_full_unstemmed Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title_short Resolving diverse protein–DNA footprints from exonuclease-based ChIP experiments
title_sort resolving diverse protein–dna footprints from exonuclease-based chip experiments
topic Regulatory and Functional Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275329/
https://www.ncbi.nlm.nih.gov/pubmed/34252930
http://dx.doi.org/10.1093/bioinformatics/btab274
work_keys_str_mv AT biswasanushua resolvingdiverseproteindnafootprintsfromexonucleasebasedchipexperiments
AT narlikarleelavati resolvingdiverseproteindnafootprintsfromexonucleasebasedchipexperiments