Cargando…

Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA

BACKGROUND: ChIP-seq provides a wealth of information on the approximate location of DNA-binding proteins genome-wide. It is known that the targeted motifs in most cases can be found at the peak centers. A high resolution mapping of ChIP-seq peaks could in principle allow the fine mapping of the pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Nagy, Gergely, Czipa, Erik, Steiner, László, Nagy, Tibor, Pongor, Sándor, Nagy, László, Barta, Endre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986361/
https://www.ncbi.nlm.nih.gov/pubmed/27526722
http://dx.doi.org/10.1186/s12864-016-2940-7
_version_ 1782448191929581568
author Nagy, Gergely
Czipa, Erik
Steiner, László
Nagy, Tibor
Pongor, Sándor
Nagy, László
Barta, Endre
author_facet Nagy, Gergely
Czipa, Erik
Steiner, László
Nagy, Tibor
Pongor, Sándor
Nagy, László
Barta, Endre
author_sort Nagy, Gergely
collection PubMed
description BACKGROUND: ChIP-seq provides a wealth of information on the approximate location of DNA-binding proteins genome-wide. It is known that the targeted motifs in most cases can be found at the peak centers. A high resolution mapping of ChIP-seq peaks could in principle allow the fine mapping of the protein constituents within protein complexes, but the current ChIP-seq analysis pipelines do not target the basepair resolution strand specific mapping of peak summits. RESULTS: The approach proposed here is based on i) locating regions that are bound by a sufficient number of proteins constituting a complex; ii) determining the position of the underlying motif using either a direct or a de novo motif search approach; and iii) determining the exact location of the peak summits with respect to the binding motif in a strand specific manner. We applied this method for analyzing the CTCF/cohesin complex, which holds together DNA loops. The relative positions of the constituents of the complex were determined with one-basepair estimated accuracy. Mapping the positions on a 3D model of DNA made it possible to deduce the approximate local topology of the complex that allowed us to predict how the CTCF/cohesin complex locks the DNA loops. As the positioning of the proteins was not compatible with previous models of loop closure, we proposed a plausible “double embrace” model in which the DNA loop is held together by two adjacent cohesin rings in such a way that the ring anchored by CTCF to one DNA duplex encircles the other DNA double helix and vice versa. CONCLUSIONS: A motif-centered, strand specific analysis of ChIP-seq data improves the accuracy of determining peak positions. If a genome contains a large number of binding sites for a given protein complex, such as transcription factor heterodimers or transcription factor/cofactor complexes, the relative position of the constituent proteins on the DNA can be established with an accuracy that allow one to deduce the local topology of the protein complex. The proposed high resolution mapping approach of ChIP-seq data is applicable for detecting the contact topology of DNA-binding protein complexes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2940-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4986361
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49863612016-08-17 Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA Nagy, Gergely Czipa, Erik Steiner, László Nagy, Tibor Pongor, Sándor Nagy, László Barta, Endre BMC Genomics Research Article BACKGROUND: ChIP-seq provides a wealth of information on the approximate location of DNA-binding proteins genome-wide. It is known that the targeted motifs in most cases can be found at the peak centers. A high resolution mapping of ChIP-seq peaks could in principle allow the fine mapping of the protein constituents within protein complexes, but the current ChIP-seq analysis pipelines do not target the basepair resolution strand specific mapping of peak summits. RESULTS: The approach proposed here is based on i) locating regions that are bound by a sufficient number of proteins constituting a complex; ii) determining the position of the underlying motif using either a direct or a de novo motif search approach; and iii) determining the exact location of the peak summits with respect to the binding motif in a strand specific manner. We applied this method for analyzing the CTCF/cohesin complex, which holds together DNA loops. The relative positions of the constituents of the complex were determined with one-basepair estimated accuracy. Mapping the positions on a 3D model of DNA made it possible to deduce the approximate local topology of the complex that allowed us to predict how the CTCF/cohesin complex locks the DNA loops. As the positioning of the proteins was not compatible with previous models of loop closure, we proposed a plausible “double embrace” model in which the DNA loop is held together by two adjacent cohesin rings in such a way that the ring anchored by CTCF to one DNA duplex encircles the other DNA double helix and vice versa. CONCLUSIONS: A motif-centered, strand specific analysis of ChIP-seq data improves the accuracy of determining peak positions. If a genome contains a large number of binding sites for a given protein complex, such as transcription factor heterodimers or transcription factor/cofactor complexes, the relative position of the constituent proteins on the DNA can be established with an accuracy that allow one to deduce the local topology of the protein complex. The proposed high resolution mapping approach of ChIP-seq data is applicable for detecting the contact topology of DNA-binding protein complexes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2940-7) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-15 /pmc/articles/PMC4986361/ /pubmed/27526722 http://dx.doi.org/10.1186/s12864-016-2940-7 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Nagy, Gergely
Czipa, Erik
Steiner, László
Nagy, Tibor
Pongor, Sándor
Nagy, László
Barta, Endre
Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title_full Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title_fullStr Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title_full_unstemmed Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title_short Motif oriented high-resolution analysis of ChIP-seq data reveals the topological order of CTCF and cohesin proteins on DNA
title_sort motif oriented high-resolution analysis of chip-seq data reveals the topological order of ctcf and cohesin proteins on dna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986361/
https://www.ncbi.nlm.nih.gov/pubmed/27526722
http://dx.doi.org/10.1186/s12864-016-2940-7
work_keys_str_mv AT nagygergely motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT czipaerik motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT steinerlaszlo motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT nagytibor motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT pongorsandor motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT nagylaszlo motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna
AT bartaendre motiforientedhighresolutionanalysisofchipseqdatarevealsthetopologicalorderofctcfandcohesinproteinsondna