Cargando…

Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules

Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-Seq) has opened new avenues of research in the genome-wide characterization of regulatory DNA-protein interactions at the genetic and epigenetic level. As a consequence, it has become the de facto standard for studies on the...

Descripción completa

Detalles Bibliográficos
Autores principales: Ronzio, Mirko, Zambelli, Federico, Dolfini, Diletta, Mantovani, Roberto, Pavesi, Giulio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7046753/
https://www.ncbi.nlm.nih.gov/pubmed/32153638
http://dx.doi.org/10.3389/fgene.2020.00072
_version_ 1783502015443238912
author Ronzio, Mirko
Zambelli, Federico
Dolfini, Diletta
Mantovani, Roberto
Pavesi, Giulio
author_facet Ronzio, Mirko
Zambelli, Federico
Dolfini, Diletta
Mantovani, Roberto
Pavesi, Giulio
author_sort Ronzio, Mirko
collection PubMed
description Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-Seq) has opened new avenues of research in the genome-wide characterization of regulatory DNA-protein interactions at the genetic and epigenetic level. As a consequence, it has become the de facto standard for studies on the regulation of transcription, and literally thousands of data sets for transcription factors and cofactors in different conditions and species are now available to the scientific community. However, while pipelines and best practices have been established for the analysis of a single experiment, there is still no consensus on the best way to perform an integrated analysis of multiple datasets in the same condition, in order to identify the most relevant and widespread regulatory modules composed by different transcription factors and cofactors. We present here a computational pipeline for this task, that integrates peak summit colocalization, a novel statistical framework for the evaluation of its significance, and motif enrichment analysis. We show examples of its application to ENCODE data, that led to the identification of relevant regulatory modules composed of different factors, as well as the organization on DNA of the binding motifs responsible for their recruitment.
format Online
Article
Text
id pubmed-7046753
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70467532020-03-09 Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules Ronzio, Mirko Zambelli, Federico Dolfini, Diletta Mantovani, Roberto Pavesi, Giulio Front Genet Genetics Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-Seq) has opened new avenues of research in the genome-wide characterization of regulatory DNA-protein interactions at the genetic and epigenetic level. As a consequence, it has become the de facto standard for studies on the regulation of transcription, and literally thousands of data sets for transcription factors and cofactors in different conditions and species are now available to the scientific community. However, while pipelines and best practices have been established for the analysis of a single experiment, there is still no consensus on the best way to perform an integrated analysis of multiple datasets in the same condition, in order to identify the most relevant and widespread regulatory modules composed by different transcription factors and cofactors. We present here a computational pipeline for this task, that integrates peak summit colocalization, a novel statistical framework for the evaluation of its significance, and motif enrichment analysis. We show examples of its application to ENCODE data, that led to the identification of relevant regulatory modules composed of different factors, as well as the organization on DNA of the binding motifs responsible for their recruitment. Frontiers Media S.A. 2020-02-21 /pmc/articles/PMC7046753/ /pubmed/32153638 http://dx.doi.org/10.3389/fgene.2020.00072 Text en Copyright © 2020 Ronzio, Zambelli, Dolfini, Mantovani and Pavesi http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ronzio, Mirko
Zambelli, Federico
Dolfini, Diletta
Mantovani, Roberto
Pavesi, Giulio
Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title_full Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title_fullStr Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title_full_unstemmed Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title_short Integrating Peak Colocalization and Motif Enrichment Analysis for the Discovery of Genome-Wide Regulatory Modules and Transcription Factor Recruitment Rules
title_sort integrating peak colocalization and motif enrichment analysis for the discovery of genome-wide regulatory modules and transcription factor recruitment rules
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7046753/
https://www.ncbi.nlm.nih.gov/pubmed/32153638
http://dx.doi.org/10.3389/fgene.2020.00072
work_keys_str_mv AT ronziomirko integratingpeakcolocalizationandmotifenrichmentanalysisforthediscoveryofgenomewideregulatorymodulesandtranscriptionfactorrecruitmentrules
AT zambellifederico integratingpeakcolocalizationandmotifenrichmentanalysisforthediscoveryofgenomewideregulatorymodulesandtranscriptionfactorrecruitmentrules
AT dolfinidiletta integratingpeakcolocalizationandmotifenrichmentanalysisforthediscoveryofgenomewideregulatorymodulesandtranscriptionfactorrecruitmentrules
AT mantovaniroberto integratingpeakcolocalizationandmotifenrichmentanalysisforthediscoveryofgenomewideregulatorymodulesandtranscriptionfactorrecruitmentrules
AT pavesigiulio integratingpeakcolocalizationandmotifenrichmentanalysisforthediscoveryofgenomewideregulatorymodulesandtranscriptionfactorrecruitmentrules