Cargando…

Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape

The large collections of ChIP-seq data rapidly accumulating in public data warehouses provide genome-wide binding site maps for hundreds of transcription factors (TFs). However, the extent of the regulatory occupancy space in the human genome has not yet been fully apprehended by integrating public...

Descripción completa

Detalles Bibliográficos
Autores principales: Griffon, Aurélien, Barbier, Quentin, Dalino, Jordi, van Helden, Jacques, Spicuglia, Salvatore, Ballester, Benoit
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4344487/
https://www.ncbi.nlm.nih.gov/pubmed/25477382
http://dx.doi.org/10.1093/nar/gku1280
_version_ 1782359420280242176
author Griffon, Aurélien
Barbier, Quentin
Dalino, Jordi
van Helden, Jacques
Spicuglia, Salvatore
Ballester, Benoit
author_facet Griffon, Aurélien
Barbier, Quentin
Dalino, Jordi
van Helden, Jacques
Spicuglia, Salvatore
Ballester, Benoit
author_sort Griffon, Aurélien
collection PubMed
description The large collections of ChIP-seq data rapidly accumulating in public data warehouses provide genome-wide binding site maps for hundreds of transcription factors (TFs). However, the extent of the regulatory occupancy space in the human genome has not yet been fully apprehended by integrating public ChIP-seq data sets and combining it with ENCODE TFs map. To enable genome-wide identification of regulatory elements we have collected, analysed and retained 395 available ChIP-seq data sets merged with ENCODE peaks covering a total of 237 TFs. This enhanced repertoire complements and refines current genome-wide occupancy maps by increasing the human genome regulatory search space by 14% compared to ENCODE alone, and also increases the complexity of the regulatory dictionary. As a direct application we used this unified binding repertoire to annotate variant enhancer loci (VELs) from H3K4me1 mark in two cancer cell lines (MCF-7, CRC) and observed enrichments of specific TFs involved in biological key functions to cancer development and proliferation. Those enrichments of TFs within VELs provide a direct annotation of non-coding regions detected in cancer genomes. Finally, full access to this catalogue is available online together with the TFs enrichment analysis tool (http://tagc.univ-mrs.fr/remap/).
format Online
Article
Text
id pubmed-4344487
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43444872015-03-17 Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape Griffon, Aurélien Barbier, Quentin Dalino, Jordi van Helden, Jacques Spicuglia, Salvatore Ballester, Benoit Nucleic Acids Res Methods Online The large collections of ChIP-seq data rapidly accumulating in public data warehouses provide genome-wide binding site maps for hundreds of transcription factors (TFs). However, the extent of the regulatory occupancy space in the human genome has not yet been fully apprehended by integrating public ChIP-seq data sets and combining it with ENCODE TFs map. To enable genome-wide identification of regulatory elements we have collected, analysed and retained 395 available ChIP-seq data sets merged with ENCODE peaks covering a total of 237 TFs. This enhanced repertoire complements and refines current genome-wide occupancy maps by increasing the human genome regulatory search space by 14% compared to ENCODE alone, and also increases the complexity of the regulatory dictionary. As a direct application we used this unified binding repertoire to annotate variant enhancer loci (VELs) from H3K4me1 mark in two cancer cell lines (MCF-7, CRC) and observed enrichments of specific TFs involved in biological key functions to cancer development and proliferation. Those enrichments of TFs within VELs provide a direct annotation of non-coding regions detected in cancer genomes. Finally, full access to this catalogue is available online together with the TFs enrichment analysis tool (http://tagc.univ-mrs.fr/remap/). Oxford University Press 2015-02-27 2014-12-03 /pmc/articles/PMC4344487/ /pubmed/25477382 http://dx.doi.org/10.1093/nar/gku1280 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Griffon, Aurélien
Barbier, Quentin
Dalino, Jordi
van Helden, Jacques
Spicuglia, Salvatore
Ballester, Benoit
Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title_full Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title_fullStr Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title_full_unstemmed Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title_short Integrative analysis of public ChIP-seq experiments reveals a complex multi-cell regulatory landscape
title_sort integrative analysis of public chip-seq experiments reveals a complex multi-cell regulatory landscape
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4344487/
https://www.ncbi.nlm.nih.gov/pubmed/25477382
http://dx.doi.org/10.1093/nar/gku1280
work_keys_str_mv AT griffonaurelien integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape
AT barbierquentin integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape
AT dalinojordi integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape
AT vanheldenjacques integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape
AT spicugliasalvatore integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape
AT ballesterbenoit integrativeanalysisofpublicchipseqexperimentsrevealsacomplexmulticellregulatorylandscape