Cargando…

Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks

Cellular identity relies on cell-type–specific gene expression controlled at the transcriptional level by cis-regulatory elements (CREs). CREs are unevenly distributed across the genome, giving rise to individual CREs and clusters of CREs (COREs). Technical and biological features hinder CORE identi...

Descripción completa

Detalles Bibliográficos
Autores principales: Madani Tonekaboni, Seyed Ali, Mazrooei, Parisa, Kofia, Victor, Haibe-Kains, Benjamin, Lupien, Mathieu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771399/
https://www.ncbi.nlm.nih.gov/pubmed/31533978
http://dx.doi.org/10.1101/gr.248658.119
_version_ 1783455675136868352
author Madani Tonekaboni, Seyed Ali
Mazrooei, Parisa
Kofia, Victor
Haibe-Kains, Benjamin
Lupien, Mathieu
author_facet Madani Tonekaboni, Seyed Ali
Mazrooei, Parisa
Kofia, Victor
Haibe-Kains, Benjamin
Lupien, Mathieu
author_sort Madani Tonekaboni, Seyed Ali
collection PubMed
description Cellular identity relies on cell-type–specific gene expression controlled at the transcriptional level by cis-regulatory elements (CREs). CREs are unevenly distributed across the genome, giving rise to individual CREs and clusters of CREs (COREs). Technical and biological features hinder CORE identification. We addressed these issues by developing an unsupervised machine learning approach termed clustering of genomic regions analysis method (CREAM). CREAM automates CORE detection from chromatin accessibility profiles that are enriched in CREs strongly bound by master transcription regulators, proximal to highly expressed and essential genes, and discriminating cell identity. Although COREs share similarities with super-enhancers, we highlight differences in terms of the genomic distribution and structure of these cis-regulatory units. We further show the enhanced value of COREs over super-enhancers to identify master transcription regulators, highly expressed and essential genes defining cell identity. COREs enrich at topologically associated domain (TAD) boundaries. They are also preferentially bound by the chromatin looping factors CTCF and cohesin, in contrast to super-enhancers, forming clusters of CTCF and cohesin binding regions and defining homotypic clusters of transcription regulator binding regions (HCTs). Finally, we show the clinical utility of CREAM to identify COREs across chromatin accessibility profiles to stratify more than 400 tumor samples according to their cancer type and to delineate cancer type–specific active biological pathways. Collectively, our results support the utility of CREAM to delineate COREs underlying, with greater accuracy than individual CREs or super-enhancers, the cell-type–specific biological underpinning across a wide range of normal and cancer cell types.
format Online
Article
Text
id pubmed-6771399
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-67713992019-10-21 Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks Madani Tonekaboni, Seyed Ali Mazrooei, Parisa Kofia, Victor Haibe-Kains, Benjamin Lupien, Mathieu Genome Res Method Cellular identity relies on cell-type–specific gene expression controlled at the transcriptional level by cis-regulatory elements (CREs). CREs are unevenly distributed across the genome, giving rise to individual CREs and clusters of CREs (COREs). Technical and biological features hinder CORE identification. We addressed these issues by developing an unsupervised machine learning approach termed clustering of genomic regions analysis method (CREAM). CREAM automates CORE detection from chromatin accessibility profiles that are enriched in CREs strongly bound by master transcription regulators, proximal to highly expressed and essential genes, and discriminating cell identity. Although COREs share similarities with super-enhancers, we highlight differences in terms of the genomic distribution and structure of these cis-regulatory units. We further show the enhanced value of COREs over super-enhancers to identify master transcription regulators, highly expressed and essential genes defining cell identity. COREs enrich at topologically associated domain (TAD) boundaries. They are also preferentially bound by the chromatin looping factors CTCF and cohesin, in contrast to super-enhancers, forming clusters of CTCF and cohesin binding regions and defining homotypic clusters of transcription regulator binding regions (HCTs). Finally, we show the clinical utility of CREAM to identify COREs across chromatin accessibility profiles to stratify more than 400 tumor samples according to their cancer type and to delineate cancer type–specific active biological pathways. Collectively, our results support the utility of CREAM to delineate COREs underlying, with greater accuracy than individual CREs or super-enhancers, the cell-type–specific biological underpinning across a wide range of normal and cancer cell types. Cold Spring Harbor Laboratory Press 2019-10 /pmc/articles/PMC6771399/ /pubmed/31533978 http://dx.doi.org/10.1101/gr.248658.119 Text en © 2019 Madani Tonekaboni et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Madani Tonekaboni, Seyed Ali
Mazrooei, Parisa
Kofia, Victor
Haibe-Kains, Benjamin
Lupien, Mathieu
Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title_full Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title_fullStr Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title_full_unstemmed Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title_short Identifying clusters of cis-regulatory elements underpinning TAD structures and lineage-specific regulatory networks
title_sort identifying clusters of cis-regulatory elements underpinning tad structures and lineage-specific regulatory networks
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771399/
https://www.ncbi.nlm.nih.gov/pubmed/31533978
http://dx.doi.org/10.1101/gr.248658.119
work_keys_str_mv AT madanitonekaboniseyedali identifyingclustersofcisregulatoryelementsunderpinningtadstructuresandlineagespecificregulatorynetworks
AT mazrooeiparisa identifyingclustersofcisregulatoryelementsunderpinningtadstructuresandlineagespecificregulatorynetworks
AT kofiavictor identifyingclustersofcisregulatoryelementsunderpinningtadstructuresandlineagespecificregulatorynetworks
AT haibekainsbenjamin identifyingclustersofcisregulatoryelementsunderpinningtadstructuresandlineagespecificregulatorynetworks
AT lupienmathieu identifyingclustersofcisregulatoryelementsunderpinningtadstructuresandlineagespecificregulatorynetworks