Cargando…

Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation

Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements,...

Descripción completa

Detalles Bibliográficos
Autores principales: Mourad, Raphaël, Cuvier, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4874696/
https://www.ncbi.nlm.nih.gov/pubmed/27203237
http://dx.doi.org/10.1371/journal.pcbi.1004908
_version_ 1782433078377971712
author Mourad, Raphaël
Cuvier, Olivier
author_facet Mourad, Raphaël
Cuvier, Olivier
author_sort Mourad, Raphaël
collection PubMed
description Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain borders using classical enrichment tests. Here we propose multiple logistic regression to identify those genomic features that positively or negatively influence domain border establishment or maintenance. The model is flexible, and can account for statistical interactions among multiple genomic features. Using both simulated and real data, we show that our model outperforms enrichment test and non-parametric models, such as random forests, for the identification of genomic features that influence domain borders. Using Drosophila Hi-C data at a very high resolution of 1 kb, our model suggests that, among architectural proteins, BEAF-32 and CP190 are the main positive drivers of 3D domain borders. In humans, our model identifies well-known architectural proteins CTCF and cohesin, as well as ZNF143 and Polycomb group proteins as positive drivers of domain borders. The model also reveals the existence of several negative drivers that counteract the presence of domain borders including P300, RXRA, BCL11A and ELK1.
format Online
Article
Text
id pubmed-4874696
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48746962016-06-09 Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation Mourad, Raphaël Cuvier, Olivier PLoS Comput Biol Research Article Recent advances in long-range Hi-C contact mapping have revealed the importance of the 3D structure of chromosomes in gene expression. A current challenge is to identify the key molecular drivers of this 3D structure. Several genomic features, such as architectural proteins and functional elements, were shown to be enriched at topological domain borders using classical enrichment tests. Here we propose multiple logistic regression to identify those genomic features that positively or negatively influence domain border establishment or maintenance. The model is flexible, and can account for statistical interactions among multiple genomic features. Using both simulated and real data, we show that our model outperforms enrichment test and non-parametric models, such as random forests, for the identification of genomic features that influence domain borders. Using Drosophila Hi-C data at a very high resolution of 1 kb, our model suggests that, among architectural proteins, BEAF-32 and CP190 are the main positive drivers of 3D domain borders. In humans, our model identifies well-known architectural proteins CTCF and cohesin, as well as ZNF143 and Polycomb group proteins as positive drivers of domain borders. The model also reveals the existence of several negative drivers that counteract the presence of domain borders including P300, RXRA, BCL11A and ELK1. Public Library of Science 2016-05-20 /pmc/articles/PMC4874696/ /pubmed/27203237 http://dx.doi.org/10.1371/journal.pcbi.1004908 Text en © 2016 Mourad, Cuvier http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Mourad, Raphaël
Cuvier, Olivier
Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title_full Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title_fullStr Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title_full_unstemmed Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title_short Computational Identification of Genomic Features That Influence 3D Chromatin Domain Formation
title_sort computational identification of genomic features that influence 3d chromatin domain formation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4874696/
https://www.ncbi.nlm.nih.gov/pubmed/27203237
http://dx.doi.org/10.1371/journal.pcbi.1004908
work_keys_str_mv AT mouradraphael computationalidentificationofgenomicfeaturesthatinfluence3dchromatindomainformation
AT cuvierolivier computationalidentificationofgenomicfeaturesthatinfluence3dchromatindomainformation