Cargando…
Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features
BACKGROUND: Epigenetic modifications, transcription factor (TF) availability and differences in chromatin folding influence how the genome is interpreted by the transcriptional machinery responsible for gene expression. Enhancers buried in non-coding regions are found to be associated with significa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406964/ https://www.ncbi.nlm.nih.gov/pubmed/22537144 http://dx.doi.org/10.1186/1471-2164-13-152 |
_version_ | 1782239259554480128 |
---|---|
author | Chen, Chih-yu Morris, Quaid Mitchell, Jennifer A |
author_facet | Chen, Chih-yu Morris, Quaid Mitchell, Jennifer A |
author_sort | Chen, Chih-yu |
collection | PubMed |
description | BACKGROUND: Epigenetic modifications, transcription factor (TF) availability and differences in chromatin folding influence how the genome is interpreted by the transcriptional machinery responsible for gene expression. Enhancers buried in non-coding regions are found to be associated with significant differences in histone marks between different cell types. In contrast, gene promoters show more uniform modifications across cell types. Here we used histone modification and chromatin-associated protein ChIP-Seq data sets in mouse embryonic stem (ES) cells as well as genomic features to identify functional enhancer regions. Using co-bound sites of OCT4, SOX2 and NANOG (co-OSN, validated enhancers) and co-bound sites of MYC and MYCN (limited enhancer activity) as enhancer positive and negative training sets, we performed multinomial logistic regression with LASSO regularization to identify key features. RESULTS: Cross validations reveal that a combination of p300, H3K4me1, MED12 and NIPBL features to be top signatures of co-OSN regions. Using a model from 10 signatures, 83% of top 1277 putative 1 kb enhancer regions (probability greater than or equal to 0.8) overlapped with at least one TF peak from 7 mouse ES cell ChIP-Seq data sets. These putative enhancers are associated with increased gene expression of neighbouring genes and significantly enriched in multiple TF bound loci in agreement with combinatorial models of TF binding. Furthermore, we identified several motifs of known TFs significantly enriched in putative enhancer regions compared to random promoter regions and background. Comparison with an active H3K27ac mark in various cell types confirmed cell type-specificity of these enhancers. CONCLUSIONS: The top enhancer signatures we identified (p300, H3K4me1, MED12 and NIPBL) will allow for the identification of cell type-specific enhancer regions in diverse cell types. |
format | Online Article Text |
id | pubmed-3406964 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34069642012-07-28 Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features Chen, Chih-yu Morris, Quaid Mitchell, Jennifer A BMC Genomics Research Article BACKGROUND: Epigenetic modifications, transcription factor (TF) availability and differences in chromatin folding influence how the genome is interpreted by the transcriptional machinery responsible for gene expression. Enhancers buried in non-coding regions are found to be associated with significant differences in histone marks between different cell types. In contrast, gene promoters show more uniform modifications across cell types. Here we used histone modification and chromatin-associated protein ChIP-Seq data sets in mouse embryonic stem (ES) cells as well as genomic features to identify functional enhancer regions. Using co-bound sites of OCT4, SOX2 and NANOG (co-OSN, validated enhancers) and co-bound sites of MYC and MYCN (limited enhancer activity) as enhancer positive and negative training sets, we performed multinomial logistic regression with LASSO regularization to identify key features. RESULTS: Cross validations reveal that a combination of p300, H3K4me1, MED12 and NIPBL features to be top signatures of co-OSN regions. Using a model from 10 signatures, 83% of top 1277 putative 1 kb enhancer regions (probability greater than or equal to 0.8) overlapped with at least one TF peak from 7 mouse ES cell ChIP-Seq data sets. These putative enhancers are associated with increased gene expression of neighbouring genes and significantly enriched in multiple TF bound loci in agreement with combinatorial models of TF binding. Furthermore, we identified several motifs of known TFs significantly enriched in putative enhancer regions compared to random promoter regions and background. Comparison with an active H3K27ac mark in various cell types confirmed cell type-specificity of these enhancers. CONCLUSIONS: The top enhancer signatures we identified (p300, H3K4me1, MED12 and NIPBL) will allow for the identification of cell type-specific enhancer regions in diverse cell types. BioMed Central 2012-04-26 /pmc/articles/PMC3406964/ /pubmed/22537144 http://dx.doi.org/10.1186/1471-2164-13-152 Text en Copyright ©2012 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chen, Chih-yu Morris, Quaid Mitchell, Jennifer A Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title | Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title_full | Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title_fullStr | Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title_full_unstemmed | Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title_short | Enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
title_sort | enhancer identification in mouse embryonic stem cells using integrative modeling of chromatin and genomic features |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406964/ https://www.ncbi.nlm.nih.gov/pubmed/22537144 http://dx.doi.org/10.1186/1471-2164-13-152 |
work_keys_str_mv | AT chenchihyu enhanceridentificationinmouseembryonicstemcellsusingintegrativemodelingofchromatinandgenomicfeatures AT morrisquaid enhanceridentificationinmouseembryonicstemcellsusingintegrativemodelingofchromatinandgenomicfeatures AT mitchelljennifera enhanceridentificationinmouseembryonicstemcellsusingintegrativemodelingofchromatinandgenomicfeatures |