Cargando…
Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associa...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4502539/ https://www.ncbi.nlm.nih.gov/pubmed/26180553 http://dx.doi.org/10.1186/s13072-015-0015-7 |
_version_ | 1782381226327277568 |
---|---|
author | Quang, Daniel X Erdos, Michael R Parker, Stephen C J Collins, Francis S |
author_facet | Quang, Daniel X Erdos, Michael R Parker, Stephen C J Collins, Francis S |
author_sort | Quang, Daniel X |
collection | PubMed |
description | BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associated genetic variants in disease-relevant cell types. Transcription factors (TFs) can bind to enhancers to modulate enhancer activity, and their sequence specificity can be represented by motifs. We hypothesize motifs can provide a biological context for how genetic variants contribute to disease. RESULTS: We integrated chromatin state, gene expression, and chromatin accessibility [measured as DNase I Hypersensitive Sites (DHSs)] maps across nine different cell types. Motif enrichment analyses of chromatin-defined enhancer sequences identify several known cell-type specific “master” factors. Furthermore, de novo motif discovery not only recovers many of these motifs, but also identifies novel non-canonical motifs, providing additional insight into TF binding preferences. Across the length of SEs, motifs are most enriched in DHSs, though relative enrichment is also observed outside of DHSs. Interestingly, we show that single nucleotide polymorphisms associated with diseases or quantitative traits significantly overlap motif occurrences located in SEs, but outside of DHSs. CONCLUSIONS: These results reinforce the role of SEs in influencing risk for diseases and suggest an expanded regulatory functional role for motifs that occur outside highly accessible chromatin. Furthermore, the motif signatures generated here expand our understanding of the binding preference of well-characterized TFs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0015-7) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4502539 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45025392015-07-16 Motif signatures in stretch enhancers are enriched for disease-associated genetic variants Quang, Daniel X Erdos, Michael R Parker, Stephen C J Collins, Francis S Epigenetics Chromatin Research BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associated genetic variants in disease-relevant cell types. Transcription factors (TFs) can bind to enhancers to modulate enhancer activity, and their sequence specificity can be represented by motifs. We hypothesize motifs can provide a biological context for how genetic variants contribute to disease. RESULTS: We integrated chromatin state, gene expression, and chromatin accessibility [measured as DNase I Hypersensitive Sites (DHSs)] maps across nine different cell types. Motif enrichment analyses of chromatin-defined enhancer sequences identify several known cell-type specific “master” factors. Furthermore, de novo motif discovery not only recovers many of these motifs, but also identifies novel non-canonical motifs, providing additional insight into TF binding preferences. Across the length of SEs, motifs are most enriched in DHSs, though relative enrichment is also observed outside of DHSs. Interestingly, we show that single nucleotide polymorphisms associated with diseases or quantitative traits significantly overlap motif occurrences located in SEs, but outside of DHSs. CONCLUSIONS: These results reinforce the role of SEs in influencing risk for diseases and suggest an expanded regulatory functional role for motifs that occur outside highly accessible chromatin. Furthermore, the motif signatures generated here expand our understanding of the binding preference of well-characterized TFs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0015-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-16 /pmc/articles/PMC4502539/ /pubmed/26180553 http://dx.doi.org/10.1186/s13072-015-0015-7 Text en © Quang et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Quang, Daniel X Erdos, Michael R Parker, Stephen C J Collins, Francis S Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title | Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title_full | Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title_fullStr | Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title_full_unstemmed | Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title_short | Motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
title_sort | motif signatures in stretch enhancers are enriched for disease-associated genetic variants |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4502539/ https://www.ncbi.nlm.nih.gov/pubmed/26180553 http://dx.doi.org/10.1186/s13072-015-0015-7 |
work_keys_str_mv | AT quangdanielx motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants AT erdosmichaelr motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants AT parkerstephencj motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants AT collinsfranciss motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants |