Cargando…

Motif signatures in stretch enhancers are enriched for disease-associated genetic variants

BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Quang, Daniel X, Erdos, Michael R, Parker, Stephen C J, Collins, Francis S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4502539/
https://www.ncbi.nlm.nih.gov/pubmed/26180553
http://dx.doi.org/10.1186/s13072-015-0015-7
_version_ 1782381226327277568
author Quang, Daniel X
Erdos, Michael R
Parker, Stephen C J
Collins, Francis S
author_facet Quang, Daniel X
Erdos, Michael R
Parker, Stephen C J
Collins, Francis S
author_sort Quang, Daniel X
collection PubMed
description BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associated genetic variants in disease-relevant cell types. Transcription factors (TFs) can bind to enhancers to modulate enhancer activity, and their sequence specificity can be represented by motifs. We hypothesize motifs can provide a biological context for how genetic variants contribute to disease. RESULTS: We integrated chromatin state, gene expression, and chromatin accessibility [measured as DNase I Hypersensitive Sites (DHSs)] maps across nine different cell types. Motif enrichment analyses of chromatin-defined enhancer sequences identify several known cell-type specific “master” factors. Furthermore, de novo motif discovery not only recovers many of these motifs, but also identifies novel non-canonical motifs, providing additional insight into TF binding preferences. Across the length of SEs, motifs are most enriched in DHSs, though relative enrichment is also observed outside of DHSs. Interestingly, we show that single nucleotide polymorphisms associated with diseases or quantitative traits significantly overlap motif occurrences located in SEs, but outside of DHSs. CONCLUSIONS: These results reinforce the role of SEs in influencing risk for diseases and suggest an expanded regulatory functional role for motifs that occur outside highly accessible chromatin. Furthermore, the motif signatures generated here expand our understanding of the binding preference of well-characterized TFs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0015-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4502539
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45025392015-07-16 Motif signatures in stretch enhancers are enriched for disease-associated genetic variants Quang, Daniel X Erdos, Michael R Parker, Stephen C J Collins, Francis S Epigenetics Chromatin Research BACKGROUND: Stretch enhancers (SEs) are large chromatin-defined regulatory elements that are at least 3,000 base pairs (bps) long, in contrast to the median enhancer length of 800 bps. SEs tend to be cell-type specific, regulate cell-type specific gene expression, and are enriched in disease-associated genetic variants in disease-relevant cell types. Transcription factors (TFs) can bind to enhancers to modulate enhancer activity, and their sequence specificity can be represented by motifs. We hypothesize motifs can provide a biological context for how genetic variants contribute to disease. RESULTS: We integrated chromatin state, gene expression, and chromatin accessibility [measured as DNase I Hypersensitive Sites (DHSs)] maps across nine different cell types. Motif enrichment analyses of chromatin-defined enhancer sequences identify several known cell-type specific “master” factors. Furthermore, de novo motif discovery not only recovers many of these motifs, but also identifies novel non-canonical motifs, providing additional insight into TF binding preferences. Across the length of SEs, motifs are most enriched in DHSs, though relative enrichment is also observed outside of DHSs. Interestingly, we show that single nucleotide polymorphisms associated with diseases or quantitative traits significantly overlap motif occurrences located in SEs, but outside of DHSs. CONCLUSIONS: These results reinforce the role of SEs in influencing risk for diseases and suggest an expanded regulatory functional role for motifs that occur outside highly accessible chromatin. Furthermore, the motif signatures generated here expand our understanding of the binding preference of well-characterized TFs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0015-7) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-16 /pmc/articles/PMC4502539/ /pubmed/26180553 http://dx.doi.org/10.1186/s13072-015-0015-7 Text en © Quang et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Quang, Daniel X
Erdos, Michael R
Parker, Stephen C J
Collins, Francis S
Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title_full Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title_fullStr Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title_full_unstemmed Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title_short Motif signatures in stretch enhancers are enriched for disease-associated genetic variants
title_sort motif signatures in stretch enhancers are enriched for disease-associated genetic variants
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4502539/
https://www.ncbi.nlm.nih.gov/pubmed/26180553
http://dx.doi.org/10.1186/s13072-015-0015-7
work_keys_str_mv AT quangdanielx motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants
AT erdosmichaelr motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants
AT parkerstephencj motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants
AT collinsfranciss motifsignaturesinstretchenhancersareenrichedfordiseaseassociatedgeneticvariants