Cargando…

Short DNA sequence patterns accurately identify broadly active human enhancers

BACKGROUND: Enhancers are DNA regulatory elements that influence gene expression. There is substantial diversity in enhancers’ activity patterns: some enhancers drive expression in a single cellular context, while others are active across many. Sequence characteristics, such as transcription factor...

Descripción completa

Detalles Bibliográficos
Autores principales: Colbran, Laura L., Chen, Ling, Capra, John A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5512948/
https://www.ncbi.nlm.nih.gov/pubmed/28716036
http://dx.doi.org/10.1186/s12864-017-3934-9
_version_ 1783250559812239360
author Colbran, Laura L.
Chen, Ling
Capra, John A.
author_facet Colbran, Laura L.
Chen, Ling
Capra, John A.
author_sort Colbran, Laura L.
collection PubMed
description BACKGROUND: Enhancers are DNA regulatory elements that influence gene expression. There is substantial diversity in enhancers’ activity patterns: some enhancers drive expression in a single cellular context, while others are active across many. Sequence characteristics, such as transcription factor (TF) binding motifs, influence the activity patterns of regulatory sequences; however, the regulatory logic through which specific sequences drive enhancer activity patterns is poorly understood. Recent analysis of Drosophila enhancers suggested that short dinucleotide repeat motifs (DRMs) are general enhancer sequence features that drive broad regulatory activity. However, it is not known whether the regulatory role of DRMs is conserved across species. RESULTS: We performed a comprehensive analysis of the relationship between short DNA sequence patterns, including DRMs, and human enhancer activity in 38,538 enhancers across 411 different contexts. In a machine-learning framework, the occurrence patterns of short sequence motifs accurately predicted broadly active human enhancers. However, DRMs alone were weakly predictive of broad enhancer activity in humans and showed different enrichment patterns than in Drosophila. In general, GC-rich sequence motifs were significantly associated with broad enhancer activity, and consistent with this enrichment, broadly active human TFs recognize GC-rich motifs. CONCLUSIONS: Our results reveal the importance of specific sequence motifs in broadly active human enhancers, demonstrate the lack of evolutionary conservation of the role of DRMs, and provide a computational framework for investigating the logic of enhancer sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3934-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5512948
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55129482017-07-19 Short DNA sequence patterns accurately identify broadly active human enhancers Colbran, Laura L. Chen, Ling Capra, John A. BMC Genomics Research Article BACKGROUND: Enhancers are DNA regulatory elements that influence gene expression. There is substantial diversity in enhancers’ activity patterns: some enhancers drive expression in a single cellular context, while others are active across many. Sequence characteristics, such as transcription factor (TF) binding motifs, influence the activity patterns of regulatory sequences; however, the regulatory logic through which specific sequences drive enhancer activity patterns is poorly understood. Recent analysis of Drosophila enhancers suggested that short dinucleotide repeat motifs (DRMs) are general enhancer sequence features that drive broad regulatory activity. However, it is not known whether the regulatory role of DRMs is conserved across species. RESULTS: We performed a comprehensive analysis of the relationship between short DNA sequence patterns, including DRMs, and human enhancer activity in 38,538 enhancers across 411 different contexts. In a machine-learning framework, the occurrence patterns of short sequence motifs accurately predicted broadly active human enhancers. However, DRMs alone were weakly predictive of broad enhancer activity in humans and showed different enrichment patterns than in Drosophila. In general, GC-rich sequence motifs were significantly associated with broad enhancer activity, and consistent with this enrichment, broadly active human TFs recognize GC-rich motifs. CONCLUSIONS: Our results reveal the importance of specific sequence motifs in broadly active human enhancers, demonstrate the lack of evolutionary conservation of the role of DRMs, and provide a computational framework for investigating the logic of enhancer sequences. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3934-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-07-17 /pmc/articles/PMC5512948/ /pubmed/28716036 http://dx.doi.org/10.1186/s12864-017-3934-9 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Colbran, Laura L.
Chen, Ling
Capra, John A.
Short DNA sequence patterns accurately identify broadly active human enhancers
title Short DNA sequence patterns accurately identify broadly active human enhancers
title_full Short DNA sequence patterns accurately identify broadly active human enhancers
title_fullStr Short DNA sequence patterns accurately identify broadly active human enhancers
title_full_unstemmed Short DNA sequence patterns accurately identify broadly active human enhancers
title_short Short DNA sequence patterns accurately identify broadly active human enhancers
title_sort short dna sequence patterns accurately identify broadly active human enhancers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5512948/
https://www.ncbi.nlm.nih.gov/pubmed/28716036
http://dx.doi.org/10.1186/s12864-017-3934-9
work_keys_str_mv AT colbranlaural shortdnasequencepatternsaccuratelyidentifybroadlyactivehumanenhancers
AT chenling shortdnasequencepatternsaccuratelyidentifybroadlyactivehumanenhancers
AT caprajohna shortdnasequencepatternsaccuratelyidentifybroadlyactivehumanenhancers