Cargando…

G+C content dominates intrinsic nucleosome occupancy

BACKGROUND: The relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Tillo, Desiree, Hughes, Timothy R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2808325/
https://www.ncbi.nlm.nih.gov/pubmed/20028554
http://dx.doi.org/10.1186/1471-2105-10-442
_version_ 1782176478084988928
author Tillo, Desiree
Hughes, Timothy R
author_facet Tillo, Desiree
Hughes, Timothy R
author_sort Tillo, Desiree
collection PubMed
description BACKGROUND: The relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and sequence properties that influence DNA structure, including base content. It was recently shown by Kaplan et al. that a probabilistic model using composition of all 5-mers within a nucleosome-sized tiling window accurately predicts intrinsic nucleosome occupancy across an entire genome in vitro. However, the model is complicated, and it is not clear which specific DNA sequence properties are most important for intrinsic nucleosome-forming preferences. RESULTS: We find that a simple linear combination of only 14 simple DNA sequence attributes (G+C content, two transformations of dinucleotide composition, and the frequency of eleven 4-bp sequences) explains nucleosome occupancy in vitro and in vivo in a manner comparable to the Kaplan model. G+C content and frequency of AAAA are the most important features. G+C content is dominant, alone explaining ~50% of the variation in nucleosome occupancy in vitro. CONCLUSIONS: Our findings provide a dramatically simplified means to predict and understand intrinsic nucleosome occupancy. G+C content may dominate because it both reduces frequency of poly-A-like stretches and correlates with many other DNA structural characteristics. Since G+C content is enriched or depleted at many types of features in diverse eukaryotic genomes, our results suggest that variation in nucleotide composition may have a widespread and direct influence on chromatin structure.
format Text
id pubmed-2808325
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28083252010-01-20 G+C content dominates intrinsic nucleosome occupancy Tillo, Desiree Hughes, Timothy R BMC Bioinformatics Research article BACKGROUND: The relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and sequence properties that influence DNA structure, including base content. It was recently shown by Kaplan et al. that a probabilistic model using composition of all 5-mers within a nucleosome-sized tiling window accurately predicts intrinsic nucleosome occupancy across an entire genome in vitro. However, the model is complicated, and it is not clear which specific DNA sequence properties are most important for intrinsic nucleosome-forming preferences. RESULTS: We find that a simple linear combination of only 14 simple DNA sequence attributes (G+C content, two transformations of dinucleotide composition, and the frequency of eleven 4-bp sequences) explains nucleosome occupancy in vitro and in vivo in a manner comparable to the Kaplan model. G+C content and frequency of AAAA are the most important features. G+C content is dominant, alone explaining ~50% of the variation in nucleosome occupancy in vitro. CONCLUSIONS: Our findings provide a dramatically simplified means to predict and understand intrinsic nucleosome occupancy. G+C content may dominate because it both reduces frequency of poly-A-like stretches and correlates with many other DNA structural characteristics. Since G+C content is enriched or depleted at many types of features in diverse eukaryotic genomes, our results suggest that variation in nucleotide composition may have a widespread and direct influence on chromatin structure. BioMed Central 2009-12-22 /pmc/articles/PMC2808325/ /pubmed/20028554 http://dx.doi.org/10.1186/1471-2105-10-442 Text en Copyright ©2009 Tillo and Hughes; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Tillo, Desiree
Hughes, Timothy R
G+C content dominates intrinsic nucleosome occupancy
title G+C content dominates intrinsic nucleosome occupancy
title_full G+C content dominates intrinsic nucleosome occupancy
title_fullStr G+C content dominates intrinsic nucleosome occupancy
title_full_unstemmed G+C content dominates intrinsic nucleosome occupancy
title_short G+C content dominates intrinsic nucleosome occupancy
title_sort g+c content dominates intrinsic nucleosome occupancy
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2808325/
https://www.ncbi.nlm.nih.gov/pubmed/20028554
http://dx.doi.org/10.1186/1471-2105-10-442
work_keys_str_mv AT tillodesiree gccontentdominatesintrinsicnucleosomeoccupancy
AT hughestimothyr gccontentdominatesintrinsicnucleosomeoccupancy