Cargando…

Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context

Motivation: Histone modifications regulate chromatin structure and gene expression. Although nucleosome formation is known to be affected by primary DNA sequence composition, no sequence signature has been identified for histone modifications. It is known that dense H3K4me3 nucleosome sites are acco...

Descripción completa

Detalles Bibliográficos
Autores principales: Ha, Misook, Hong, Soondo, Li, Wen-Hsiung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3658463/
https://www.ncbi.nlm.nih.gov/pubmed/23511541
http://dx.doi.org/10.1093/bioinformatics/btt126
_version_ 1782270276305682432
author Ha, Misook
Hong, Soondo
Li, Wen-Hsiung
author_facet Ha, Misook
Hong, Soondo
Li, Wen-Hsiung
author_sort Ha, Misook
collection PubMed
description Motivation: Histone modifications regulate chromatin structure and gene expression. Although nucleosome formation is known to be affected by primary DNA sequence composition, no sequence signature has been identified for histone modifications. It is known that dense H3K4me3 nucleosome sites are accompanied by a low density of other nucleosomes and are associated with gene activation. This observation suggests a different sequence composition of H3K4me3 from other nucleosomes. Approach: To understand the relationship between genome sequence and chromatin structure, we studied DNA sequences at histone modification sites in various human cell types. We found sequence specificity for H3K4me3, but not for other histone modifications. Using the sequence specificities of H3 and H3K4me3 nucleosomes, we developed a model that computes the probability of H3K4me3 occupation at each base pair from the genome sequence context. Results: A comparison of our predictions with experimental data suggests a high performance of our method, revealing a strong association between H3K4me3 and specific genomic DNA context. The high probability of H3K4me3 occupation occurs at transcription start and termination sites, exon boundaries and binding sites of transcription regulators involved in chromatin modification activities, including histone acetylases and enhancer- and insulator-associated factors. Thus, the human genome sequence contains signatures for chromatin modifications essential for gene regulation and development. Our method may be applied to find new sequence elements functioning by chromatin modulation. Availability: Software and supplementary data are available at Bioinformatics online. Contact: misook.ha@samsung.com or wli@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3658463
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36584632013-05-24 Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context Ha, Misook Hong, Soondo Li, Wen-Hsiung Bioinformatics Original Papers Motivation: Histone modifications regulate chromatin structure and gene expression. Although nucleosome formation is known to be affected by primary DNA sequence composition, no sequence signature has been identified for histone modifications. It is known that dense H3K4me3 nucleosome sites are accompanied by a low density of other nucleosomes and are associated with gene activation. This observation suggests a different sequence composition of H3K4me3 from other nucleosomes. Approach: To understand the relationship between genome sequence and chromatin structure, we studied DNA sequences at histone modification sites in various human cell types. We found sequence specificity for H3K4me3, but not for other histone modifications. Using the sequence specificities of H3 and H3K4me3 nucleosomes, we developed a model that computes the probability of H3K4me3 occupation at each base pair from the genome sequence context. Results: A comparison of our predictions with experimental data suggests a high performance of our method, revealing a strong association between H3K4me3 and specific genomic DNA context. The high probability of H3K4me3 occupation occurs at transcription start and termination sites, exon boundaries and binding sites of transcription regulators involved in chromatin modification activities, including histone acetylases and enhancer- and insulator-associated factors. Thus, the human genome sequence contains signatures for chromatin modifications essential for gene regulation and development. Our method may be applied to find new sequence elements functioning by chromatin modulation. Availability: Software and supplementary data are available at Bioinformatics online. Contact: misook.ha@samsung.com or wli@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-05-01 2013-03-19 /pmc/articles/PMC3658463/ /pubmed/23511541 http://dx.doi.org/10.1093/bioinformatics/btt126 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Ha, Misook
Hong, Soondo
Li, Wen-Hsiung
Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title_full Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title_fullStr Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title_full_unstemmed Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title_short Predicting the probability of H3K4me3 occupation at a base pair from the genome sequence context
title_sort predicting the probability of h3k4me3 occupation at a base pair from the genome sequence context
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3658463/
https://www.ncbi.nlm.nih.gov/pubmed/23511541
http://dx.doi.org/10.1093/bioinformatics/btt126
work_keys_str_mv AT hamisook predictingtheprobabilityofh3k4me3occupationatabasepairfromthegenomesequencecontext
AT hongsoondo predictingtheprobabilityofh3k4me3occupationatabasepairfromthegenomesequencecontext
AT liwenhsiung predictingtheprobabilityofh3k4me3occupationatabasepairfromthegenomesequencecontext