Cargando…

Automated mapping of large-scale chromatin structure in ENCODE

Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical c...

Descripción completa

Detalles Bibliográficos
Autores principales: Lian, Heng, Thompson, William A., Thurman, Robert, Stamatoyannopoulos, John A., Noble, William Stafford, Lawrence, Charles E.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2519158/
https://www.ncbi.nlm.nih.gov/pubmed/18591192
http://dx.doi.org/10.1093/bioinformatics/btn335
_version_ 1782158631384383488
author Lian, Heng
Thompson, William A.
Thurman, Robert
Stamatoyannopoulos, John A.
Noble, William Stafford
Lawrence, Charles E.
author_facet Lian, Heng
Thompson, William A.
Thurman, Robert
Stamatoyannopoulos, John A.
Noble, William Stafford
Lawrence, Charles E.
author_sort Lian, Heng
collection PubMed
description Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact: jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment.
format Text
id pubmed-2519158
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-25191582009-02-25 Automated mapping of large-scale chromatin structure in ENCODE Lian, Heng Thompson, William A. Thurman, Robert Stamatoyannopoulos, John A. Noble, William Stafford Lawrence, Charles E. Bioinformatics Original Papers Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact: jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment. Oxford University Press 2008-09-01 2008-06-30 /pmc/articles/PMC2519158/ /pubmed/18591192 http://dx.doi.org/10.1093/bioinformatics/btn335 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Lian, Heng
Thompson, William A.
Thurman, Robert
Stamatoyannopoulos, John A.
Noble, William Stafford
Lawrence, Charles E.
Automated mapping of large-scale chromatin structure in ENCODE
title Automated mapping of large-scale chromatin structure in ENCODE
title_full Automated mapping of large-scale chromatin structure in ENCODE
title_fullStr Automated mapping of large-scale chromatin structure in ENCODE
title_full_unstemmed Automated mapping of large-scale chromatin structure in ENCODE
title_short Automated mapping of large-scale chromatin structure in ENCODE
title_sort automated mapping of large-scale chromatin structure in encode
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2519158/
https://www.ncbi.nlm.nih.gov/pubmed/18591192
http://dx.doi.org/10.1093/bioinformatics/btn335
work_keys_str_mv AT lianheng automatedmappingoflargescalechromatinstructureinencode
AT thompsonwilliama automatedmappingoflargescalechromatinstructureinencode
AT thurmanrobert automatedmappingoflargescalechromatinstructureinencode
AT stamatoyannopoulosjohna automatedmappingoflargescalechromatinstructureinencode
AT noblewilliamstafford automatedmappingoflargescalechromatinstructureinencode
AT lawrencecharlese automatedmappingoflargescalechromatinstructureinencode