Cargando…
Automated mapping of large-scale chromatin structure in ENCODE
Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical c...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2519158/ https://www.ncbi.nlm.nih.gov/pubmed/18591192 http://dx.doi.org/10.1093/bioinformatics/btn335 |
_version_ | 1782158631384383488 |
---|---|
author | Lian, Heng Thompson, William A. Thurman, Robert Stamatoyannopoulos, John A. Noble, William Stafford Lawrence, Charles E. |
author_facet | Lian, Heng Thompson, William A. Thurman, Robert Stamatoyannopoulos, John A. Noble, William Stafford Lawrence, Charles E. |
author_sort | Lian, Heng |
collection | PubMed |
description | Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact: jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment. |
format | Text |
id | pubmed-2519158 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-25191582009-02-25 Automated mapping of large-scale chromatin structure in ENCODE Lian, Heng Thompson, William A. Thurman, Robert Stamatoyannopoulos, John A. Noble, William Stafford Lawrence, Charles E. Bioinformatics Original Papers Motivation: A recently developed DNaseI assay has given us our first genome-wide view of chromatin structure. In addition to cataloging DNaseI hypersensitive sites, these data allows us to more completely characterize overall features of chromatin accessibility. We employed a Bayesian hierarchical change-point model (CPM), a generalization of a hidden Markov Model (HMM), to characterize tiled microarray DNaseI sensitivity data available from the ENCODE project. Results: Our analysis shows that the accessibility of chromatin to cleavage by DNaseI is well described by a four state model of local segments with each state described by a continuous mixture of Gaussian variables. The CPM produces a better fit to the observed data than the HMM. The large posterior probability for the four-state CPM suggests that the data falls naturally into four classes of regions, which we call major and minor DNaseI hypersensitive sites (DHSs), regions of intermediate sensitivity, and insensitive regions. These classes agree well with a model of chromatin in which local disruptions (DHSs) are concentrated within larger domains of intermediate sensitivity, the accessibility islands. The CPM assigns 92% of the bases within the ENCODE regions to the insensitive regions. The 5.8% of the bases that are in regions of intermediate sensitivity are clearly enriched in functional elements, including genes and activating histone modifications, while the remaining 2.2% of the bases in hypersensitive regions are very strongly enriched in these elements. Availability: The CPM software is available upon request from the authors. Contact: jstam@stamlab.org; noble@gs.washington.edu; Charles_Lawrence@brown.edu Supplementary information: Supplementary data are available at Bioinformatics online. Source code is available at http://noble.gs.washington.edu/proj/segment. Oxford University Press 2008-09-01 2008-06-30 /pmc/articles/PMC2519158/ /pubmed/18591192 http://dx.doi.org/10.1093/bioinformatics/btn335 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Lian, Heng Thompson, William A. Thurman, Robert Stamatoyannopoulos, John A. Noble, William Stafford Lawrence, Charles E. Automated mapping of large-scale chromatin structure in ENCODE |
title | Automated mapping of large-scale chromatin structure in ENCODE |
title_full | Automated mapping of large-scale chromatin structure in ENCODE |
title_fullStr | Automated mapping of large-scale chromatin structure in ENCODE |
title_full_unstemmed | Automated mapping of large-scale chromatin structure in ENCODE |
title_short | Automated mapping of large-scale chromatin structure in ENCODE |
title_sort | automated mapping of large-scale chromatin structure in encode |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2519158/ https://www.ncbi.nlm.nih.gov/pubmed/18591192 http://dx.doi.org/10.1093/bioinformatics/btn335 |
work_keys_str_mv | AT lianheng automatedmappingoflargescalechromatinstructureinencode AT thompsonwilliama automatedmappingoflargescalechromatinstructureinencode AT thurmanrobert automatedmappingoflargescalechromatinstructureinencode AT stamatoyannopoulosjohna automatedmappingoflargescalechromatinstructureinencode AT noblewilliamstafford automatedmappingoflargescalechromatinstructureinencode AT lawrencecharlese automatedmappingoflargescalechromatinstructureinencode |