Cargando…

Modeling gene expression using chromatin features in various cellular contexts

BACKGROUND: Previous work has demonstrated that chromatin feature levels correlate with gene expression. The ENCODE project enables us to further explore this relationship using an unprecedented volume of data. Expression levels from more than 100,000 promoters were measured using a variety of high-...

Descripción completa

Detalles Bibliográficos
Autores principales: Dong, Xianjun, Greven, Melissa C, Kundaje, Anshul, Djebali, Sarah, Brown, James B, Cheng, Chao, Gingeras, Thomas R, Gerstein, Mark, Guigó, Roderic, Birney, Ewan, Weng, Zhiping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3491397/
https://www.ncbi.nlm.nih.gov/pubmed/22950368
http://dx.doi.org/10.1186/gb-2012-13-9-r53
Descripción
Sumario:BACKGROUND: Previous work has demonstrated that chromatin feature levels correlate with gene expression. The ENCODE project enables us to further explore this relationship using an unprecedented volume of data. Expression levels from more than 100,000 promoters were measured using a variety of high-throughput techniques applied to RNA extracted by different protocols from different cellular compartments of several human cell lines. ENCODE also generated the genome-wide mapping of eleven histone marks, one histone variant, and DNase I hypersensitivity sites in seven cell lines. RESULTS: We built a novel quantitative model to study the relationship between chromatin features and expression levels. Our study not only confirms that the general relationships found in previous studies hold across various cell lines, but also makes new suggestions about the relationship between chromatin features and gene expression levels. We found that expression status and expression levels can be predicted by different groups of chromatin features, both with high accuracy. We also found that expression levels measured by CAGE are better predicted than by RNA-PET or RNA-Seq, and different categories of chromatin features are the most predictive of expression for different RNA measurement methods. Additionally, PolyA+ RNA is overall more predictable than PolyA- RNA among different cell compartments, and PolyA+ cytosolic RNA measured with RNA-Seq is more predictable than PolyA+ nuclear RNA, while the opposite is true for PolyA- RNA. CONCLUSIONS: Our study provides new insights into transcriptional regulation by analyzing chromatin features in different cellular contexts.