Cargando…
Modelling the conditional regulatory activity of methylated and bivalent promoters
BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474576/ https://www.ncbi.nlm.nih.gov/pubmed/26097508 http://dx.doi.org/10.1186/s13072-015-0013-9 |
_version_ | 1782377295060664320 |
---|---|
author | Budden, David M. Hurley, Daniel G. Crampin, Edmund J. |
author_facet | Budden, David M. Hurley, Daniel G. Crampin, Edmund J. |
author_sort | Budden, David M. |
collection | PubMed |
description | BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions that emerge when genes are subject to different regulatory mechanisms. Although chromatin immunoprecipitation-based histone modification data are often used as proxies for chromatin accessibility, the association between these variables and expression often depends upon the presence of other epigenetic markers (e.g. DNA methylation or histone variants). These conditional interactions are poorly handled by previous predictive models and reduce the reliability of downstream biological inference. RESULTS: We have previously demonstrated that integrating both transcription factor and histone modification data within a single predictive model is rendered ineffective by their statistical redundancy. In this study, we evaluate four proposed methods for quantifying gene-level DNA methylation levels and demonstrate that inclusion of these data in predictive modelling frameworks is also subject to this critical limitation in data integration. Based on the hypothesis that statistical redundancy in epigenetic data is caused by conditional regulatory interactions within a dynamic chromatin context, we construct a new gene expression model which is the first to improve prediction accuracy by unsupervised identification of latent regulatory classes. We show that DNA methylation and H2A.Z histone variant data can be interpreted in this way to identify and explore the signatures of silenced and bivalent promoters, substantially improving genome-wide predictions of mRNA transcript abundance and downstream biological inference across multiple cell lines. CONCLUSIONS: Previous models of gene expression have been applied successfully to several important problems in molecular biology, including the discovery of transcription factor roles, identification of regulatory elements responsible for differential expression patterns and comparative analysis of the transcriptome across distant species. Our analysis supports our hypothesis that statistical redundancy in epigenetic data is partially due to conditional relationships between these regulators and gene expression levels. This analysis provides insight into the heterogeneous roles of H3K4me3 and H3K27me3 in the presence of the H2A.Z histone variant (implicated in cancer progression) and how these signatures change during lineage commitment and carcinogenesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0013-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4474576 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44745762015-06-20 Modelling the conditional regulatory activity of methylated and bivalent promoters Budden, David M. Hurley, Daniel G. Crampin, Edmund J. Epigenetics Chromatin Research BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions that emerge when genes are subject to different regulatory mechanisms. Although chromatin immunoprecipitation-based histone modification data are often used as proxies for chromatin accessibility, the association between these variables and expression often depends upon the presence of other epigenetic markers (e.g. DNA methylation or histone variants). These conditional interactions are poorly handled by previous predictive models and reduce the reliability of downstream biological inference. RESULTS: We have previously demonstrated that integrating both transcription factor and histone modification data within a single predictive model is rendered ineffective by their statistical redundancy. In this study, we evaluate four proposed methods for quantifying gene-level DNA methylation levels and demonstrate that inclusion of these data in predictive modelling frameworks is also subject to this critical limitation in data integration. Based on the hypothesis that statistical redundancy in epigenetic data is caused by conditional regulatory interactions within a dynamic chromatin context, we construct a new gene expression model which is the first to improve prediction accuracy by unsupervised identification of latent regulatory classes. We show that DNA methylation and H2A.Z histone variant data can be interpreted in this way to identify and explore the signatures of silenced and bivalent promoters, substantially improving genome-wide predictions of mRNA transcript abundance and downstream biological inference across multiple cell lines. CONCLUSIONS: Previous models of gene expression have been applied successfully to several important problems in molecular biology, including the discovery of transcription factor roles, identification of regulatory elements responsible for differential expression patterns and comparative analysis of the transcriptome across distant species. Our analysis supports our hypothesis that statistical redundancy in epigenetic data is partially due to conditional relationships between these regulators and gene expression levels. This analysis provides insight into the heterogeneous roles of H3K4me3 and H3K27me3 in the presence of the H2A.Z histone variant (implicated in cancer progression) and how these signatures change during lineage commitment and carcinogenesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0013-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-19 /pmc/articles/PMC4474576/ /pubmed/26097508 http://dx.doi.org/10.1186/s13072-015-0013-9 Text en © Budden et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Budden, David M. Hurley, Daniel G. Crampin, Edmund J. Modelling the conditional regulatory activity of methylated and bivalent promoters |
title | Modelling the conditional regulatory activity of methylated and bivalent promoters |
title_full | Modelling the conditional regulatory activity of methylated and bivalent promoters |
title_fullStr | Modelling the conditional regulatory activity of methylated and bivalent promoters |
title_full_unstemmed | Modelling the conditional regulatory activity of methylated and bivalent promoters |
title_short | Modelling the conditional regulatory activity of methylated and bivalent promoters |
title_sort | modelling the conditional regulatory activity of methylated and bivalent promoters |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474576/ https://www.ncbi.nlm.nih.gov/pubmed/26097508 http://dx.doi.org/10.1186/s13072-015-0013-9 |
work_keys_str_mv | AT buddendavidm modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters AT hurleydanielg modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters AT crampinedmundj modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters |