Cargando…

Modelling the conditional regulatory activity of methylated and bivalent promoters

BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions...

Descripción completa

Detalles Bibliográficos
Autores principales: Budden, David M., Hurley, Daniel G., Crampin, Edmund J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474576/
https://www.ncbi.nlm.nih.gov/pubmed/26097508
http://dx.doi.org/10.1186/s13072-015-0013-9
_version_ 1782377295060664320
author Budden, David M.
Hurley, Daniel G.
Crampin, Edmund J.
author_facet Budden, David M.
Hurley, Daniel G.
Crampin, Edmund J.
author_sort Budden, David M.
collection PubMed
description BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions that emerge when genes are subject to different regulatory mechanisms. Although chromatin immunoprecipitation-based histone modification data are often used as proxies for chromatin accessibility, the association between these variables and expression often depends upon the presence of other epigenetic markers (e.g. DNA methylation or histone variants). These conditional interactions are poorly handled by previous predictive models and reduce the reliability of downstream biological inference. RESULTS: We have previously demonstrated that integrating both transcription factor and histone modification data within a single predictive model is rendered ineffective by their statistical redundancy. In this study, we evaluate four proposed methods for quantifying gene-level DNA methylation levels and demonstrate that inclusion of these data in predictive modelling frameworks is also subject to this critical limitation in data integration. Based on the hypothesis that statistical redundancy in epigenetic data is caused by conditional regulatory interactions within a dynamic chromatin context, we construct a new gene expression model which is the first to improve prediction accuracy by unsupervised identification of latent regulatory classes. We show that DNA methylation and H2A.Z histone variant data can be interpreted in this way to identify and explore the signatures of silenced and bivalent promoters, substantially improving genome-wide predictions of mRNA transcript abundance and downstream biological inference across multiple cell lines. CONCLUSIONS: Previous models of gene expression have been applied successfully to several important problems in molecular biology, including the discovery of transcription factor roles, identification of regulatory elements responsible for differential expression patterns and comparative analysis of the transcriptome across distant species. Our analysis supports our hypothesis that statistical redundancy in epigenetic data is partially due to conditional relationships between these regulators and gene expression levels. This analysis provides insight into the heterogeneous roles of H3K4me3 and H3K27me3 in the presence of the H2A.Z histone variant (implicated in cancer progression) and how these signatures change during lineage commitment and carcinogenesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0013-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4474576
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44745762015-06-20 Modelling the conditional regulatory activity of methylated and bivalent promoters Budden, David M. Hurley, Daniel G. Crampin, Edmund J. Epigenetics Chromatin Research BACKGROUND: Predictive modelling of gene expression is a powerful framework for the in silico exploration of transcriptional regulatory interactions through the integration of high-throughput -omics data. A major limitation of previous approaches is their inability to handle conditional interactions that emerge when genes are subject to different regulatory mechanisms. Although chromatin immunoprecipitation-based histone modification data are often used as proxies for chromatin accessibility, the association between these variables and expression often depends upon the presence of other epigenetic markers (e.g. DNA methylation or histone variants). These conditional interactions are poorly handled by previous predictive models and reduce the reliability of downstream biological inference. RESULTS: We have previously demonstrated that integrating both transcription factor and histone modification data within a single predictive model is rendered ineffective by their statistical redundancy. In this study, we evaluate four proposed methods for quantifying gene-level DNA methylation levels and demonstrate that inclusion of these data in predictive modelling frameworks is also subject to this critical limitation in data integration. Based on the hypothesis that statistical redundancy in epigenetic data is caused by conditional regulatory interactions within a dynamic chromatin context, we construct a new gene expression model which is the first to improve prediction accuracy by unsupervised identification of latent regulatory classes. We show that DNA methylation and H2A.Z histone variant data can be interpreted in this way to identify and explore the signatures of silenced and bivalent promoters, substantially improving genome-wide predictions of mRNA transcript abundance and downstream biological inference across multiple cell lines. CONCLUSIONS: Previous models of gene expression have been applied successfully to several important problems in molecular biology, including the discovery of transcription factor roles, identification of regulatory elements responsible for differential expression patterns and comparative analysis of the transcriptome across distant species. Our analysis supports our hypothesis that statistical redundancy in epigenetic data is partially due to conditional relationships between these regulators and gene expression levels. This analysis provides insight into the heterogeneous roles of H3K4me3 and H3K27me3 in the presence of the H2A.Z histone variant (implicated in cancer progression) and how these signatures change during lineage commitment and carcinogenesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13072-015-0013-9) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-19 /pmc/articles/PMC4474576/ /pubmed/26097508 http://dx.doi.org/10.1186/s13072-015-0013-9 Text en © Budden et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Budden, David M.
Hurley, Daniel G.
Crampin, Edmund J.
Modelling the conditional regulatory activity of methylated and bivalent promoters
title Modelling the conditional regulatory activity of methylated and bivalent promoters
title_full Modelling the conditional regulatory activity of methylated and bivalent promoters
title_fullStr Modelling the conditional regulatory activity of methylated and bivalent promoters
title_full_unstemmed Modelling the conditional regulatory activity of methylated and bivalent promoters
title_short Modelling the conditional regulatory activity of methylated and bivalent promoters
title_sort modelling the conditional regulatory activity of methylated and bivalent promoters
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4474576/
https://www.ncbi.nlm.nih.gov/pubmed/26097508
http://dx.doi.org/10.1186/s13072-015-0013-9
work_keys_str_mv AT buddendavidm modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters
AT hurleydanielg modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters
AT crampinedmundj modellingtheconditionalregulatoryactivityofmethylatedandbivalentpromoters