Cargando…
Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941360/ https://www.ncbi.nlm.nih.gov/pubmed/31898480 http://dx.doi.org/10.1186/s12859-019-3331-2 |
_version_ | 1783484538527154176 |
---|---|
author | Aflakparast, Mehran Geeven, Geert de Gunst, Mathisca C.M. |
author_facet | Aflakparast, Mehran Geeven, Geert de Gunst, Mathisca C.M. |
author_sort | Aflakparast, Mehran |
collection | PubMed |
description | BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions. |
format | Online Article Text |
id | pubmed-6941360 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69413602020-01-06 Bayesian mixture regression analysis for regulation of Pluripotency in ES cells Aflakparast, Mehran Geeven, Geert de Gunst, Mathisca C.M. BMC Bioinformatics Methodology Article BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions. BioMed Central 2020-01-02 /pmc/articles/PMC6941360/ /pubmed/31898480 http://dx.doi.org/10.1186/s12859-019-3331-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Aflakparast, Mehran Geeven, Geert de Gunst, Mathisca C.M. Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title | Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title_full | Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title_fullStr | Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title_full_unstemmed | Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title_short | Bayesian mixture regression analysis for regulation of Pluripotency in ES cells |
title_sort | bayesian mixture regression analysis for regulation of pluripotency in es cells |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941360/ https://www.ncbi.nlm.nih.gov/pubmed/31898480 http://dx.doi.org/10.1186/s12859-019-3331-2 |
work_keys_str_mv | AT aflakparastmehran bayesianmixtureregressionanalysisforregulationofpluripotencyinescells AT geevengeert bayesianmixtureregressionanalysisforregulationofpluripotencyinescells AT degunstmathiscacm bayesianmixtureregressionanalysisforregulationofpluripotencyinescells |