Cargando…

Bayesian mixture regression analysis for regulation of Pluripotency in ES cells

BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels...

Descripción completa

Detalles Bibliográficos
Autores principales: Aflakparast, Mehran, Geeven, Geert, de Gunst, Mathisca C.M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941360/
https://www.ncbi.nlm.nih.gov/pubmed/31898480
http://dx.doi.org/10.1186/s12859-019-3331-2
_version_ 1783484538527154176
author Aflakparast, Mehran
Geeven, Geert
de Gunst, Mathisca C.M.
author_facet Aflakparast, Mehran
Geeven, Geert
de Gunst, Mathisca C.M.
author_sort Aflakparast, Mehran
collection PubMed
description BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions.
format Online
Article
Text
id pubmed-6941360
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69413602020-01-06 Bayesian mixture regression analysis for regulation of Pluripotency in ES cells Aflakparast, Mehran Geeven, Geert de Gunst, Mathisca C.M. BMC Bioinformatics Methodology Article BACKGROUND: Observed levels of gene expression strongly depend on both activity of DNA binding transcription factors (TFs) and chromatin state through different histone modifications (HMs). In order to recover the functional relationship between local chromatin state, TF binding and observed levels of gene expression, regression methods have proven to be useful tools. They have been successfully applied to predict mRNA levels from genome-wide experimental data and they provide insight into context-dependent gene regulatory mechanisms. However, heterogeneity arising from gene-set specific regulatory interactions is often overlooked. RESULTS: We show that regression models that predict gene expression by using experimentally derived ChIP-seq profiles of TFs can be significantly improved by mixture modelling. In order to find biologically relevant gene clusters, we employ a Bayesian allocation procedure which allows us to integrate additional biological information such as three-dimensional nuclear organization of chromosomes and gene function. The data integration procedure involves transforming the additional data into gene similarity values. We propose a generic similarity measure that is especially suitable for situations where the additional data are of both continuous and discrete type, and compare its performance with similar measures in the context of mixture modelling. CONCLUSIONS: We applied the proposed method on a data from mouse embryonic stem cells (ESC). We find that including additional data results in mixture components that exhibit biologically meaningful gene clusters, and provides valuable insight into the heterogeneity of the regulatory interactions. BioMed Central 2020-01-02 /pmc/articles/PMC6941360/ /pubmed/31898480 http://dx.doi.org/10.1186/s12859-019-3331-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Aflakparast, Mehran
Geeven, Geert
de Gunst, Mathisca C.M.
Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title_full Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title_fullStr Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title_full_unstemmed Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title_short Bayesian mixture regression analysis for regulation of Pluripotency in ES cells
title_sort bayesian mixture regression analysis for regulation of pluripotency in es cells
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6941360/
https://www.ncbi.nlm.nih.gov/pubmed/31898480
http://dx.doi.org/10.1186/s12859-019-3331-2
work_keys_str_mv AT aflakparastmehran bayesianmixtureregressionanalysisforregulationofpluripotencyinescells
AT geevengeert bayesianmixtureregressionanalysisforregulationofpluripotencyinescells
AT degunstmathiscacm bayesianmixtureregressionanalysisforregulationofpluripotencyinescells