Cargando…

Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells

BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidde...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Yeonok, Ghosh, Debashis, Zhang, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4144088/
https://www.ncbi.nlm.nih.gov/pubmed/24884369
http://dx.doi.org/10.1186/1471-2164-15-360
_version_ 1782332006220169216
author Lee, Yeonok
Ghosh, Debashis
Zhang, Yu
author_facet Lee, Yeonok
Ghosh, Debashis
Zhang, Yu
author_sort Lee, Yeonok
collection PubMed
description BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidden Markov model (regHMM) to further investigate the potential multiple relationships between genes and histone methylation levels in mouse embryonic stem cells. RESULTS: Seven histone methylation levels are used in the study. Averaged histone modifications over non-overlapping 200 bp windows on the range transcription starting site (TSS) ± 1 Kb are used as predictors, and in total 70 explanatory variables are generated. Based on regHMM results, genes segregated into two groups, referred to as State 1 and State 2, have distinct association strengths. Genes in State 1 are better explained by histone methylation levels with R(2)=.72 while those in State 2 have weaker association strength with R(2)=.38. The regression coefficients in the two states are not very different in magnitude except in the intercept,.25 and 1.15 for State 1 and State 2, respectively. We found specific GO categories that may be attributed to the different relationships. The GO categories more frequently observed in State 2 match those of housekeeping genes, such as cytoplasm, nucleus, and protein binding. In addition, the housekeeping gene expression levels are significantly less explained by histone methylation in mouse embryonic stem cells, which is consistent with the constitutive expression patterns that would be expected. CONCLUSION: Gene expression levels are not universally affected by histone methylation levels, and the relationships between the two differ by the gene functions. The expression levels of the genes that perform the most common housekeeping genes’ GO categories are less strongly associated with histone methylation levels. We suspect that additional biological factors may also be strongly associated with the gene expression levels in State 2. We discover that the effect of the presence of CpG island in TSS ± 1 Kb is larger in State 2. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-360) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4144088
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41440882014-09-02 Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells Lee, Yeonok Ghosh, Debashis Zhang, Yu BMC Genomics Methodology Article BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidden Markov model (regHMM) to further investigate the potential multiple relationships between genes and histone methylation levels in mouse embryonic stem cells. RESULTS: Seven histone methylation levels are used in the study. Averaged histone modifications over non-overlapping 200 bp windows on the range transcription starting site (TSS) ± 1 Kb are used as predictors, and in total 70 explanatory variables are generated. Based on regHMM results, genes segregated into two groups, referred to as State 1 and State 2, have distinct association strengths. Genes in State 1 are better explained by histone methylation levels with R(2)=.72 while those in State 2 have weaker association strength with R(2)=.38. The regression coefficients in the two states are not very different in magnitude except in the intercept,.25 and 1.15 for State 1 and State 2, respectively. We found specific GO categories that may be attributed to the different relationships. The GO categories more frequently observed in State 2 match those of housekeeping genes, such as cytoplasm, nucleus, and protein binding. In addition, the housekeeping gene expression levels are significantly less explained by histone methylation in mouse embryonic stem cells, which is consistent with the constitutive expression patterns that would be expected. CONCLUSION: Gene expression levels are not universally affected by histone methylation levels, and the relationships between the two differ by the gene functions. The expression levels of the genes that perform the most common housekeeping genes’ GO categories are less strongly associated with histone methylation levels. We suspect that additional biological factors may also be strongly associated with the gene expression levels in State 2. We discover that the effect of the presence of CpG island in TSS ± 1 Kb is larger in State 2. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-360) contains supplementary material, which is available to authorized users. BioMed Central 2014-05-12 /pmc/articles/PMC4144088/ /pubmed/24884369 http://dx.doi.org/10.1186/1471-2164-15-360 Text en © Lee et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lee, Yeonok
Ghosh, Debashis
Zhang, Yu
Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title_full Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title_fullStr Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title_full_unstemmed Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title_short Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
title_sort regression hidden markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4144088/
https://www.ncbi.nlm.nih.gov/pubmed/24884369
http://dx.doi.org/10.1186/1471-2164-15-360
work_keys_str_mv AT leeyeonok regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells
AT ghoshdebashis regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells
AT zhangyu regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells