Cargando…
Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells
BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidde...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4144088/ https://www.ncbi.nlm.nih.gov/pubmed/24884369 http://dx.doi.org/10.1186/1471-2164-15-360 |
_version_ | 1782332006220169216 |
---|---|
author | Lee, Yeonok Ghosh, Debashis Zhang, Yu |
author_facet | Lee, Yeonok Ghosh, Debashis Zhang, Yu |
author_sort | Lee, Yeonok |
collection | PubMed |
description | BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidden Markov model (regHMM) to further investigate the potential multiple relationships between genes and histone methylation levels in mouse embryonic stem cells. RESULTS: Seven histone methylation levels are used in the study. Averaged histone modifications over non-overlapping 200 bp windows on the range transcription starting site (TSS) ± 1 Kb are used as predictors, and in total 70 explanatory variables are generated. Based on regHMM results, genes segregated into two groups, referred to as State 1 and State 2, have distinct association strengths. Genes in State 1 are better explained by histone methylation levels with R(2)=.72 while those in State 2 have weaker association strength with R(2)=.38. The regression coefficients in the two states are not very different in magnitude except in the intercept,.25 and 1.15 for State 1 and State 2, respectively. We found specific GO categories that may be attributed to the different relationships. The GO categories more frequently observed in State 2 match those of housekeeping genes, such as cytoplasm, nucleus, and protein binding. In addition, the housekeeping gene expression levels are significantly less explained by histone methylation in mouse embryonic stem cells, which is consistent with the constitutive expression patterns that would be expected. CONCLUSION: Gene expression levels are not universally affected by histone methylation levels, and the relationships between the two differ by the gene functions. The expression levels of the genes that perform the most common housekeeping genes’ GO categories are less strongly associated with histone methylation levels. We suspect that additional biological factors may also be strongly associated with the gene expression levels in State 2. We discover that the effect of the presence of CpG island in TSS ± 1 Kb is larger in State 2. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-360) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4144088 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41440882014-09-02 Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells Lee, Yeonok Ghosh, Debashis Zhang, Yu BMC Genomics Methodology Article BACKGROUND: Studies have shown the strong association between histone modification levels and gene expression levels. The detailed relationships between the two can vary substantially due to differential regulation, and hence a simple regression model may not be adequate. We apply a regression hidden Markov model (regHMM) to further investigate the potential multiple relationships between genes and histone methylation levels in mouse embryonic stem cells. RESULTS: Seven histone methylation levels are used in the study. Averaged histone modifications over non-overlapping 200 bp windows on the range transcription starting site (TSS) ± 1 Kb are used as predictors, and in total 70 explanatory variables are generated. Based on regHMM results, genes segregated into two groups, referred to as State 1 and State 2, have distinct association strengths. Genes in State 1 are better explained by histone methylation levels with R(2)=.72 while those in State 2 have weaker association strength with R(2)=.38. The regression coefficients in the two states are not very different in magnitude except in the intercept,.25 and 1.15 for State 1 and State 2, respectively. We found specific GO categories that may be attributed to the different relationships. The GO categories more frequently observed in State 2 match those of housekeeping genes, such as cytoplasm, nucleus, and protein binding. In addition, the housekeeping gene expression levels are significantly less explained by histone methylation in mouse embryonic stem cells, which is consistent with the constitutive expression patterns that would be expected. CONCLUSION: Gene expression levels are not universally affected by histone methylation levels, and the relationships between the two differ by the gene functions. The expression levels of the genes that perform the most common housekeeping genes’ GO categories are less strongly associated with histone methylation levels. We suspect that additional biological factors may also be strongly associated with the gene expression levels in State 2. We discover that the effect of the presence of CpG island in TSS ± 1 Kb is larger in State 2. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-360) contains supplementary material, which is available to authorized users. BioMed Central 2014-05-12 /pmc/articles/PMC4144088/ /pubmed/24884369 http://dx.doi.org/10.1186/1471-2164-15-360 Text en © Lee et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Lee, Yeonok Ghosh, Debashis Zhang, Yu Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title | Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title_full | Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title_fullStr | Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title_full_unstemmed | Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title_short | Regression hidden Markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
title_sort | regression hidden markov modeling reveals heterogeneous gene expression regulation: a case study in mouse embryonic stem cells |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4144088/ https://www.ncbi.nlm.nih.gov/pubmed/24884369 http://dx.doi.org/10.1186/1471-2164-15-360 |
work_keys_str_mv | AT leeyeonok regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells AT ghoshdebashis regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells AT zhangyu regressionhiddenmarkovmodelingrevealsheterogeneousgeneexpressionregulationacasestudyinmouseembryonicstemcells |