Cargando…
Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines
BACKGROUND: The histones in the core of nucleosomes may be subject to covalent post-transcriptional modifications. These modifications are thought to correlate with and possibly affect various genomic functions, including transcription. Each modification may alone or in combination with other modifi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5264486/ https://www.ncbi.nlm.nih.gov/pubmed/28122488 http://dx.doi.org/10.1186/s12859-016-1418-6 |
_version_ | 1782500114944753664 |
---|---|
author | Chitsazian, Fereshteh Sadeghi, Mehdi Elahi, Elahe |
author_facet | Chitsazian, Fereshteh Sadeghi, Mehdi Elahi, Elahe |
author_sort | Chitsazian, Fereshteh |
collection | PubMed |
description | BACKGROUND: The histones in the core of nucleosomes may be subject to covalent post-transcriptional modifications. These modifications are thought to correlate with and possibly affect various genomic functions, including transcription. Each modification may alone or in combination with other modifications influence or be influenced by transcription. We aimed to identify correlations between single modifications or combinations of modifications at specific nucleosome sized gene regions with transcription activity based on global histone modification and transcription data of human CD4+ T cells and three other human cell lines. Transcription activity was defined in a binary fashion as either on or off. The analysis was done using the Classification and Regression Tree (CART) data mining protocol, and the Multifactorial Dimensionality Reduction (MDR) method was performed to confirm the CART results. These powerful methods have not previously been used for analysis of histone modification data. RESULTS: We showed that analysis of the single histone modification H2BK5ac at only four gene regions correctly predicted transcription activity status of over 75% of genes in CD4+ T-cells. The H2BK5ac modification status also had high power for prediction of gene transcription activity in the three other cell lines studied. The informative gene regions with the H2BK5ac modification were all positioned proximal to transcription initiation sites. The CART and MDR methods were appropriate tools for the analysis performed. In the study, we also developed a non-arbitrary protocol for binary classification of genes as transcriptionally active or inactive. CONCLUSIONS: The importance of H2BK5ac modification with regards to transcription control has not previously been emphasized. Analysis of this single modification at only four nucleosome sized gene regions, all of which are at or proximal to transcription initiation, has high power for prediction of gene transcription activity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1418-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5264486 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52644862017-01-30 Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines Chitsazian, Fereshteh Sadeghi, Mehdi Elahi, Elahe BMC Bioinformatics Research Article BACKGROUND: The histones in the core of nucleosomes may be subject to covalent post-transcriptional modifications. These modifications are thought to correlate with and possibly affect various genomic functions, including transcription. Each modification may alone or in combination with other modifications influence or be influenced by transcription. We aimed to identify correlations between single modifications or combinations of modifications at specific nucleosome sized gene regions with transcription activity based on global histone modification and transcription data of human CD4+ T cells and three other human cell lines. Transcription activity was defined in a binary fashion as either on or off. The analysis was done using the Classification and Regression Tree (CART) data mining protocol, and the Multifactorial Dimensionality Reduction (MDR) method was performed to confirm the CART results. These powerful methods have not previously been used for analysis of histone modification data. RESULTS: We showed that analysis of the single histone modification H2BK5ac at only four gene regions correctly predicted transcription activity status of over 75% of genes in CD4+ T-cells. The H2BK5ac modification status also had high power for prediction of gene transcription activity in the three other cell lines studied. The informative gene regions with the H2BK5ac modification were all positioned proximal to transcription initiation sites. The CART and MDR methods were appropriate tools for the analysis performed. In the study, we also developed a non-arbitrary protocol for binary classification of genes as transcriptionally active or inactive. CONCLUSIONS: The importance of H2BK5ac modification with regards to transcription control has not previously been emphasized. Analysis of this single modification at only four nucleosome sized gene regions, all of which are at or proximal to transcription initiation, has high power for prediction of gene transcription activity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1418-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-25 /pmc/articles/PMC5264486/ /pubmed/28122488 http://dx.doi.org/10.1186/s12859-016-1418-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Chitsazian, Fereshteh Sadeghi, Mehdi Elahi, Elahe Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title | Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title_full | Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title_fullStr | Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title_full_unstemmed | Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title_short | Confident gene activity prediction based on single histone modification H2BK5ac in human cell lines |
title_sort | confident gene activity prediction based on single histone modification h2bk5ac in human cell lines |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5264486/ https://www.ncbi.nlm.nih.gov/pubmed/28122488 http://dx.doi.org/10.1186/s12859-016-1418-6 |
work_keys_str_mv | AT chitsazianfereshteh confidentgeneactivitypredictionbasedonsinglehistonemodificationh2bk5acinhumancelllines AT sadeghimehdi confidentgeneactivitypredictionbasedonsinglehistonemodificationh2bk5acinhumancelllines AT elahielahe confidentgeneactivitypredictionbasedonsinglehistonemodificationh2bk5acinhumancelllines |