Cargando…
Expression regulation of genes is linked to their CpG density distributions around transcription start sites
The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specif...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Life Science Alliance LLC
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113945/ https://www.ncbi.nlm.nih.gov/pubmed/35580989 http://dx.doi.org/10.26508/lsa.202101302 |
_version_ | 1784709674689363968 |
---|---|
author | Tian, Hao He, Yueying Xue, Yue Gao, Yi Qin |
author_facet | Tian, Hao He, Yueying Xue, Yue Gao, Yi Qin |
author_sort | Tian, Hao |
collection | PubMed |
description | The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specific genes are generally characterized by low CpG density. In this study, we investigated how the CpG density distribution of a gene affects its transcription and regulation pattern. Based on the CpG density distribution around transcription starting site, by means of a semi-supervised neural network we designed, which took data augmentation into account, we divided the human genes into three categories, and genes within each cluster shared similar CpG density distribution. Not only sequence properties, these different clusters exhibited distinctly different structural features, regulatory mechanisms, correlation patterns between the expression level and CpG/TpG density, and expression and epigenetic mark variations during tumorigenesis. For instance, the activation of cluster 3 genes relies more on 3D genome reorganization, compared with cluster 1 and 2 genes, whereas cluster 2 genes showed the strongest correlation between gene expression and H3K27me3. Genes exhibiting uncoupled correlation between gene regulation and histone modifications are mainly in cluster 3. These results emphasized that the usage of epigenetic marks in gene regulation is partially rooted in the sequence property of genes such as their CpG density distribution and explained to some extent why the relation between epigenetic marks and gene expression is controversial. |
format | Online Article Text |
id | pubmed-9113945 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Life Science Alliance LLC |
record_format | MEDLINE/PubMed |
spelling | pubmed-91139452022-05-27 Expression regulation of genes is linked to their CpG density distributions around transcription start sites Tian, Hao He, Yueying Xue, Yue Gao, Yi Qin Life Sci Alliance Research Articles The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specific genes are generally characterized by low CpG density. In this study, we investigated how the CpG density distribution of a gene affects its transcription and regulation pattern. Based on the CpG density distribution around transcription starting site, by means of a semi-supervised neural network we designed, which took data augmentation into account, we divided the human genes into three categories, and genes within each cluster shared similar CpG density distribution. Not only sequence properties, these different clusters exhibited distinctly different structural features, regulatory mechanisms, correlation patterns between the expression level and CpG/TpG density, and expression and epigenetic mark variations during tumorigenesis. For instance, the activation of cluster 3 genes relies more on 3D genome reorganization, compared with cluster 1 and 2 genes, whereas cluster 2 genes showed the strongest correlation between gene expression and H3K27me3. Genes exhibiting uncoupled correlation between gene regulation and histone modifications are mainly in cluster 3. These results emphasized that the usage of epigenetic marks in gene regulation is partially rooted in the sequence property of genes such as their CpG density distribution and explained to some extent why the relation between epigenetic marks and gene expression is controversial. Life Science Alliance LLC 2022-05-17 /pmc/articles/PMC9113945/ /pubmed/35580989 http://dx.doi.org/10.26508/lsa.202101302 Text en © 2022 Tian et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Articles Tian, Hao He, Yueying Xue, Yue Gao, Yi Qin Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title | Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title_full | Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title_fullStr | Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title_full_unstemmed | Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title_short | Expression regulation of genes is linked to their CpG density distributions around transcription start sites |
title_sort | expression regulation of genes is linked to their cpg density distributions around transcription start sites |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113945/ https://www.ncbi.nlm.nih.gov/pubmed/35580989 http://dx.doi.org/10.26508/lsa.202101302 |
work_keys_str_mv | AT tianhao expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites AT heyueying expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites AT xueyue expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites AT gaoyiqin expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites |