Cargando…

Expression regulation of genes is linked to their CpG density distributions around transcription start sites

The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specif...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Hao, He, Yueying, Xue, Yue, Gao, Yi Qin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Life Science Alliance LLC 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113945/
https://www.ncbi.nlm.nih.gov/pubmed/35580989
http://dx.doi.org/10.26508/lsa.202101302
_version_ 1784709674689363968
author Tian, Hao
He, Yueying
Xue, Yue
Gao, Yi Qin
author_facet Tian, Hao
He, Yueying
Xue, Yue
Gao, Yi Qin
author_sort Tian, Hao
collection PubMed
description The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specific genes are generally characterized by low CpG density. In this study, we investigated how the CpG density distribution of a gene affects its transcription and regulation pattern. Based on the CpG density distribution around transcription starting site, by means of a semi-supervised neural network we designed, which took data augmentation into account, we divided the human genes into three categories, and genes within each cluster shared similar CpG density distribution. Not only sequence properties, these different clusters exhibited distinctly different structural features, regulatory mechanisms, correlation patterns between the expression level and CpG/TpG density, and expression and epigenetic mark variations during tumorigenesis. For instance, the activation of cluster 3 genes relies more on 3D genome reorganization, compared with cluster 1 and 2 genes, whereas cluster 2 genes showed the strongest correlation between gene expression and H3K27me3. Genes exhibiting uncoupled correlation between gene regulation and histone modifications are mainly in cluster 3. These results emphasized that the usage of epigenetic marks in gene regulation is partially rooted in the sequence property of genes such as their CpG density distribution and explained to some extent why the relation between epigenetic marks and gene expression is controversial.
format Online
Article
Text
id pubmed-9113945
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Life Science Alliance LLC
record_format MEDLINE/PubMed
spelling pubmed-91139452022-05-27 Expression regulation of genes is linked to their CpG density distributions around transcription start sites Tian, Hao He, Yueying Xue, Yue Gao, Yi Qin Life Sci Alliance Research Articles The CpG dinucleotide and its methylation behaviors play vital roles in gene regulation. Previous studies have divided genes into several categories based on the CpG intensity around transcription starting sites and found that housekeeping genes tend to possess high CpG density, whereas tissue-specific genes are generally characterized by low CpG density. In this study, we investigated how the CpG density distribution of a gene affects its transcription and regulation pattern. Based on the CpG density distribution around transcription starting site, by means of a semi-supervised neural network we designed, which took data augmentation into account, we divided the human genes into three categories, and genes within each cluster shared similar CpG density distribution. Not only sequence properties, these different clusters exhibited distinctly different structural features, regulatory mechanisms, correlation patterns between the expression level and CpG/TpG density, and expression and epigenetic mark variations during tumorigenesis. For instance, the activation of cluster 3 genes relies more on 3D genome reorganization, compared with cluster 1 and 2 genes, whereas cluster 2 genes showed the strongest correlation between gene expression and H3K27me3. Genes exhibiting uncoupled correlation between gene regulation and histone modifications are mainly in cluster 3. These results emphasized that the usage of epigenetic marks in gene regulation is partially rooted in the sequence property of genes such as their CpG density distribution and explained to some extent why the relation between epigenetic marks and gene expression is controversial. Life Science Alliance LLC 2022-05-17 /pmc/articles/PMC9113945/ /pubmed/35580989 http://dx.doi.org/10.26508/lsa.202101302 Text en © 2022 Tian et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Articles
Tian, Hao
He, Yueying
Xue, Yue
Gao, Yi Qin
Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title_full Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title_fullStr Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title_full_unstemmed Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title_short Expression regulation of genes is linked to their CpG density distributions around transcription start sites
title_sort expression regulation of genes is linked to their cpg density distributions around transcription start sites
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9113945/
https://www.ncbi.nlm.nih.gov/pubmed/35580989
http://dx.doi.org/10.26508/lsa.202101302
work_keys_str_mv AT tianhao expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites
AT heyueying expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites
AT xueyue expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites
AT gaoyiqin expressionregulationofgenesislinkedtotheircpgdensitydistributionsaroundtranscriptionstartsites