Cargando…

Intergenic, gene terminal, and intragenic CpG islands in the human genome

BACKGROUND: Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from...

Descripción completa

Detalles Bibliográficos
Autores principales: Medvedeva, Yulia A, Fridman, Marina V, Oparina, Nina J, Malko, Dmitry B, Ermakova, Ekaterina O, Kulakovskiy, Ivan V, Heinzel, Andreas, Makeev, Vsevolod J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817693/
https://www.ncbi.nlm.nih.gov/pubmed/20085634
http://dx.doi.org/10.1186/1471-2164-11-48
_version_ 1782177227639619584
author Medvedeva, Yulia A
Fridman, Marina V
Oparina, Nina J
Malko, Dmitry B
Ermakova, Ekaterina O
Kulakovskiy, Ivan V
Heinzel, Andreas
Makeev, Vsevolod J
author_facet Medvedeva, Yulia A
Fridman, Marina V
Oparina, Nina J
Malko, Dmitry B
Ermakova, Ekaterina O
Kulakovskiy, Ivan V
Heinzel, Andreas
Makeev, Vsevolod J
author_sort Medvedeva, Yulia A
collection PubMed
description BACKGROUND: Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied. RESULTS: We studied CpG islands located in different regions of the human genome using methods of bioinformatics and comparative genomics. We have observed that CpG islands have a preference to overlap with exons, including exons located far from transcription start site, but usually extend well into introns. Synonymous substitution rate of CpG-containing codons becomes substantially reduced in regions where CpG islands overlap with protein-coding exons, even if they are located far downstream from transcription start site. CAGE tag analysis displayed frequent transcription start sites in all CpG islands, including those found far from transcription start sites of protein coding genes. Computational prediction and analysis of published ChIP-chip data revealed that CpG islands contain an increased number of sites recognized by Sp1 protein. CpG islands containing more CAGE tags usually also contain more Sp1 binding sites. This is especially relevant for CpG islands located in 3' gene regions. Various examples of transcription, confirmed by mRNAs or ESTs, but with no evidence of protein coding genes, were found in CAGE-enriched CpG islands located far from transcription start site of any known protein coding gene. CONCLUSIONS: CpG islands located far from transcription start sites of protein coding genes have transcription initiation activity and display Sp1 binding properties. In exons, overlapping with these islands, the synonymous substitution rate of CpG containing codons is decreased. This suggests that these CpG islands are involved in transcription initiation, possibly of some non-coding RNAs.
format Text
id pubmed-2817693
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28176932010-02-09 Intergenic, gene terminal, and intragenic CpG islands in the human genome Medvedeva, Yulia A Fridman, Marina V Oparina, Nina J Malko, Dmitry B Ermakova, Ekaterina O Kulakovskiy, Ivan V Heinzel, Andreas Makeev, Vsevolod J BMC Genomics Research Article BACKGROUND: Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied. RESULTS: We studied CpG islands located in different regions of the human genome using methods of bioinformatics and comparative genomics. We have observed that CpG islands have a preference to overlap with exons, including exons located far from transcription start site, but usually extend well into introns. Synonymous substitution rate of CpG-containing codons becomes substantially reduced in regions where CpG islands overlap with protein-coding exons, even if they are located far downstream from transcription start site. CAGE tag analysis displayed frequent transcription start sites in all CpG islands, including those found far from transcription start sites of protein coding genes. Computational prediction and analysis of published ChIP-chip data revealed that CpG islands contain an increased number of sites recognized by Sp1 protein. CpG islands containing more CAGE tags usually also contain more Sp1 binding sites. This is especially relevant for CpG islands located in 3' gene regions. Various examples of transcription, confirmed by mRNAs or ESTs, but with no evidence of protein coding genes, were found in CAGE-enriched CpG islands located far from transcription start site of any known protein coding gene. CONCLUSIONS: CpG islands located far from transcription start sites of protein coding genes have transcription initiation activity and display Sp1 binding properties. In exons, overlapping with these islands, the synonymous substitution rate of CpG containing codons is decreased. This suggests that these CpG islands are involved in transcription initiation, possibly of some non-coding RNAs. BioMed Central 2010-01-19 /pmc/articles/PMC2817693/ /pubmed/20085634 http://dx.doi.org/10.1186/1471-2164-11-48 Text en Copyright ©2010 Medvedeva et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Medvedeva, Yulia A
Fridman, Marina V
Oparina, Nina J
Malko, Dmitry B
Ermakova, Ekaterina O
Kulakovskiy, Ivan V
Heinzel, Andreas
Makeev, Vsevolod J
Intergenic, gene terminal, and intragenic CpG islands in the human genome
title Intergenic, gene terminal, and intragenic CpG islands in the human genome
title_full Intergenic, gene terminal, and intragenic CpG islands in the human genome
title_fullStr Intergenic, gene terminal, and intragenic CpG islands in the human genome
title_full_unstemmed Intergenic, gene terminal, and intragenic CpG islands in the human genome
title_short Intergenic, gene terminal, and intragenic CpG islands in the human genome
title_sort intergenic, gene terminal, and intragenic cpg islands in the human genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817693/
https://www.ncbi.nlm.nih.gov/pubmed/20085634
http://dx.doi.org/10.1186/1471-2164-11-48
work_keys_str_mv AT medvedevayuliaa intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT fridmanmarinav intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT oparinaninaj intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT malkodmitryb intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT ermakovaekaterinao intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT kulakovskiyivanv intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT heinzelandreas intergenicgeneterminalandintrageniccpgislandsinthehumangenome
AT makeevvsevolodj intergenicgeneterminalandintrageniccpgislandsinthehumangenome