Cargando…

CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?

BACKGROUND: CpG islands (CGIs), clusters of CpG dinucleotides in GC-rich regions, are often located in the 5' end of genes and considered gene markers. Hackenberg et al. (2006) recently developed a new algorithm, CpGcluster, which uses a completely different mathematical approach from previous...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Leng, Zhao, Zhongming
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652441/
https://www.ncbi.nlm.nih.gov/pubmed/19232104
http://dx.doi.org/10.1186/1471-2105-10-65
_version_ 1782165238928375808
author Han, Leng
Zhao, Zhongming
author_facet Han, Leng
Zhao, Zhongming
author_sort Han, Leng
collection PubMed
description BACKGROUND: CpG islands (CGIs), clusters of CpG dinucleotides in GC-rich regions, are often located in the 5' end of genes and considered gene markers. Hackenberg et al. (2006) recently developed a new algorithm, CpGcluster, which uses a completely different mathematical approach from previous traditional algorithms. Their evaluation suggests that CpGcluster provides a much more efficient approach to detecting functional clusters or islands of CpGs. RESULTS: We systematically compared CpGcluster with the traditional algorithm by Takai and Jones (2002). Our comparisons of (1) the number of islands versus the number of genes in a genome, (2) the distribution of islands in different genomic regions, (3) island length, (4) the distance between two neighboring islands, and (5) methylation status suggest that Takai and Jones' algorithm is overall more appropriate for identifying promoter-associated islands of CpGs in vertebrate genomes. CONCLUSION: The generation of genome sequence and DNA methylation data is expected to accelerate greatly. The information in this study is important for its extensive utility in gene feature analysis and epigenomics including gene prediction and methylation chip design in different genomes.
format Text
id pubmed-2652441
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26524412009-03-09 CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome? Han, Leng Zhao, Zhongming BMC Bioinformatics Research Article BACKGROUND: CpG islands (CGIs), clusters of CpG dinucleotides in GC-rich regions, are often located in the 5' end of genes and considered gene markers. Hackenberg et al. (2006) recently developed a new algorithm, CpGcluster, which uses a completely different mathematical approach from previous traditional algorithms. Their evaluation suggests that CpGcluster provides a much more efficient approach to detecting functional clusters or islands of CpGs. RESULTS: We systematically compared CpGcluster with the traditional algorithm by Takai and Jones (2002). Our comparisons of (1) the number of islands versus the number of genes in a genome, (2) the distribution of islands in different genomic regions, (3) island length, (4) the distance between two neighboring islands, and (5) methylation status suggest that Takai and Jones' algorithm is overall more appropriate for identifying promoter-associated islands of CpGs in vertebrate genomes. CONCLUSION: The generation of genome sequence and DNA methylation data is expected to accelerate greatly. The information in this study is important for its extensive utility in gene feature analysis and epigenomics including gene prediction and methylation chip design in different genomes. BioMed Central 2009-02-20 /pmc/articles/PMC2652441/ /pubmed/19232104 http://dx.doi.org/10.1186/1471-2105-10-65 Text en Copyright © 2009 Han and Zhao; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Han, Leng
Zhao, Zhongming
CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title_full CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title_fullStr CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title_full_unstemmed CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title_short CpG islands or CpG clusters: how to identify functional GC-rich regions in a genome?
title_sort cpg islands or cpg clusters: how to identify functional gc-rich regions in a genome?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2652441/
https://www.ncbi.nlm.nih.gov/pubmed/19232104
http://dx.doi.org/10.1186/1471-2105-10-65
work_keys_str_mv AT hanleng cpgislandsorcpgclustershowtoidentifyfunctionalgcrichregionsinagenome
AT zhaozhongming cpgislandsorcpgclustershowtoidentifyfunctionalgcrichregionsinagenome