Cargando…

Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation

Transitions at CpG dinucleotides, referred to as “CpG substitutions”, are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other...

Descripción completa

Detalles Bibliográficos
Autores principales: Elango, Navin, Kim, Seong-Ho, Vigoda, Eric, Yi, Soojin V.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2265638/
https://www.ncbi.nlm.nih.gov/pubmed/18463707
http://dx.doi.org/10.1371/journal.pcbi.1000015
_version_ 1782151502149713920
author Elango, Navin
Kim, Seong-Ho
Vigoda, Eric
Yi, Soojin V.
author_facet Elango, Navin
Kim, Seong-Ho
Vigoda, Eric
Yi, Soojin V.
author_sort Elango, Navin
collection PubMed
description Transitions at CpG dinucleotides, referred to as “CpG substitutions”, are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other single nucleotide substitutions (for example those occurring at GpC dinucleotides) mainly arise from errors during DNA replication. Here we analyzed high quality BAC-based data from human, chimpanzee, and baboon to investigate regional variation of CpG substitution rates. We show that CpG substitutions occur approximately 15 times more frequently than other single nucleotide substitutions in primate genomes, and that they exhibit substantial regional variation. Patterns of CpG rate variation are consistent with differences in methylation level and susceptibility to subsequent deamination. In particular, we propose a “distance-decaying” hypothesis, positing that due to the molecular mechanism of a CpG substitution, rates are correlated with the stability of double-stranded DNA surrounding each CpG dinucleotide, and the effect of local DNA stability may decrease with distance from the CpG dinucleotide. Consistent with our “distance-decaying” hypothesis, rates of CpG substitution are strongly (negatively) correlated with regional G+C content. The influence of G+C content decays as the distance from the target CpG site increases. We estimate that the influence of local G+C content extends up to 1,500∼2,000 bps centered on each CpG site. We also show that the distance-decaying relationship persisted when we controlled for the effect of long-range homogeneity of nucleotide composition. GpC sites, in contrast, do not exhibit such “distance-decaying” relationship. Our results highlight an example of the distinctive properties of methylation-dependent substitutions versus substitutions mostly arising from errors during DNA replication. Furthermore, the negative relationship between G+C content and CpG rates may provide an explanation for the observation that GC-rich SINEs show lower CpG rates than other repetitive elements.
format Text
id pubmed-2265638
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-22656382008-03-08 Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation Elango, Navin Kim, Seong-Ho Vigoda, Eric Yi, Soojin V. PLoS Comput Biol Research Article Transitions at CpG dinucleotides, referred to as “CpG substitutions”, are a major mutational input into vertebrate genomes and a leading cause of human genetic disease. The prevalence of CpG substitutions is due to their mutational origin, which is dependent on DNA methylation. In comparison, other single nucleotide substitutions (for example those occurring at GpC dinucleotides) mainly arise from errors during DNA replication. Here we analyzed high quality BAC-based data from human, chimpanzee, and baboon to investigate regional variation of CpG substitution rates. We show that CpG substitutions occur approximately 15 times more frequently than other single nucleotide substitutions in primate genomes, and that they exhibit substantial regional variation. Patterns of CpG rate variation are consistent with differences in methylation level and susceptibility to subsequent deamination. In particular, we propose a “distance-decaying” hypothesis, positing that due to the molecular mechanism of a CpG substitution, rates are correlated with the stability of double-stranded DNA surrounding each CpG dinucleotide, and the effect of local DNA stability may decrease with distance from the CpG dinucleotide. Consistent with our “distance-decaying” hypothesis, rates of CpG substitution are strongly (negatively) correlated with regional G+C content. The influence of G+C content decays as the distance from the target CpG site increases. We estimate that the influence of local G+C content extends up to 1,500∼2,000 bps centered on each CpG site. We also show that the distance-decaying relationship persisted when we controlled for the effect of long-range homogeneity of nucleotide composition. GpC sites, in contrast, do not exhibit such “distance-decaying” relationship. Our results highlight an example of the distinctive properties of methylation-dependent substitutions versus substitutions mostly arising from errors during DNA replication. Furthermore, the negative relationship between G+C content and CpG rates may provide an explanation for the observation that GC-rich SINEs show lower CpG rates than other repetitive elements. Public Library of Science 2008-02-29 /pmc/articles/PMC2265638/ /pubmed/18463707 http://dx.doi.org/10.1371/journal.pcbi.1000015 Text en Elango et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Elango, Navin
Kim, Seong-Ho
Vigoda, Eric
Yi, Soojin V.
Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title_full Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title_fullStr Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title_full_unstemmed Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title_short Mutations of Different Molecular Origins Exhibit Contrasting Patterns of Regional Substitution Rate Variation
title_sort mutations of different molecular origins exhibit contrasting patterns of regional substitution rate variation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2265638/
https://www.ncbi.nlm.nih.gov/pubmed/18463707
http://dx.doi.org/10.1371/journal.pcbi.1000015
work_keys_str_mv AT elangonavin mutationsofdifferentmolecularoriginsexhibitcontrastingpatternsofregionalsubstitutionratevariation
AT kimseongho mutationsofdifferentmolecularoriginsexhibitcontrastingpatternsofregionalsubstitutionratevariation
AT mutationsofdifferentmolecularoriginsexhibitcontrastingpatternsofregionalsubstitutionratevariation
AT vigodaeric mutationsofdifferentmolecularoriginsexhibitcontrastingpatternsofregionalsubstitutionratevariation
AT yisoojinv mutationsofdifferentmolecularoriginsexhibitcontrastingpatternsofregionalsubstitutionratevariation