Cargando…

Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals

BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinuc...

Descripción completa

Detalles Bibliográficos
Autores principales: Gaffney, Daniel J, Keightley, Peter D
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2576242/
https://www.ncbi.nlm.nih.gov/pubmed/18826599
http://dx.doi.org/10.1186/1471-2148-8-265
_version_ 1782160375399055360
author Gaffney, Daniel J
Keightley, Peter D
author_facet Gaffney, Daniel J
Keightley, Peter D
author_sort Gaffney, Daniel J
collection PubMed
description BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related. RESULTS: Using simulations of DNA sequence evolution we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because a minimum of two substitutions are required to misassign a true ancestral CpG site as non-CpG whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG in a two branch tree. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias. CONCLUSION: Our study demonstrates that a common method of assigning sites into CpG and non CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment.
format Text
id pubmed-2576242
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25762422008-10-31 Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals Gaffney, Daniel J Keightley, Peter D BMC Evol Biol Methodology Article BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related. RESULTS: Using simulations of DNA sequence evolution we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because a minimum of two substitutions are required to misassign a true ancestral CpG site as non-CpG whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG in a two branch tree. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias. CONCLUSION: Our study demonstrates that a common method of assigning sites into CpG and non CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment. BioMed Central 2008-09-30 /pmc/articles/PMC2576242/ /pubmed/18826599 http://dx.doi.org/10.1186/1471-2148-8-265 Text en Copyright ©2008 Gaffney and Keightley; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gaffney, Daniel J
Keightley, Peter D
Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title_full Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title_fullStr Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title_full_unstemmed Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title_short Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
title_sort effect of the assignment of ancestral cpg state on the estimation of nucleotide substitution rates in mammals
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2576242/
https://www.ncbi.nlm.nih.gov/pubmed/18826599
http://dx.doi.org/10.1186/1471-2148-8-265
work_keys_str_mv AT gaffneydanielj effectoftheassignmentofancestralcpgstateontheestimationofnucleotidesubstitutionratesinmammals
AT keightleypeterd effectoftheassignmentofancestralcpgstateontheestimationofnucleotidesubstitutionratesinmammals