Cargando…
Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals
BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinuc...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2576242/ https://www.ncbi.nlm.nih.gov/pubmed/18826599 http://dx.doi.org/10.1186/1471-2148-8-265 |
_version_ | 1782160375399055360 |
---|---|
author | Gaffney, Daniel J Keightley, Peter D |
author_facet | Gaffney, Daniel J Keightley, Peter D |
author_sort | Gaffney, Daniel J |
collection | PubMed |
description | BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related. RESULTS: Using simulations of DNA sequence evolution we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because a minimum of two substitutions are required to misassign a true ancestral CpG site as non-CpG whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG in a two branch tree. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias. CONCLUSION: Our study demonstrates that a common method of assigning sites into CpG and non CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment. |
format | Text |
id | pubmed-2576242 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-25762422008-10-31 Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals Gaffney, Daniel J Keightley, Peter D BMC Evol Biol Methodology Article BACKGROUND: Molecular evolutionary studies in mammals often estimate nucleotide substitution rates within and outside CpG dinucleotides separately. Frequently, in alignments of two sequences, the division of sites into CpG and non-CpG classes is based simply on the presence or absence of a CpG dinucleotide in either sequence, a procedure that we refer to as CpG/non-CpG assignment. Although it likely that this procedure is biased, it is generally assumed that the bias is negligible if species are very closely related. RESULTS: Using simulations of DNA sequence evolution we show that assignment of the ancestral CpG state based on the simple presence/absence of the CpG dinucleotide can seriously bias estimates of the substitution rate, because many true non-CpG changes are misassigned as CpG. Paradoxically, this bias is most severe between closely related species, because a minimum of two substitutions are required to misassign a true ancestral CpG site as non-CpG whereas only a single substitution is required to misassign a true ancestral non-CpG site as CpG in a two branch tree. We also show that CpG misassignment bias differentially affects fourfold degenerate and noncoding sites due to differences in base composition such that fourfold degenerate sites can appear to be evolving more slowly than noncoding sites. We demonstrate that the effects predicted by our simulations occur in a real evolutionary setting by comparing substitution rates estimated from human-chimp coding and intronic sequence using CpG/non-CpG assignment with estimates derived from a method that is largely free from bias. CONCLUSION: Our study demonstrates that a common method of assigning sites into CpG and non CpG classes in pairwise alignments is seriously biased and recommends against the adoption of ad hoc methods of ancestral state assignment. BioMed Central 2008-09-30 /pmc/articles/PMC2576242/ /pubmed/18826599 http://dx.doi.org/10.1186/1471-2148-8-265 Text en Copyright ©2008 Gaffney and Keightley; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Gaffney, Daniel J Keightley, Peter D Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title | Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title_full | Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title_fullStr | Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title_full_unstemmed | Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title_short | Effect of the assignment of ancestral CpG state on the estimation of nucleotide substitution rates in mammals |
title_sort | effect of the assignment of ancestral cpg state on the estimation of nucleotide substitution rates in mammals |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2576242/ https://www.ncbi.nlm.nih.gov/pubmed/18826599 http://dx.doi.org/10.1186/1471-2148-8-265 |
work_keys_str_mv | AT gaffneydanielj effectoftheassignmentofancestralcpgstateontheestimationofnucleotidesubstitutionratesinmammals AT keightleypeterd effectoftheassignmentofancestralcpgstateontheestimationofnucleotidesubstitutionratesinmammals |