Unbiased Estimate of Synonymous and Nonsynonymous Substitution Rates with Nonstationary Base Composition

The measurement of synonymous and nonsynonymous substitution rates (dS and dN) is useful for assessing selection operating on protein sequences or for investigating mutational processes affecting genomes. In particular, the ratio dN/dS is expected to be a good proxy for ω, the ratio of...

Bibliographic Details
Main Authors: Guéguen, Laurent, Duret, Laurent
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5850866/
https://www.ncbi.nlm.nih.gov/pubmed/29220511
http://dx.doi.org/10.1093/molbev/msx308
author Guéguen, Laurent
Duret, Laurent
collection PubMed
description The measurement of synonymous and nonsynonymous substitution rates (dS and dN) is useful for assessing selection operating on protein sequences or for investigating mutational processes affecting genomes. In particular, the ratio dN/dS is expected to be a good proxy for ω, the ratio of fixation probabilities of nonsynonymous mutations relative to that of neutral mutations. Standard methods for estimating dN, dS, or ω rely on the assumption that the base composition of sequences is at the equilibrium of the evolutionary process. In many clades, this assumption of stationarity is in fact incorrect, and we show here through simulations and analyses of empirical data that nonstationarity biases the estimate of dN, dS, and ω. We show that the bias in the estimate of ω can be fixed by explicitly taking into consideration nonstationarity in the modeling of codon evolution, in a maximum likelihood framework. Moreover, we propose an exact method for estimating dN and dS on branches, based on stochastic mapping, that can take into account nonstationarity. This method can be directly applied to any kind of codon evolution model, as long as neutrality is clearly parameterized.
format Online
Article
Text
id pubmed-5850866
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58508662018-03-23 Unbiased Estimate of Synonymous and Nonsynonymous Substitution Rates with Nonstationary Base Composition Guéguen, Laurent Duret, Laurent Mol Biol Evol Methods The measurement of synonymous and nonsynonymous substitution rates (dS and dN) is useful for assessing selection operating on protein sequences or for investigating mutational processes affecting genomes. In particular, the ratio dN/dS is expected to be a good proxy for ω, the ratio of fixation probabilities of nonsynonymous mutations relative to that of neutral mutations. Standard methods for estimating dN, dS, or ω rely on the assumption that the base composition of sequences is at the equilibrium of the evolutionary process. In many clades, this assumption of stationarity is in fact incorrect, and we show here through simulations and analyses of empirical data that nonstationarity biases the estimate of dN, dS, and ω. We show that the bias in the estimate of ω can be fixed by explicitly taking into consideration nonstationarity in the modeling of codon evolution, in a maximum likelihood framework. Moreover, we propose an exact method for estimating dN and dS on branches, based on stochastic mapping, that can take into account nonstationarity. This method can be directly applied to any kind of codon evolution model, as long as neutrality is clearly parameterized. Oxford University Press 2018-03 2017-12-06 /pmc/articles/PMC5850866/ /pubmed/29220511 http://dx.doi.org/10.1093/molbev/msx308 Text en © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Unbiased Estimate of Synonymous and Nonsynonymous Substitution Rates with Nonstationary Base Composition
topic Methods
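
The stationarity assumption mentioned in the description can be illustrated with a minimal sketch. It assumes a two-state GC/AT model with constant rates, which is a deliberate simplification and not the codon model or the stochastic-mapping method of the article. The function names, the rates `u` (AT to GC) and `v` (GC to AT), and the tolerance `tol` are all hypothetical choices made for this example. Under these assumptions the equilibrium GC content is u / (u + v), and a sequence whose GC content differs from that value is nonstationary.

```python
import math


def equilibrium_gc(u, v):
    """Equilibrium GC frequency of a two-state model.

    u: rate of AT -> GC changes, v: rate of GC -> AT changes
    (both hypothetical parameters of this illustration).
    """
    return u / (u + v)


def gc_after(gc0, u, v, t):
    """GC frequency after time t, starting from gc0.

    Exact solution of the two-state continuous-time Markov chain:
    the gap to equilibrium decays exponentially at rate (u + v).
    """
    gc_eq = equilibrium_gc(u, v)
    return gc_eq + (gc0 - gc_eq) * math.exp(-(u + v) * t)


def is_stationary(gc_observed, u, v, tol=0.01):
    """True if the observed GC content is within tol of equilibrium."""
    return abs(gc_observed - equilibrium_gc(u, v)) <= tol


if __name__ == "__main__":
    u, v = 1.0, 3.0   # assumed rates; equilibrium GC is 0.25
    gc0 = 0.60        # assumed ancestral GC content, far from equilibrium
    for t in (0.0, 0.1, 0.5, 2.0):
        gc_t = gc_after(gc0, u, v, t)
        print(f"t={t:.1f}  GC={gc_t:.3f}  stationary={is_stationary(gc_t, u, v)}")
```

In this toy setting the composition only approaches equilibrium for large `t`; on short branches it remains far from it, which is the situation in which, according to the description, methods that assume stationarity give biased estimates.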