Cargando…

An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias

Phylogenetic codon models are routinely used to characterize selective regimes in coding sequences. Their parametric design, however, is still a matter of debate, in particular concerning the question of how to account for differing nucleotide frequencies and substitution rates. This problem relates...

Descripción completa

Detalles Bibliográficos
Autores principales: Latrille, Thibault, Lartillot, Nicolas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831783/
https://www.ncbi.nlm.nih.gov/pubmed/35021218
http://dx.doi.org/10.1093/molbev/msac005
_version_ 1784648579284992000
author Latrille, Thibault
Lartillot, Nicolas
author_facet Latrille, Thibault
Lartillot, Nicolas
author_sort Latrille, Thibault
collection PubMed
description Phylogenetic codon models are routinely used to characterize selective regimes in coding sequences. Their parametric design, however, is still a matter of debate, in particular concerning the question of how to account for differing nucleotide frequencies and substitution rates. This problem relates to the fact that nucleotide composition in protein-coding sequences is the result of the interactions between mutation and selection. In particular, because of the structure of the genetic code, the nucleotide composition differs between the three coding positions, with the third position showing a more extreme composition. Yet, phylogenetic codon models do not correctly capture this phenomenon and instead predict that the nucleotide composition should be the same for all three positions. Alternatively, some models allow for different nucleotide rates at the three positions, an approach conflating the effects of mutation and selection on nucleotide composition. In practice, it results in inaccurate estimation of the strength of selection. Conceptually, the problem comes from the fact that phylogenetic codon models do not correctly capture the fixation bias acting against the mutational pressure at the mutation–selection equilibrium. To address this problem and to more accurately identify mutation rates and selection strength, we present an improved codon modeling approach where the fixation rate is not seen as a scalar, but as a tensor. This approach gives an accurate representation of how mutation and selection oppose each other at equilibrium and yields a reliable estimate of the mutational process, while disentangling the mean fixation probabilities prevailing in different mutational directions.
format Online
Article
Text
id pubmed-8831783
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88317832022-02-11 An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias Latrille, Thibault Lartillot, Nicolas Mol Biol Evol Methods Phylogenetic codon models are routinely used to characterize selective regimes in coding sequences. Their parametric design, however, is still a matter of debate, in particular concerning the question of how to account for differing nucleotide frequencies and substitution rates. This problem relates to the fact that nucleotide composition in protein-coding sequences is the result of the interactions between mutation and selection. In particular, because of the structure of the genetic code, the nucleotide composition differs between the three coding positions, with the third position showing a more extreme composition. Yet, phylogenetic codon models do not correctly capture this phenomenon and instead predict that the nucleotide composition should be the same for all three positions. Alternatively, some models allow for different nucleotide rates at the three positions, an approach conflating the effects of mutation and selection on nucleotide composition. In practice, it results in inaccurate estimation of the strength of selection. Conceptually, the problem comes from the fact that phylogenetic codon models do not correctly capture the fixation bias acting against the mutational pressure at the mutation–selection equilibrium. To address this problem and to more accurately identify mutation rates and selection strength, we present an improved codon modeling approach where the fixation rate is not seen as a scalar, but as a tensor. This approach gives an accurate representation of how mutation and selection oppose each other at equilibrium and yields a reliable estimate of the mutational process, while disentangling the mean fixation probabilities prevailing in different mutational directions. Oxford University Press 2022-01-11 /pmc/articles/PMC8831783/ /pubmed/35021218 http://dx.doi.org/10.1093/molbev/msac005 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Latrille, Thibault
Lartillot, Nicolas
An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title_full An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title_fullStr An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title_full_unstemmed An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title_short An Improved Codon Modeling Approach for Accurate Estimation of the Mutation Bias
title_sort improved codon modeling approach for accurate estimation of the mutation bias
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831783/
https://www.ncbi.nlm.nih.gov/pubmed/35021218
http://dx.doi.org/10.1093/molbev/msac005
work_keys_str_mv AT latrillethibault animprovedcodonmodelingapproachforaccurateestimationofthemutationbias
AT lartillotnicolas animprovedcodonmodelingapproachforaccurateestimationofthemutationbias
AT latrillethibault improvedcodonmodelingapproachforaccurateestimationofthemutationbias
AT lartillotnicolas improvedcodonmodelingapproachforaccurateestimationofthemutationbias