Cargando…
Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection
Selection modulates gene sequence evolution in different ways by constraining potential changes of amino acid sequences (purifying selection) or by favoring new and adaptive genetic variants (positive selection). The number of nonsynonymous differences in a pair of protein-coding sequences can be us...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817425/ https://www.ncbi.nlm.nih.gov/pubmed/20333200 http://dx.doi.org/10.1093/gbe/evp030 |
_version_ | 1782177193069117440 |
---|---|
author | Wolf, Jochen B. W. Künstner, Axel Nam, Kiwoong Jakobsson, Mattias Ellegren, Hans |
author_facet | Wolf, Jochen B. W. Künstner, Axel Nam, Kiwoong Jakobsson, Mattias Ellegren, Hans |
author_sort | Wolf, Jochen B. W. |
collection | PubMed |
description | Selection modulates gene sequence evolution in different ways by constraining potential changes of amino acid sequences (purifying selection) or by favoring new and adaptive genetic variants (positive selection). The number of nonsynonymous differences in a pair of protein-coding sequences can be used to quantify the mode and strength of selection. To control for regional variation in substitution rates, the proportionate number of nonsynonymous differences (d(N)) is divided by the proportionate number of synonymous differences (d(S)). The resulting ratio (d(N)/d(S)) is a widely used indicator for functional divergence to identify particular genes that underwent positive selection. With the ever-growing amount of genome data, summary statistics like mean d(N)/d(S) allow gathering information on the mode of evolution for entire species. Both applications hinge on the assumption that d(S) and mean d(S) (∼branch length) are neutral and adequately control for variation in substitution rates across genes and across organisms, respectively. We here explore the validity of this assumption using empirical data based on whole-genome protein sequence alignments between human and 15 other vertebrate species and several simulation approaches. We find that d(N)/d(S) does not appropriately reflect the action of selection as it is strongly influenced by its denominator (d(S)). Particularly for closely related taxa, such as human and chimpanzee, d(N)/d(S) can be misleading and is not an unadulterated indicator of selection. Instead, we suggest that inconsistencies in the behavior of d(N)/d(S) are to be expected and highlight the idea that this behavior may be inherent to taking the ratio of two randomly distributed variables that are nonlinearly correlated. New null hypotheses will be needed to adequately handle these nonlinear dynamics. |
format | Text |
id | pubmed-2817425 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-28174252010-03-22 Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection Wolf, Jochen B. W. Künstner, Axel Nam, Kiwoong Jakobsson, Mattias Ellegren, Hans Genome Biol Evol Research Articles Selection modulates gene sequence evolution in different ways by constraining potential changes of amino acid sequences (purifying selection) or by favoring new and adaptive genetic variants (positive selection). The number of nonsynonymous differences in a pair of protein-coding sequences can be used to quantify the mode and strength of selection. To control for regional variation in substitution rates, the proportionate number of nonsynonymous differences (d(N)) is divided by the proportionate number of synonymous differences (d(S)). The resulting ratio (d(N)/d(S)) is a widely used indicator for functional divergence to identify particular genes that underwent positive selection. With the ever-growing amount of genome data, summary statistics like mean d(N)/d(S) allow gathering information on the mode of evolution for entire species. Both applications hinge on the assumption that d(S) and mean d(S) (∼branch length) are neutral and adequately control for variation in substitution rates across genes and across organisms, respectively. We here explore the validity of this assumption using empirical data based on whole-genome protein sequence alignments between human and 15 other vertebrate species and several simulation approaches. We find that d(N)/d(S) does not appropriately reflect the action of selection as it is strongly influenced by its denominator (d(S)). Particularly for closely related taxa, such as human and chimpanzee, d(N)/d(S) can be misleading and is not an unadulterated indicator of selection. Instead, we suggest that inconsistencies in the behavior of d(N)/d(S) are to be expected and highlight the idea that this behavior may be inherent to taking the ratio of two randomly distributed variables that are nonlinearly correlated. New null hypotheses will be needed to adequately handle these nonlinear dynamics. Oxford University Press 2009 2009-08-13 /pmc/articles/PMC2817425/ /pubmed/20333200 http://dx.doi.org/10.1093/gbe/evp030 Text en © The Author(s) 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Wolf, Jochen B. W. Künstner, Axel Nam, Kiwoong Jakobsson, Mattias Ellegren, Hans Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title | Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title_full | Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title_fullStr | Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title_full_unstemmed | Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title_short | Nonlinear Dynamics of Nonsynonymous (d(N)) and Synonymous (d(S)) Substitution Rates Affects Inference of Selection |
title_sort | nonlinear dynamics of nonsynonymous (d(n)) and synonymous (d(s)) substitution rates affects inference of selection |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2817425/ https://www.ncbi.nlm.nih.gov/pubmed/20333200 http://dx.doi.org/10.1093/gbe/evp030 |
work_keys_str_mv | AT wolfjochenbw nonlineardynamicsofnonsynonymousdnandsynonymousdssubstitutionratesaffectsinferenceofselection AT kunstneraxel nonlineardynamicsofnonsynonymousdnandsynonymousdssubstitutionratesaffectsinferenceofselection AT namkiwoong nonlineardynamicsofnonsynonymousdnandsynonymousdssubstitutionratesaffectsinferenceofselection AT jakobssonmattias nonlineardynamicsofnonsynonymousdnandsynonymousdssubstitutionratesaffectsinferenceofselection AT ellegrenhans nonlineardynamicsofnonsynonymousdnandsynonymousdssubstitutionratesaffectsinferenceofselection |