Cargando…
The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3684852/ https://www.ncbi.nlm.nih.gov/pubmed/23558341 http://dx.doi.org/10.1093/molbev/mst062 |
_version_ | 1782273619591692288 |
---|---|
author | Gharib, Walid H. Robinson-Rechavi, Marc |
author_facet | Gharib, Walid H. Robinson-Rechavi, Marc |
author_sort | Gharib, Walid H. |
collection | PubMed |
description | Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test. |
format | Online Article Text |
id | pubmed-3684852 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36848522013-06-18 The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC Gharib, Walid H. Robinson-Rechavi, Marc Mol Biol Evol Methods Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test. Oxford University Press 2013-07 2013-04-04 /pmc/articles/PMC3684852/ /pubmed/23558341 http://dx.doi.org/10.1093/molbev/mst062 Text en © The Author 2013. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Gharib, Walid H. Robinson-Rechavi, Marc The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title | The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title_full | The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title_fullStr | The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title_full_unstemmed | The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title_short | The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC |
title_sort | branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in gc |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3684852/ https://www.ncbi.nlm.nih.gov/pubmed/23558341 http://dx.doi.org/10.1093/molbev/mst062 |
work_keys_str_mv | AT gharibwalidh thebranchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc AT robinsonrechavimarc thebranchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc AT gharibwalidh branchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc AT robinsonrechavimarc branchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc |