Cargando…

The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC

Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω...

Descripción completa

Detalles Bibliográficos
Autores principales: Gharib, Walid H., Robinson-Rechavi, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3684852/
https://www.ncbi.nlm.nih.gov/pubmed/23558341
http://dx.doi.org/10.1093/molbev/mst062
_version_ 1782273619591692288
author Gharib, Walid H.
Robinson-Rechavi, Marc
author_facet Gharib, Walid H.
Robinson-Rechavi, Marc
author_sort Gharib, Walid H.
collection PubMed
description Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test.
format Online
Article
Text
id pubmed-3684852
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36848522013-06-18 The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC Gharib, Walid H. Robinson-Rechavi, Marc Mol Biol Evol Methods Positive selection is widely estimated from protein coding sequence alignments by the nonsynonymous-to-synonymous ratio ω. Increasingly elaborate codon models are used in a likelihood framework for this estimation. Although there is widespread concern about the robustness of the estimation of the ω ratio, more efforts are needed to estimate this robustness, especially in the context of complex models. Here, we focused on the branch-site codon model. We investigated its robustness on a large set of simulated data. First, we investigated the impact of sequence divergence. We found evidence of underestimation of the synonymous substitution rate for values as small as 0.5, with a slight increase in false positives for the branch-site test. When dS increases further, underestimation of dS is worse, but false positives decrease. Interestingly, the detection of true positives follows a similar distribution, with a maximum for intermediary values of dS. Thus, high dS is more of a concern for a loss of power (false negatives) than for false positives of the test. Second, we investigated the impact of GC content. We showed that there is no significant difference of false positives between high GC (up to ∼80%) and low GC (∼30%) genes. Moreover, neither shifts of GC content on a specific branch nor major shifts in GC along the gene sequence generate many false positives. Our results confirm that the branch-site is a very conservative test. Oxford University Press 2013-07 2013-04-04 /pmc/articles/PMC3684852/ /pubmed/23558341 http://dx.doi.org/10.1093/molbev/mst062 Text en © The Author 2013. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods
Gharib, Walid H.
Robinson-Rechavi, Marc
The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title_full The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title_fullStr The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title_full_unstemmed The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title_short The Branch-Site Test of Positive Selection Is Surprisingly Robust but Lacks Power under Synonymous Substitution Saturation and Variation in GC
title_sort branch-site test of positive selection is surprisingly robust but lacks power under synonymous substitution saturation and variation in gc
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3684852/
https://www.ncbi.nlm.nih.gov/pubmed/23558341
http://dx.doi.org/10.1093/molbev/mst062
work_keys_str_mv AT gharibwalidh thebranchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc
AT robinsonrechavimarc thebranchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc
AT gharibwalidh branchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc
AT robinsonrechavimarc branchsitetestofpositiveselectionissurprisinglyrobustbutlackspowerundersynonymoussubstitutionsaturationandvariationingc