Cargando…

Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates

Site-specific evolutionary rates can be estimated from codon sequences or from amino-acid sequences. For codon sequences, the most popular methods use some variation of the dN∕dS ratio. For amino-acid sequences, one widely-used method is called Rate4Site, and it assigns a relative conservation score...

Descripción completa

Detalles Bibliográficos
Autores principales: Sydykova, Dariya K., Wilke, Claus O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452972/
https://www.ncbi.nlm.nih.gov/pubmed/28584717
http://dx.doi.org/10.7717/peerj.3391
_version_ 1783240550643662848
author Sydykova, Dariya K.
Wilke, Claus O.
author_facet Sydykova, Dariya K.
Wilke, Claus O.
author_sort Sydykova, Dariya K.
collection PubMed
description Site-specific evolutionary rates can be estimated from codon sequences or from amino-acid sequences. For codon sequences, the most popular methods use some variation of the dN∕dS ratio. For amino-acid sequences, one widely-used method is called Rate4Site, and it assigns a relative conservation score to each site in an alignment. How site-wise dN∕dS values relate to Rate4Site scores is not known. Here we elucidate the relationship between these two rate measurements. We simulate sequences with known dN∕dS, using either dN∕dS models or mutation–selection models for simulation. We then infer Rate4Site scores on the simulated alignments, and we compare those scores to either true or inferred dN∕dS values on the same alignments. We find that Rate4Site scores generally correlate well with true dN∕dS, and the correlation strengths increase in alignments with greater sequence divergence and more taxa. Moreover, Rate4Site scores correlate very well with inferred (as opposed to true) dN∕dS values, even for small alignments with little divergence. Finally, we verify this relationship between Rate4Site and dN∕dS in a variety of empirical datasets. We conclude that codon-level and amino-acid-level analysis frameworks are directly comparable and yield very similar inferences.
format Online
Article
Text
id pubmed-5452972
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-54529722017-06-05 Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates Sydykova, Dariya K. Wilke, Claus O. PeerJ Bioinformatics Site-specific evolutionary rates can be estimated from codon sequences or from amino-acid sequences. For codon sequences, the most popular methods use some variation of the dN∕dS ratio. For amino-acid sequences, one widely-used method is called Rate4Site, and it assigns a relative conservation score to each site in an alignment. How site-wise dN∕dS values relate to Rate4Site scores is not known. Here we elucidate the relationship between these two rate measurements. We simulate sequences with known dN∕dS, using either dN∕dS models or mutation–selection models for simulation. We then infer Rate4Site scores on the simulated alignments, and we compare those scores to either true or inferred dN∕dS values on the same alignments. We find that Rate4Site scores generally correlate well with true dN∕dS, and the correlation strengths increase in alignments with greater sequence divergence and more taxa. Moreover, Rate4Site scores correlate very well with inferred (as opposed to true) dN∕dS values, even for small alignments with little divergence. Finally, we verify this relationship between Rate4Site and dN∕dS in a variety of empirical datasets. We conclude that codon-level and amino-acid-level analysis frameworks are directly comparable and yield very similar inferences. PeerJ Inc. 2017-05-30 /pmc/articles/PMC5452972/ /pubmed/28584717 http://dx.doi.org/10.7717/peerj.3391 Text en ©2017 Sydykova and Wilke http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Sydykova, Dariya K.
Wilke, Claus O.
Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title_full Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title_fullStr Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title_full_unstemmed Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title_short Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
title_sort calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5452972/
https://www.ncbi.nlm.nih.gov/pubmed/28584717
http://dx.doi.org/10.7717/peerj.3391
work_keys_str_mv AT sydykovadariyak calculatingsitespecificevolutionaryratesattheaminoacidorcodonlevelyieldssimilarrateestimates
AT wilkeclauso calculatingsitespecificevolutionaryratesattheaminoacidorcodonlevelyieldssimilarrateestimates