Cargando…

Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade

When inferring phylogenetic relationships, not all sites in a sequence alignment are equally informative. One recently proposed approach that takes advantage of this inequality relies on sites that contain amino acids whose replacement requires multiple substitutions. Identifying these so-called RGC...

Descripción completa

Detalles Bibliográficos
Autores principales: Polzin, Kenneth, Rokas, Antonis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3956930/
https://www.ncbi.nlm.nih.gov/pubmed/24637883
http://dx.doi.org/10.1371/journal.pone.0092213
_version_ 1782307735608492032
author Polzin, Kenneth
Rokas, Antonis
author_facet Polzin, Kenneth
Rokas, Antonis
author_sort Polzin, Kenneth
collection PubMed
description When inferring phylogenetic relationships, not all sites in a sequence alignment are equally informative. One recently proposed approach that takes advantage of this inequality relies on sites that contain amino acids whose replacement requires multiple substitutions. Identifying these so-called RGC_CAM substitutions (after Rare Genomic Changes as Conserved Amino acids-Multiple substitutions) requires that, first, at any given site in the amino acid sequence alignment, there must be a minimum of two different amino acids; second, each amino acid must be present in at least two taxa; and third, the amino acids must require a minimum of two nucleotide substitutions to replace each other. Although theory suggests that RGC_CAM substitutions are expected to be rare and less likely to be homoplastic, the informativeness of RGC_CAM substitutions has not been extensively evaluated in biological data sets. We investigated the quality of RGC_CAM substitutions by examining their degree of homoplasy and internode certainty in nearly 2.7 million aligned amino acid sites from 5,261 proteins from five species belonging to the yeast Saccharomyces sensu stricto clade whose phylogeny is well-established. We identified 2,647 sites containing RGC_CAM substitutions, a number that contrasts sharply with the 100,887 sites containing RGC_non-CAM substitutions (i.e., changes between amino acids that require only a single nucleotide substitution). We found that RGC_CAM substitutions had significantly lower homoplasy than RGC_non-CAM ones; specifically RGC_CAM substitutions showed a per-site average homoplasy index of 0.100, whereas RGC_non-CAM substitutions had a homoplasy index of 0.215. Internode certainty values were also higher for sites containing RGC_CAM substitutions than for RGC_non-CAM ones. These results suggest that RGC_CAM substitutions possess a strong phylogenetic signal and are useful markers for phylogenetic inference despite their rarity.
format Online
Article
Text
id pubmed-3956930
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39569302014-03-18 Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade Polzin, Kenneth Rokas, Antonis PLoS One Research Article When inferring phylogenetic relationships, not all sites in a sequence alignment are equally informative. One recently proposed approach that takes advantage of this inequality relies on sites that contain amino acids whose replacement requires multiple substitutions. Identifying these so-called RGC_CAM substitutions (after Rare Genomic Changes as Conserved Amino acids-Multiple substitutions) requires that, first, at any given site in the amino acid sequence alignment, there must be a minimum of two different amino acids; second, each amino acid must be present in at least two taxa; and third, the amino acids must require a minimum of two nucleotide substitutions to replace each other. Although theory suggests that RGC_CAM substitutions are expected to be rare and less likely to be homoplastic, the informativeness of RGC_CAM substitutions has not been extensively evaluated in biological data sets. We investigated the quality of RGC_CAM substitutions by examining their degree of homoplasy and internode certainty in nearly 2.7 million aligned amino acid sites from 5,261 proteins from five species belonging to the yeast Saccharomyces sensu stricto clade whose phylogeny is well-established. We identified 2,647 sites containing RGC_CAM substitutions, a number that contrasts sharply with the 100,887 sites containing RGC_non-CAM substitutions (i.e., changes between amino acids that require only a single nucleotide substitution). We found that RGC_CAM substitutions had significantly lower homoplasy than RGC_non-CAM ones; specifically RGC_CAM substitutions showed a per-site average homoplasy index of 0.100, whereas RGC_non-CAM substitutions had a homoplasy index of 0.215. Internode certainty values were also higher for sites containing RGC_CAM substitutions than for RGC_non-CAM ones. These results suggest that RGC_CAM substitutions possess a strong phylogenetic signal and are useful markers for phylogenetic inference despite their rarity. Public Library of Science 2014-03-17 /pmc/articles/PMC3956930/ /pubmed/24637883 http://dx.doi.org/10.1371/journal.pone.0092213 Text en © 2014 Polzin, Rokas http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Polzin, Kenneth
Rokas, Antonis
Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title_full Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title_fullStr Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title_full_unstemmed Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title_short Evaluating Rare Amino Acid Substitutions (RGC_CAMs) in a Yeast Model Clade
title_sort evaluating rare amino acid substitutions (rgc_cams) in a yeast model clade
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3956930/
https://www.ncbi.nlm.nih.gov/pubmed/24637883
http://dx.doi.org/10.1371/journal.pone.0092213
work_keys_str_mv AT polzinkenneth evaluatingrareaminoacidsubstitutionsrgccamsinayeastmodelclade
AT rokasantonis evaluatingrareaminoacidsubstitutionsrgccamsinayeastmodelclade