Cargando…

Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage

Amino acid substitution models are commonly used for phylogenetic inference, for ancestral sequence reconstruction, and for the inference of positive selection. All commonly used models explicitly assume that each site evolves independently, an assumption that is violated by both linkage and protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Stark, Tristan L, Liberles, David A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557849/
https://www.ncbi.nlm.nih.gov/pubmed/34581792
http://dx.doi.org/10.1093/gbe/evab225
_version_ 1784592437573844992
author Stark, Tristan L
Liberles, David A
author_facet Stark, Tristan L
Liberles, David A
author_sort Stark, Tristan L
collection PubMed
description Amino acid substitution models are commonly used for phylogenetic inference, for ancestral sequence reconstruction, and for the inference of positive selection. All commonly used models explicitly assume that each site evolves independently, an assumption that is violated by both linkage and protein structural and functional constraints. We introduce two new models for amino acid substitution which incorporate linkage between sites, each based on the (population-genetic) Moran model. The first model is a generalized population process tracking arbitrarily many sites which undergo mutation, with individuals replaced according to their fitnesses. This model provides a reasonably complete framework for simulations but is numerically and analytically intractable. We also introduce a second model which includes several simplifying assumptions but for which some theoretical results can be derived. We analyze the simplified model to determine conditions where linkage is likely to have meaningful effects on sitewise substitution probabilities, as well as conditions under which the effects are likely to be negligible. These findings are an important step in the generation of tractable phylogenetic models that parameterize selective coefficients for amino acid substitution while accounting for linkage of sites leading to both hitchhiking and background selection.
format Online
Article
Text
id pubmed-8557849
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85578492021-11-01 Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage Stark, Tristan L Liberles, David A Genome Biol Evol Research Article Amino acid substitution models are commonly used for phylogenetic inference, for ancestral sequence reconstruction, and for the inference of positive selection. All commonly used models explicitly assume that each site evolves independently, an assumption that is violated by both linkage and protein structural and functional constraints. We introduce two new models for amino acid substitution which incorporate linkage between sites, each based on the (population-genetic) Moran model. The first model is a generalized population process tracking arbitrarily many sites which undergo mutation, with individuals replaced according to their fitnesses. This model provides a reasonably complete framework for simulations but is numerically and analytically intractable. We also introduce a second model which includes several simplifying assumptions but for which some theoretical results can be derived. We analyze the simplified model to determine conditions where linkage is likely to have meaningful effects on sitewise substitution probabilities, as well as conditions under which the effects are likely to be negligible. These findings are an important step in the generation of tractable phylogenetic models that parameterize selective coefficients for amino acid substitution while accounting for linkage of sites leading to both hitchhiking and background selection. Oxford University Press 2021-09-28 /pmc/articles/PMC8557849/ /pubmed/34581792 http://dx.doi.org/10.1093/gbe/evab225 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Stark, Tristan L
Liberles, David A
Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title_full Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title_fullStr Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title_full_unstemmed Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title_short Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage
title_sort characterizing amino acid substitution with complete linkage of sites on a lineage
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557849/
https://www.ncbi.nlm.nih.gov/pubmed/34581792
http://dx.doi.org/10.1093/gbe/evab225
work_keys_str_mv AT starktristanl characterizingaminoacidsubstitutionwithcompletelinkageofsitesonalineage
AT liberlesdavida characterizingaminoacidsubstitutionwithcompletelinkageofsitesonalineage