Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST)
The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence t...
Main Authors: | Guerra, Geno; Nielsen, Rasmus |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | The Royal Society 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014196/ https://www.ncbi.nlm.nih.gov/pubmed/35430886 http://dx.doi.org/10.1098/rstb.2020.0415 |
_version_ | 1784688157656088576 |
---|---|
author | Guerra, Geno Nielsen, Rasmus |
author_facet | Guerra, Geno Nielsen, Rasmus |
author_sort | Guerra, Geno |
collection | PubMed |
description | The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence times, which are fundamental for understanding the properties of statistics that combine data from multiple species, such as the fixation index (F(ST)). The major contribution of this study is the derivation and implementation of exact expressions for the covariances of pairwise coalescence times under phylogenetic models with piecewise constant changes in population size, assuming no gene flow after species divergence. We use these expressions to derive the variance in average pairwise differences within and between populations. We then derive approximations for the expectation and bias of a sequence-based estimator of F(ST), a commonly used genetic measurement of population differentiation, when it is applied to a non-recombining region of the genome. We show that the estimator of F(ST) is generally biased downward. A freely available software package is provided, STCov, to calculate the mean, variances and covariances in coalescence times presented here under user-defined piecewise-constant species trees. This article is part of the theme issue ‘Celebrating 50 years since Lewontin's apportionment of human diversity’. |
format | Online Article Text |
id | pubmed-9014196 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-90141962022-04-21 Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) Guerra, Geno Nielsen, Rasmus Philos Trans R Soc Lond B Biol Sci Articles The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence times, which are fundamental for understanding the properties of statistics that combine data from multiple species, such as the fixation index (F(ST)). The major contribution of this study is the derivation and implementation of exact expressions for the covariances of pairwise coalescence times under phylogenetic models with piecewise constant changes in population size, assuming no gene flow after species divergence. We use these expressions to derive the variance in average pairwise differences within and between populations. We then derive approximations for the expectation and bias of a sequence-based estimator of F(ST), a commonly used genetic measurement of population differentiation, when it is applied to a non-recombining region of the genome. We show that the estimator of F(ST) is generally biased downward. A freely available software package is provided, STCov, to calculate the mean, variances and covariances in coalescence times presented here under user-defined piecewise-constant species trees. This article is part of the theme issue ‘Celebrating 50 years since Lewontin's apportionment of human diversity’. The Royal Society 2022-06-06 2022-04-18 /pmc/articles/PMC9014196/ /pubmed/35430886 http://dx.doi.org/10.1098/rstb.2020.0415 Text en © 2022 The Authors. 
https://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Articles Guerra, Geno Nielsen, Rasmus Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title | Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title_full | Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title_fullStr | Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title_full_unstemmed | Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title_short | Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST) |
title_sort | covariance of pairwise differences on a multi-species coalescent tree and implications for f(st) |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014196/ https://www.ncbi.nlm.nih.gov/pubmed/35430886 http://dx.doi.org/10.1098/rstb.2020.0415 |
work_keys_str_mv | AT guerrageno covarianceofpairwisedifferencesonamultispeciescoalescenttreeandimplicationsforfst AT nielsenrasmus covarianceofpairwisedifferencesonamultispeciescoalescenttreeandimplicationsforfst |