
Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST)

The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence t...


Bibliographic Details
Main authors: Guerra, Geno; Nielsen, Rasmus
Format: Online Article Text
Language: English
Published: The Royal Society 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014196/
https://www.ncbi.nlm.nih.gov/pubmed/35430886
http://dx.doi.org/10.1098/rstb.2020.0415
author Guerra, Geno
Nielsen, Rasmus
collection PubMed
description The multi-species coalescent (MSC) provides a theoretical foundation for modern phylogenetics and comparative population genetics. Its theoretical properties have been heavily studied but there are still aspects of the MSC that are largely unknown, including the covariances in pairwise coalescence times, which are fundamental for understanding the properties of statistics that combine data from multiple species, such as the fixation index (F(ST)). The major contribution of this study is the derivation and implementation of exact expressions for the covariances of pairwise coalescence times under phylogenetic models with piecewise constant changes in population size, assuming no gene flow after species divergence. We use these expressions to derive the variance in average pairwise differences within and between populations. We then derive approximations for the expectation and bias of a sequence-based estimator of F(ST), a commonly used genetic measurement of population differentiation, when it is applied to a non-recombining region of the genome. We show that the estimator of F(ST) is generally biased downward. A freely available software package is provided, STCov, to calculate the mean, variances and covariances in coalescence times presented here under user-defined piecewise-constant species trees. This article is part of the theme issue ‘Celebrating 50 years since Lewontin's apportionment of human diversity’.
format Online
Article
Text
id pubmed-9014196
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-9014196 2022-04-21 Guerra, Geno Nielsen, Rasmus Philos Trans R Soc Lond B Biol Sci Articles The Royal Society 2022-06-06 2022-04-18 /pmc/articles/PMC9014196/ /pubmed/35430886 http://dx.doi.org/10.1098/rstb.2020.0415 Text en © 2022 The Authors.
Published by the Royal Society under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, provided the original author and source are credited.
title Covariance of pairwise differences on a multi-species coalescent tree and implications for F(ST)
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9014196/
https://www.ncbi.nlm.nih.gov/pubmed/35430886
http://dx.doi.org/10.1098/rstb.2020.0415