Cargando…

Inferring Past Effective Population Size from Distributions of Coalescent Times

Inferring and understanding changes in effective population size over time is a major challenge for population genetics. Here we investigate some theoretical properties of random-mating populations with varying size over time. In particular, we present an exact solution to compute the population siz...

Descripción completa

Detalles Bibliográficos
Autores principales: Gattepaille, Lucie, Günther, Torsten, Jakobsson, Mattias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5105851/
https://www.ncbi.nlm.nih.gov/pubmed/27638421
http://dx.doi.org/10.1534/genetics.115.185058
_version_ 1782466960862216192
author Gattepaille, Lucie
Günther, Torsten
Jakobsson, Mattias
author_facet Gattepaille, Lucie
Günther, Torsten
Jakobsson, Mattias
author_sort Gattepaille, Lucie
collection PubMed
description Inferring and understanding changes in effective population size over time is a major challenge for population genetics. Here we investigate some theoretical properties of random-mating populations with varying size over time. In particular, we present an exact solution to compute the population size as a function of time, [Formula: see text] , based on distributions of coalescent times of samples of any size. This result reduces the problem of population size inference to a problem of estimating coalescent time distributions. To illustrate the analytic results, we design a heuristic method using a tree-inference algorithm and investigate simulated and empirical population-genetic data. We investigate the effects of a range of conditions associated with empirical data, for instance number of loci, sample size, mutation rate, and cryptic recombination. We show that our approach performs well with genomic data (≥ 10,000 loci) and that increasing the sample size from 2 to 10 greatly improves the inference of [Formula: see text] whereas further increase in sample size results in modest improvements, even under a scenario of exponential growth. We also investigate the impact of recombination and characterize the potential biases in inference of [Formula: see text]. The approach can handle large sample sizes and the computations are fast. We apply our method to human genomes from four populations and reconstruct population size profiles that are coherent with previous finds, including the Out-of-Africa bottleneck. Additionally, we uncover a potential difference in population size between African and non-African populations as early as 400 KYA. In summary, we provide an analytic relationship between distributions of coalescent times and [Formula: see text] , which can be incorporated into powerful approaches for inferring past population sizes from population-genomic data.
format Online
Article
Text
id pubmed-5105851
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-51058512016-11-14 Inferring Past Effective Population Size from Distributions of Coalescent Times Gattepaille, Lucie Günther, Torsten Jakobsson, Mattias Genetics Investigations Inferring and understanding changes in effective population size over time is a major challenge for population genetics. Here we investigate some theoretical properties of random-mating populations with varying size over time. In particular, we present an exact solution to compute the population size as a function of time, [Formula: see text] , based on distributions of coalescent times of samples of any size. This result reduces the problem of population size inference to a problem of estimating coalescent time distributions. To illustrate the analytic results, we design a heuristic method using a tree-inference algorithm and investigate simulated and empirical population-genetic data. We investigate the effects of a range of conditions associated with empirical data, for instance number of loci, sample size, mutation rate, and cryptic recombination. We show that our approach performs well with genomic data (≥ 10,000 loci) and that increasing the sample size from 2 to 10 greatly improves the inference of [Formula: see text] whereas further increase in sample size results in modest improvements, even under a scenario of exponential growth. We also investigate the impact of recombination and characterize the potential biases in inference of [Formula: see text]. The approach can handle large sample sizes and the computations are fast. We apply our method to human genomes from four populations and reconstruct population size profiles that are coherent with previous finds, including the Out-of-Africa bottleneck. Additionally, we uncover a potential difference in population size between African and non-African populations as early as 400 KYA. In summary, we provide an analytic relationship between distributions of coalescent times and [Formula: see text] , which can be incorporated into powerful approaches for inferring past population sizes from population-genomic data. Genetics Society of America 2016-11 2016-09-15 /pmc/articles/PMC5105851/ /pubmed/27638421 http://dx.doi.org/10.1534/genetics.115.185058 Text en Copyright © 2016 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Gattepaille, Lucie
Günther, Torsten
Jakobsson, Mattias
Inferring Past Effective Population Size from Distributions of Coalescent Times
title Inferring Past Effective Population Size from Distributions of Coalescent Times
title_full Inferring Past Effective Population Size from Distributions of Coalescent Times
title_fullStr Inferring Past Effective Population Size from Distributions of Coalescent Times
title_full_unstemmed Inferring Past Effective Population Size from Distributions of Coalescent Times
title_short Inferring Past Effective Population Size from Distributions of Coalescent Times
title_sort inferring past effective population size from distributions of coalescent times
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5105851/
https://www.ncbi.nlm.nih.gov/pubmed/27638421
http://dx.doi.org/10.1534/genetics.115.185058
work_keys_str_mv AT gattepaillelucie inferringpasteffectivepopulationsizefromdistributionsofcoalescenttimes
AT gunthertorsten inferringpasteffectivepopulationsizefromdistributionsofcoalescenttimes
AT jakobssonmattias inferringpasteffectivepopulationsizefromdistributionsofcoalescenttimes