Cargando…

Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories

Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic hi...

Descripción completa

Detalles Bibliográficos
Autores principales: Beichman, Annabel C., Phung, Tanya N., Lohmueller, Kirk E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677151/
https://www.ncbi.nlm.nih.gov/pubmed/28893846
http://dx.doi.org/10.1534/g3.117.300259
_version_ 1783277184523173888
author Beichman, Annabel C.
Phung, Tanya N.
Lohmueller, Kirk E.
author_facet Beichman, Annabel C.
Phung, Tanya N.
Lohmueller, Kirk E.
author_sort Beichman, Annabel C.
collection PubMed
description Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the distribution of allele frequencies in a sample to reconstruct the same historical events. Although both methods are extensively used in empirical studies and perform well on data simulated under simple models, there have been only limited comparisons of them in more complex and realistic settings. Here we use published demographic models based on data from three human populations (Yoruba, descendants of northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference procedures. We find that several of the demographic histories inferred by the whole genome-based methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data.
format Online
Article
Text
id pubmed-5677151
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-56771512017-11-09 Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories Beichman, Annabel C. Phung, Tanya N. Lohmueller, Kirk E. G3 (Bethesda) Investigations Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the distribution of allele frequencies in a sample to reconstruct the same historical events. Although both methods are extensively used in empirical studies and perform well on data simulated under simple models, there have been only limited comparisons of them in more complex and realistic settings. Here we use published demographic models based on data from three human populations (Yoruba, descendants of northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference procedures. We find that several of the demographic histories inferred by the whole genome-based methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data. Genetics Society of America 2017-09-11 /pmc/articles/PMC5677151/ /pubmed/28893846 http://dx.doi.org/10.1534/g3.117.300259 Text en Copyright © 2017 Beichman et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Beichman, Annabel C.
Phung, Tanya N.
Lohmueller, Kirk E.
Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title_full Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title_fullStr Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title_full_unstemmed Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title_short Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
title_sort comparison of single genome and allele frequency data reveals discordant demographic histories
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677151/
https://www.ncbi.nlm.nih.gov/pubmed/28893846
http://dx.doi.org/10.1534/g3.117.300259
work_keys_str_mv AT beichmanannabelc comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories
AT phungtanyan comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories
AT lohmuellerkirke comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories