Cargando…
Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories
Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic hi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677151/ https://www.ncbi.nlm.nih.gov/pubmed/28893846 http://dx.doi.org/10.1534/g3.117.300259 |
_version_ | 1783277184523173888 |
---|---|
author | Beichman, Annabel C. Phung, Tanya N. Lohmueller, Kirk E. |
author_facet | Beichman, Annabel C. Phung, Tanya N. Lohmueller, Kirk E. |
author_sort | Beichman, Annabel C. |
collection | PubMed |
description | Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the distribution of allele frequencies in a sample to reconstruct the same historical events. Although both methods are extensively used in empirical studies and perform well on data simulated under simple models, there have been only limited comparisons of them in more complex and realistic settings. Here we use published demographic models based on data from three human populations (Yoruba, descendants of northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference procedures. We find that several of the demographic histories inferred by the whole genome-based methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data. |
format | Online Article Text |
id | pubmed-5677151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-56771512017-11-09 Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories Beichman, Annabel C. Phung, Tanya N. Lohmueller, Kirk E. G3 (Bethesda) Investigations Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the distribution of allele frequencies in a sample to reconstruct the same historical events. Although both methods are extensively used in empirical studies and perform well on data simulated under simple models, there have been only limited comparisons of them in more complex and realistic settings. Here we use published demographic models based on data from three human populations (Yoruba, descendants of northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference procedures. We find that several of the demographic histories inferred by the whole genome-based methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data. Genetics Society of America 2017-09-11 /pmc/articles/PMC5677151/ /pubmed/28893846 http://dx.doi.org/10.1534/g3.117.300259 Text en Copyright © 2017 Beichman et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Beichman, Annabel C. Phung, Tanya N. Lohmueller, Kirk E. Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title | Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title_full | Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title_fullStr | Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title_full_unstemmed | Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title_short | Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories |
title_sort | comparison of single genome and allele frequency data reveals discordant demographic histories |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677151/ https://www.ncbi.nlm.nih.gov/pubmed/28893846 http://dx.doi.org/10.1534/g3.117.300259 |
work_keys_str_mv | AT beichmanannabelc comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories AT phungtanyan comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories AT lohmuellerkirke comparisonofsinglegenomeandallelefrequencydatarevealsdiscordantdemographichistories |