Cargando…

Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data

Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixture...

Descripción completa

Detalles Bibliográficos
Autores principales: Pickrell, Joseph K., Pritchard, Jonathan K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499260/
https://www.ncbi.nlm.nih.gov/pubmed/23166502
http://dx.doi.org/10.1371/journal.pgen.1002967
_version_ 1782249930571644928
author Pickrell, Joseph K.
Pritchard, Jonathan K.
author_facet Pickrell, Joseph K.
Pritchard, Jonathan K.
author_sort Pickrell, Joseph K.
collection PubMed
description Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In our model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data, we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and “ancient” Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com.
format Online
Article
Text
id pubmed-3499260
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34992602012-11-19 Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data Pickrell, Joseph K. Pritchard, Jonathan K. PLoS Genet Research Article Many aspects of the historical relationships between populations in a species are reflected in genetic data. Inferring these relationships from genetic data, however, remains a challenging task. In this paper, we present a statistical model for inferring the patterns of population splits and mixtures in multiple populations. In our model, the sampled populations in a species are related to their common ancestor through a graph of ancestral populations. Using genome-wide allele frequency data and a Gaussian approximation to genetic drift, we infer the structure of this graph. We applied this method to a set of 55 human populations and a set of 82 dog breeds and wild canids. In both species, we show that a simple bifurcating tree does not fully describe the data; in contrast, we infer many migration events. While some of the migration events that we find have been detected previously, many have not. For example, in the human data, we infer that Cambodians trace approximately 16% of their ancestry to a population ancestral to other extant East Asian populations. In the dog data, we infer that both the boxer and basenji trace a considerable fraction of their ancestry (9% and 25%, respectively) to wolves subsequent to domestication and that East Asian toy breeds (the Shih Tzu and the Pekingese) result from admixture between modern toy breeds and “ancient” Asian breeds. Software implementing the model described here, called TreeMix, is available at http://treemix.googlecode.com. Public Library of Science 2012-11-15 /pmc/articles/PMC3499260/ /pubmed/23166502 http://dx.doi.org/10.1371/journal.pgen.1002967 Text en © 2012 Pickrell, Pritchard http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Pickrell, Joseph K.
Pritchard, Jonathan K.
Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title_full Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title_fullStr Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title_full_unstemmed Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title_short Inference of Population Splits and Mixtures from Genome-Wide Allele Frequency Data
title_sort inference of population splits and mixtures from genome-wide allele frequency data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3499260/
https://www.ncbi.nlm.nih.gov/pubmed/23166502
http://dx.doi.org/10.1371/journal.pgen.1002967
work_keys_str_mv AT pickrelljosephk inferenceofpopulationsplitsandmixturesfromgenomewideallelefrequencydata
AT pritchardjonathank inferenceofpopulationsplitsandmixturesfromgenomewideallelefrequencydata