Cargando…
Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow
The recent explosion in available genetic data has led to significant advances in understanding the demographic histories of and relationships among human populations. It is still a challenge, however, to infer reliable parameter values for complicated models involving many populations. Here, we pre...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3708505/ https://www.ncbi.nlm.nih.gov/pubmed/23709261 http://dx.doi.org/10.1093/molbev/mst099 |
_version_ | 1782276624073359360 |
---|---|
author | Lipson, Mark Loh, Po-Ru Levin, Alex Reich, David Patterson, Nick Berger, Bonnie |
author_facet | Lipson, Mark Loh, Po-Ru Levin, Alex Reich, David Patterson, Nick Berger, Bonnie |
author_sort | Lipson, Mark |
collection | PubMed |
description | The recent explosion in available genetic data has led to significant advances in understanding the demographic histories of and relationships among human populations. It is still a challenge, however, to infer reliable parameter values for complicated models involving many populations. Here, we present MixMapper, an efficient, interactive method for constructing phylogenetic trees including admixture events using single nucleotide polymorphism (SNP) genotype data. MixMapper implements a novel two-phase approach to admixture inference using moment statistics, first building an unadmixed scaffold tree and then adding admixed populations by solving systems of equations that express allele frequency divergences in terms of mixture parameters. Importantly, all features of the model, including topology, sources of gene flow, branch lengths, and mixture proportions, are optimized automatically from the data and include estimates of statistical uncertainty. MixMapper also uses a new method to express branch lengths in easily interpretable drift units. We apply MixMapper to recently published data for Human Genome Diversity Cell Line Panel individuals genotyped on a SNP array designed especially for use in population genetics studies, obtaining confident results for 30 populations, 20 of them admixed. Notably, we confirm a signal of ancient admixture in European populations—including previously undetected admixture in Sardinians and Basques—involving a proportion of 20–40% ancient northern Eurasian ancestry. |
format | Online Article Text |
id | pubmed-3708505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-37085052013-07-11 Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow Lipson, Mark Loh, Po-Ru Levin, Alex Reich, David Patterson, Nick Berger, Bonnie Mol Biol Evol Fast Tracks The recent explosion in available genetic data has led to significant advances in understanding the demographic histories of and relationships among human populations. It is still a challenge, however, to infer reliable parameter values for complicated models involving many populations. Here, we present MixMapper, an efficient, interactive method for constructing phylogenetic trees including admixture events using single nucleotide polymorphism (SNP) genotype data. MixMapper implements a novel two-phase approach to admixture inference using moment statistics, first building an unadmixed scaffold tree and then adding admixed populations by solving systems of equations that express allele frequency divergences in terms of mixture parameters. Importantly, all features of the model, including topology, sources of gene flow, branch lengths, and mixture proportions, are optimized automatically from the data and include estimates of statistical uncertainty. MixMapper also uses a new method to express branch lengths in easily interpretable drift units. We apply MixMapper to recently published data for Human Genome Diversity Cell Line Panel individuals genotyped on a SNP array designed especially for use in population genetics studies, obtaining confident results for 30 populations, 20 of them admixed. Notably, we confirm a signal of ancient admixture in European populations—including previously undetected admixture in Sardinians and Basques—involving a proportion of 20–40% ancient northern Eurasian ancestry. Oxford University Press 2013-08 2013-05-24 /pmc/articles/PMC3708505/ /pubmed/23709261 http://dx.doi.org/10.1093/molbev/mst099 Text en © The Author 2013. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Fast Tracks Lipson, Mark Loh, Po-Ru Levin, Alex Reich, David Patterson, Nick Berger, Bonnie Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title | Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title_full | Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title_fullStr | Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title_full_unstemmed | Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title_short | Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow |
title_sort | efficient moment-based inference of admixture parameters and sources of gene flow |
topic | Fast Tracks |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3708505/ https://www.ncbi.nlm.nih.gov/pubmed/23709261 http://dx.doi.org/10.1093/molbev/mst099 |
work_keys_str_mv | AT lipsonmark efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow AT lohporu efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow AT levinalex efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow AT reichdavid efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow AT pattersonnick efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow AT bergerbonnie efficientmomentbasedinferenceofadmixtureparametersandsourcesofgeneflow |