Cargando…
On the limits of fitting complex models of population history to f-statistics
Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
eLife Sciences Publications, Ltd
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310323/ https://www.ncbi.nlm.nih.gov/pubmed/37057893 http://dx.doi.org/10.7554/eLife.85492 |
_version_ | 1785066514200657920 |
---|---|
author | Maier, Robert Flegontov, Pavel Flegontova, Olga Işıldak, Ulaş Changmai, Piya Reich, David |
author_facet | Maier, Robert Flegontov, Pavel Flegontova, Olga Işıldak, Ulaş Changmai, Piya Reich, David |
author_sort | Maier, Robert |
collection | PubMed |
description | Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this findGraphs tool within a software package, ADMIXTOOLS 2, which is a reimplementation of the ADMIXTOOLS software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature. |
format | Online Article Text |
id | pubmed-10310323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | eLife Sciences Publications, Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-103103232023-06-30 On the limits of fitting complex models of population history to f-statistics Maier, Robert Flegontov, Pavel Flegontova, Olga Işıldak, Ulaş Changmai, Piya Reich, David eLife Evolutionary Biology Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this findGraphs tool within a software package, ADMIXTOOLS 2, which is a reimplementation of the ADMIXTOOLS software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature. eLife Sciences Publications, Ltd 2023-06-29 /pmc/articles/PMC10310323/ /pubmed/37057893 http://dx.doi.org/10.7554/eLife.85492 Text en © 2023, Maier, Flegontov et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited. |
spellingShingle | Evolutionary Biology Maier, Robert Flegontov, Pavel Flegontova, Olga Işıldak, Ulaş Changmai, Piya Reich, David On the limits of fitting complex models of population history to f-statistics |
title | On the limits of fitting complex models of population history to f-statistics |
title_full | On the limits of fitting complex models of population history to f-statistics |
title_fullStr | On the limits of fitting complex models of population history to f-statistics |
title_full_unstemmed | On the limits of fitting complex models of population history to f-statistics |
title_short | On the limits of fitting complex models of population history to f-statistics |
title_sort | on the limits of fitting complex models of population history to f-statistics |
topic | Evolutionary Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310323/ https://www.ncbi.nlm.nih.gov/pubmed/37057893 http://dx.doi.org/10.7554/eLife.85492 |
work_keys_str_mv | AT maierrobert onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics AT flegontovpavel onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics AT flegontovaolga onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics AT isıldakulas onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics AT changmaipiya onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics AT reichdavid onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics |