Cargando…

On the limits of fitting complex models of population history to f-statistics

Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict...

Descripción completa

Detalles Bibliográficos
Autores principales: Maier, Robert, Flegontov, Pavel, Flegontova, Olga, Işıldak, Ulaş, Changmai, Piya, Reich, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310323/
https://www.ncbi.nlm.nih.gov/pubmed/37057893
http://dx.doi.org/10.7554/eLife.85492
_version_ 1785066514200657920
author Maier, Robert
Flegontov, Pavel
Flegontova, Olga
Işıldak, Ulaş
Changmai, Piya
Reich, David
author_facet Maier, Robert
Flegontov, Pavel
Flegontova, Olga
Işıldak, Ulaş
Changmai, Piya
Reich, David
author_sort Maier, Robert
collection PubMed
description Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this findGraphs tool within a software package, ADMIXTOOLS 2, which is a reimplementation of the ADMIXTOOLS software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature.
format Online
Article
Text
id pubmed-10310323
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-103103232023-06-30 On the limits of fitting complex models of population history to f-statistics Maier, Robert Flegontov, Pavel Flegontova, Olga Işıldak, Ulaş Changmai, Piya Reich, David eLife Evolutionary Biology Our understanding of population history in deep time has been assisted by fitting admixture graphs (AGs) to data: models that specify the ordering of population splits and mixtures, which along with the amount of genetic drift and the proportions of mixture, is the only information needed to predict the patterns of allele frequency correlation among populations. The space of possible AGs relating populations is vast, and thus most published studies have identified fitting AGs through a manual process driven by prior hypotheses, leaving the majority of alternative models unexplored. Here, we develop a method for systematically searching the space of all AGs that can incorporate non-genetic information in the form of topology constraints. We implement this findGraphs tool within a software package, ADMIXTOOLS 2, which is a reimplementation of the ADMIXTOOLS software with new features and large performance gains. We apply this methodology to identify alternative models to AGs that played key roles in eight publications and find that in nearly all cases many alternative models fit nominally or significantly better than the published one. Our results suggest that strong claims about population history from AGs should only be made when all well-fitting and temporally plausible models share common topological features. Our re-evaluation of published data also provides insight into the population histories of humans, dogs, and horses, identifying features that are stable across the models we explored, as well as scenarios of populations relationships that differ in important ways from models that have been highlighted in the literature. eLife Sciences Publications, Ltd 2023-06-29 /pmc/articles/PMC10310323/ /pubmed/37057893 http://dx.doi.org/10.7554/eLife.85492 Text en © 2023, Maier, Flegontov et al https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Evolutionary Biology
Maier, Robert
Flegontov, Pavel
Flegontova, Olga
Işıldak, Ulaş
Changmai, Piya
Reich, David
On the limits of fitting complex models of population history to f-statistics
title On the limits of fitting complex models of population history to f-statistics
title_full On the limits of fitting complex models of population history to f-statistics
title_fullStr On the limits of fitting complex models of population history to f-statistics
title_full_unstemmed On the limits of fitting complex models of population history to f-statistics
title_short On the limits of fitting complex models of population history to f-statistics
title_sort on the limits of fitting complex models of population history to f-statistics
topic Evolutionary Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10310323/
https://www.ncbi.nlm.nih.gov/pubmed/37057893
http://dx.doi.org/10.7554/eLife.85492
work_keys_str_mv AT maierrobert onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics
AT flegontovpavel onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics
AT flegontovaolga onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics
AT isıldakulas onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics
AT changmaipiya onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics
AT reichdavid onthelimitsoffittingcomplexmodelsofpopulationhistorytofstatistics