Cargando…

Population divergence time estimation using individual lineage label switching

Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual li...

Descripción completa

Detalles Bibliográficos
Autores principales: Beerli, Peter, Ashki, Haleh, Mashayekhi, Somayeh, Palczewski, Michal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982400/
https://www.ncbi.nlm.nih.gov/pubmed/35166790
http://dx.doi.org/10.1093/g3journal/jkac040
_version_ 1784681803082104832
author Beerli, Peter
Ashki, Haleh
Mashayekhi, Somayeh
Palczewski, Michal
author_facet Beerli, Peter
Ashki, Haleh
Mashayekhi, Somayeh
Palczewski, Michal
author_sort Beerli, Peter
collection PubMed
description Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual lineages in a genealogy. The time for such a splitting event is drawn from a hazard function of the truncated normal distribution. This allows easy integration into the standard coalescence framework used in programs such as Migrate. We explore the accuracy of the new inference method with simulated population splittings over a wide range of divergence time values and with a reanalysis of a dataset of 5 populations consisting of 3 present-day populations (Africans, Europeans, Asian) and 2 archaic samples (Altai and Ust’Isthim). Evaluations of simple divergence models without subsequent geneflow show high accuracy, whereas the accuracy of the results of isolation with migration models depends on the magnitude of the immigration rate. High immigration rates lead to a time of the most recent common ancestor of the sample that, looking backward in time, predates the divergence time. Even with many independent loci, accurate estimation of the divergence time with high immigration rates becomes problematic. Our comparison to other software tools reveals that our lineage-switching method, implemented in Migrate, is comparable to IMa2p. The software Migrate can run large numbers of sequence loci (>1,000) on computer clusters in parallel.
format Online
Article
Text
id pubmed-8982400
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89824002022-04-05 Population divergence time estimation using individual lineage label switching Beerli, Peter Ashki, Haleh Mashayekhi, Somayeh Palczewski, Michal G3 (Bethesda) Investigation Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual lineages in a genealogy. The time for such a splitting event is drawn from a hazard function of the truncated normal distribution. This allows easy integration into the standard coalescence framework used in programs such as Migrate. We explore the accuracy of the new inference method with simulated population splittings over a wide range of divergence time values and with a reanalysis of a dataset of 5 populations consisting of 3 present-day populations (Africans, Europeans, Asian) and 2 archaic samples (Altai and Ust’Isthim). Evaluations of simple divergence models without subsequent geneflow show high accuracy, whereas the accuracy of the results of isolation with migration models depends on the magnitude of the immigration rate. High immigration rates lead to a time of the most recent common ancestor of the sample that, looking backward in time, predates the divergence time. Even with many independent loci, accurate estimation of the divergence time with high immigration rates becomes problematic. Our comparison to other software tools reveals that our lineage-switching method, implemented in Migrate, is comparable to IMa2p. The software Migrate can run large numbers of sequence loci (>1,000) on computer clusters in parallel. Oxford University Press 2022-02-15 /pmc/articles/PMC8982400/ /pubmed/35166790 http://dx.doi.org/10.1093/g3journal/jkac040 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Beerli, Peter
Ashki, Haleh
Mashayekhi, Somayeh
Palczewski, Michal
Population divergence time estimation using individual lineage label switching
title Population divergence time estimation using individual lineage label switching
title_full Population divergence time estimation using individual lineage label switching
title_fullStr Population divergence time estimation using individual lineage label switching
title_full_unstemmed Population divergence time estimation using individual lineage label switching
title_short Population divergence time estimation using individual lineage label switching
title_sort population divergence time estimation using individual lineage label switching
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982400/
https://www.ncbi.nlm.nih.gov/pubmed/35166790
http://dx.doi.org/10.1093/g3journal/jkac040
work_keys_str_mv AT beerlipeter populationdivergencetimeestimationusingindividuallineagelabelswitching
AT ashkihaleh populationdivergencetimeestimationusingindividuallineagelabelswitching
AT mashayekhisomayeh populationdivergencetimeestimationusingindividuallineagelabelswitching
AT palczewskimichal populationdivergencetimeestimationusingindividuallineagelabelswitching