Cargando…
Population divergence time estimation using individual lineage label switching
Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual li...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982400/ https://www.ncbi.nlm.nih.gov/pubmed/35166790 http://dx.doi.org/10.1093/g3journal/jkac040 |
_version_ | 1784681803082104832 |
---|---|
author | Beerli, Peter Ashki, Haleh Mashayekhi, Somayeh Palczewski, Michal |
author_facet | Beerli, Peter Ashki, Haleh Mashayekhi, Somayeh Palczewski, Michal |
author_sort | Beerli, Peter |
collection | PubMed |
description | Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual lineages in a genealogy. The time for such a splitting event is drawn from a hazard function of the truncated normal distribution. This allows easy integration into the standard coalescence framework used in programs such as Migrate. We explore the accuracy of the new inference method with simulated population splittings over a wide range of divergence time values and with a reanalysis of a dataset of 5 populations consisting of 3 present-day populations (Africans, Europeans, Asian) and 2 archaic samples (Altai and Ust’Isthim). Evaluations of simple divergence models without subsequent geneflow show high accuracy, whereas the accuracy of the results of isolation with migration models depends on the magnitude of the immigration rate. High immigration rates lead to a time of the most recent common ancestor of the sample that, looking backward in time, predates the divergence time. Even with many independent loci, accurate estimation of the divergence time with high immigration rates becomes problematic. Our comparison to other software tools reveals that our lineage-switching method, implemented in Migrate, is comparable to IMa2p. The software Migrate can run large numbers of sequence loci (>1,000) on computer clusters in parallel. |
format | Online Article Text |
id | pubmed-8982400 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89824002022-04-05 Population divergence time estimation using individual lineage label switching Beerli, Peter Ashki, Haleh Mashayekhi, Somayeh Palczewski, Michal G3 (Bethesda) Investigation Divergence time estimation from multilocus genetic data has become common in population genetics and phylogenetics. We present a new Bayesian inference method that treats the divergence time as a random variable. The divergence time is calculated from an assembly of splitting events on individual lineages in a genealogy. The time for such a splitting event is drawn from a hazard function of the truncated normal distribution. This allows easy integration into the standard coalescence framework used in programs such as Migrate. We explore the accuracy of the new inference method with simulated population splittings over a wide range of divergence time values and with a reanalysis of a dataset of 5 populations consisting of 3 present-day populations (Africans, Europeans, Asian) and 2 archaic samples (Altai and Ust’Isthim). Evaluations of simple divergence models without subsequent geneflow show high accuracy, whereas the accuracy of the results of isolation with migration models depends on the magnitude of the immigration rate. High immigration rates lead to a time of the most recent common ancestor of the sample that, looking backward in time, predates the divergence time. Even with many independent loci, accurate estimation of the divergence time with high immigration rates becomes problematic. Our comparison to other software tools reveals that our lineage-switching method, implemented in Migrate, is comparable to IMa2p. The software Migrate can run large numbers of sequence loci (>1,000) on computer clusters in parallel. Oxford University Press 2022-02-15 /pmc/articles/PMC8982400/ /pubmed/35166790 http://dx.doi.org/10.1093/g3journal/jkac040 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigation Beerli, Peter Ashki, Haleh Mashayekhi, Somayeh Palczewski, Michal Population divergence time estimation using individual lineage label switching |
title | Population divergence time estimation using individual lineage label switching |
title_full | Population divergence time estimation using individual lineage label switching |
title_fullStr | Population divergence time estimation using individual lineage label switching |
title_full_unstemmed | Population divergence time estimation using individual lineage label switching |
title_short | Population divergence time estimation using individual lineage label switching |
title_sort | population divergence time estimation using individual lineage label switching |
topic | Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982400/ https://www.ncbi.nlm.nih.gov/pubmed/35166790 http://dx.doi.org/10.1093/g3journal/jkac040 |
work_keys_str_mv | AT beerlipeter populationdivergencetimeestimationusingindividuallineagelabelswitching AT ashkihaleh populationdivergencetimeestimationusingindividuallineagelabelswitching AT mashayekhisomayeh populationdivergencetimeestimationusingindividuallineagelabelswitching AT palczewskimichal populationdivergencetimeestimationusingindividuallineagelabelswitching |