Sweeps in time: leveraging the joint distribution of branch lengths
Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diver...
Main authors: | Bisschop, Gertjan; Lohse, Konrad; Setter, Derek |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2021 |
Subjects: | Investigation |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633083/ https://www.ncbi.nlm.nih.gov/pubmed/34849880 http://dx.doi.org/10.1093/genetics/iyab119 |
_version_ | 1784607871178113024 |
---|---|
author | Bisschop, Gertjan Lohse, Konrad Setter, Derek |
author_facet | Bisschop, Gertjan Lohse, Konrad Setter, Derek |
author_sort | Bisschop, Gertjan |
collection | PubMed |
description | Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome. |
format | Online Article Text |
id | pubmed-8633083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-8633083 2021-12-01 Sweeps in time: leveraging the joint distribution of branch lengths Bisschop, Gertjan Lohse, Konrad Setter, Derek Genetics Investigation Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome. Oxford University Press 2021-08-03 /pmc/articles/PMC8633083/ /pubmed/34849880 http://dx.doi.org/10.1093/genetics/iyab119 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigation Bisschop, Gertjan Lohse, Konrad Setter, Derek Sweeps in time: leveraging the joint distribution of branch lengths |
title | Sweeps in time: leveraging the joint distribution of branch lengths |
title_full | Sweeps in time: leveraging the joint distribution of branch lengths |
title_fullStr | Sweeps in time: leveraging the joint distribution of branch lengths |
title_full_unstemmed | Sweeps in time: leveraging the joint distribution of branch lengths |
title_short | Sweeps in time: leveraging the joint distribution of branch lengths |
title_sort | sweeps in time: leveraging the joint distribution of branch lengths |
topic | Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633083/ https://www.ncbi.nlm.nih.gov/pubmed/34849880 http://dx.doi.org/10.1093/genetics/iyab119 |
work_keys_str_mv | AT bisschopgertjan sweepsintimeleveragingthejointdistributionofbranchlengths AT lohsekonrad sweepsintimeleveragingthejointdistributionofbranchlengths AT setterderek sweepsintimeleveragingthejointdistributionofbranchlengths |