Sweeps in time: leveraging the joint distribution of branch lengths

Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diver...

Bibliographic Details
Main authors: Bisschop, Gertjan, Lohse, Konrad, Setter, Derek
Format: Online Article Text
Language: English
Published: Oxford University Press 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633083/
https://www.ncbi.nlm.nih.gov/pubmed/34849880
http://dx.doi.org/10.1093/genetics/iyab119
_version_ 1784607871178113024
author Bisschop, Gertjan
Lohse, Konrad
Setter, Derek
author_facet Bisschop, Gertjan
Lohse, Konrad
Setter, Derek
author_sort Bisschop, Gertjan
collection PubMed
description Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome.
format Online
Article
Text
id pubmed-8633083
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86330832021-12-01 Sweeps in time: leveraging the joint distribution of branch lengths Bisschop, Gertjan Lohse, Konrad Setter, Derek Genetics Investigation Current methods of identifying positively selected regions in the genome are limited in two key ways: the underlying models cannot account for the timing of adaptive events and the comparison between models of selective sweeps and sequence data is generally made via simple summaries of genetic diversity. Here, we develop a tractable method of describing the effect of positive selection on the genealogical histories in the surrounding genome, explicitly modeling both the timing and context of an adaptive event. In addition, our framework allows us to go beyond analyzing polymorphism data via the site frequency spectrum or summaries thereof and instead leverage information contained in patterns of linked variants. Tests on both simulations and a human data example, as well as a comparison to SweepFinder2, show that even with very small sample sizes, our analytic framework has higher power to identify old selective sweeps and to correctly infer both the time and strength of selection. Finally, we derived the marginal distribution of genealogical branch lengths at a locus affected by selection acting at a linked site. This provides a much-needed link between our analytic understanding of the effects of sweeps on sequence variation and recent advances in simulation and heuristic inference procedures that allow researchers to examine the sequence of genealogical histories along the genome. Oxford University Press 2021-08-03 /pmc/articles/PMC8633083/ /pubmed/34849880 http://dx.doi.org/10.1093/genetics/iyab119 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. 
https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Bisschop, Gertjan
Lohse, Konrad
Setter, Derek
Sweeps in time: leveraging the joint distribution of branch lengths
title Sweeps in time: leveraging the joint distribution of branch lengths
title_full Sweeps in time: leveraging the joint distribution of branch lengths
title_fullStr Sweeps in time: leveraging the joint distribution of branch lengths
title_full_unstemmed Sweeps in time: leveraging the joint distribution of branch lengths
title_short Sweeps in time: leveraging the joint distribution of branch lengths
title_sort sweeps in time: leveraging the joint distribution of branch lengths
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633083/
https://www.ncbi.nlm.nih.gov/pubmed/34849880
http://dx.doi.org/10.1093/genetics/iyab119
work_keys_str_mv AT bisschopgertjan sweepsintimeleveragingthejointdistributionofbranchlengths
AT lohsekonrad sweepsintimeleveragingthejointdistributionofbranchlengths
AT setterderek sweepsintimeleveragingthejointdistributionofbranchlengths