Cargando…

Estimating the Strength of Selective Sweeps from Deep Population Diversity Data

Selective sweeps are typically associated with a local reduction of genetic diversity around the adaptive site. However, selective sweeps can also quickly carry neutral mutations to observable population frequencies if they arise early in a sweep and hitchhike with the adaptive allele. We show that...

Descripción completa

Detalles Bibliográficos
Autores principales: Messer, Philipp W., Neher, Richard A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3374320/
https://www.ncbi.nlm.nih.gov/pubmed/22491190
http://dx.doi.org/10.1534/genetics.112.138461
_version_ 1782235631786655744
author Messer, Philipp W.
Neher, Richard A.
author_facet Messer, Philipp W.
Neher, Richard A.
author_sort Messer, Philipp W.
collection PubMed
description Selective sweeps are typically associated with a local reduction of genetic diversity around the adaptive site. However, selective sweeps can also quickly carry neutral mutations to observable population frequencies if they arise early in a sweep and hitchhike with the adaptive allele. We show that the interplay between mutation and exponential amplification through hitchhiking results in a characteristic frequency spectrum of the resulting novel haplotype variation that depends only on the ratio of the mutation rate and the selection coefficient of the sweep. On the basis of this result, we develop an estimator for the selection coefficient driving a sweep. Since this estimator utilizes the novel variation arising from mutations during a sweep, it does not rely on preexisting variation and can also be applied to loci that lack recombination. Compared with standard approaches that infer selection coefficients from the size of dips in genetic diversity around the adaptive site, our estimator requires much shorter sequences but sampled at high population depth to capture low-frequency variants; given such data, it consistently outperforms standard approaches. We investigate analytically and numerically how the accuracy of our estimator is affected by the decay of the sweep pattern over time as a consequence of random genetic drift and discuss potential effects of recombination, soft sweeps, and demography. As an example for its use, we apply our estimator to deep sequencing data from human immunodeficiency virus populations.
format Online
Article
Text
id pubmed-3374320
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-33743202012-06-25 Estimating the Strength of Selective Sweeps from Deep Population Diversity Data Messer, Philipp W. Neher, Richard A. Genetics Investigations Selective sweeps are typically associated with a local reduction of genetic diversity around the adaptive site. However, selective sweeps can also quickly carry neutral mutations to observable population frequencies if they arise early in a sweep and hitchhike with the adaptive allele. We show that the interplay between mutation and exponential amplification through hitchhiking results in a characteristic frequency spectrum of the resulting novel haplotype variation that depends only on the ratio of the mutation rate and the selection coefficient of the sweep. On the basis of this result, we develop an estimator for the selection coefficient driving a sweep. Since this estimator utilizes the novel variation arising from mutations during a sweep, it does not rely on preexisting variation and can also be applied to loci that lack recombination. Compared with standard approaches that infer selection coefficients from the size of dips in genetic diversity around the adaptive site, our estimator requires much shorter sequences but sampled at high population depth to capture low-frequency variants; given such data, it consistently outperforms standard approaches. We investigate analytically and numerically how the accuracy of our estimator is affected by the decay of the sweep pattern over time as a consequence of random genetic drift and discuss potential effects of recombination, soft sweeps, and demography. As an example for its use, we apply our estimator to deep sequencing data from human immunodeficiency virus populations. Genetics Society of America 2012-06 /pmc/articles/PMC3374320/ /pubmed/22491190 http://dx.doi.org/10.1534/genetics.112.138461 Text en Copyright © 2012 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Messer, Philipp W.
Neher, Richard A.
Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title_full Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title_fullStr Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title_full_unstemmed Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title_short Estimating the Strength of Selective Sweeps from Deep Population Diversity Data
title_sort estimating the strength of selective sweeps from deep population diversity data
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3374320/
https://www.ncbi.nlm.nih.gov/pubmed/22491190
http://dx.doi.org/10.1534/genetics.112.138461
work_keys_str_mv AT messerphilippw estimatingthestrengthofselectivesweepsfromdeeppopulationdiversitydata
AT neherricharda estimatingthestrengthofselectivesweepsfromdeeppopulationdiversitydata