Cargando…

Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection

Longitudinal allele frequency data are becoming increasingly prevalent. Such samples permit statistical inference of the population genetics parameters that influence the fate of mutant variants. To infer these parameters by maximum likelihood, the mutant frequency is often assumed to evolve accordi...

Descripción completa

Detalles Bibliográficos
Autores principales: Lacerda, Miguel, Seoighe, Cathal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4224163/
https://www.ncbi.nlm.nih.gov/pubmed/25213172
http://dx.doi.org/10.1534/genetics.114.167957
_version_ 1782343315539099648
author Lacerda, Miguel
Seoighe, Cathal
author_facet Lacerda, Miguel
Seoighe, Cathal
author_sort Lacerda, Miguel
collection PubMed
description Longitudinal allele frequency data are becoming increasingly prevalent. Such samples permit statistical inference of the population genetics parameters that influence the fate of mutant variants. To infer these parameters by maximum likelihood, the mutant frequency is often assumed to evolve according to the Wright–Fisher model. For computational reasons, this discrete model is commonly approximated by a diffusion process that requires the assumption that the forces of natural selection and mutation are weak. This assumption is not always appropriate. For example, mutations that impart drug resistance in pathogens may evolve under strong selective pressure. Here, we present an alternative approximation to the mutant-frequency distribution that does not make any assumptions about the magnitude of selection or mutation and is much more computationally efficient than the standard diffusion approximation. Simulation studies are used to compare the performance of our method to that of the Wright–Fisher and Gaussian diffusion approximations. For large populations, our method is found to provide a much better approximation to the mutant-frequency distribution when selection is strong, while all three methods perform comparably when selection is weak. Importantly, maximum-likelihood estimates of the selection coefficient are severely attenuated when selection is strong under the two diffusion models, but not when our method is used. This is further demonstrated with an application to mutant-frequency data from an experimental study of bacteriophage evolution. We therefore recommend our method for estimating the selection coefficient when the effective population size is too large to utilize the discrete Wright–Fisher model.
format Online
Article
Text
id pubmed-4224163
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-42241632015-11-01 Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection Lacerda, Miguel Seoighe, Cathal Genetics Investigations Longitudinal allele frequency data are becoming increasingly prevalent. Such samples permit statistical inference of the population genetics parameters that influence the fate of mutant variants. To infer these parameters by maximum likelihood, the mutant frequency is often assumed to evolve according to the Wright–Fisher model. For computational reasons, this discrete model is commonly approximated by a diffusion process that requires the assumption that the forces of natural selection and mutation are weak. This assumption is not always appropriate. For example, mutations that impart drug resistance in pathogens may evolve under strong selective pressure. Here, we present an alternative approximation to the mutant-frequency distribution that does not make any assumptions about the magnitude of selection or mutation and is much more computationally efficient than the standard diffusion approximation. Simulation studies are used to compare the performance of our method to that of the Wright–Fisher and Gaussian diffusion approximations. For large populations, our method is found to provide a much better approximation to the mutant-frequency distribution when selection is strong, while all three methods perform comparably when selection is weak. Importantly, maximum-likelihood estimates of the selection coefficient are severely attenuated when selection is strong under the two diffusion models, but not when our method is used. This is further demonstrated with an application to mutant-frequency data from an experimental study of bacteriophage evolution. We therefore recommend our method for estimating the selection coefficient when the effective population size is too large to utilize the discrete Wright–Fisher model. Genetics Society of America 2014-11 2014-09-10 /pmc/articles/PMC4224163/ /pubmed/25213172 http://dx.doi.org/10.1534/genetics.114.167957 Text en Copyright © 2014 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Lacerda, Miguel
Seoighe, Cathal
Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title_full Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title_fullStr Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title_full_unstemmed Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title_short Population Genetics Inference for Longitudinally-Sampled Mutants Under Strong Selection
title_sort population genetics inference for longitudinally-sampled mutants under strong selection
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4224163/
https://www.ncbi.nlm.nih.gov/pubmed/25213172
http://dx.doi.org/10.1534/genetics.114.167957
work_keys_str_mv AT lacerdamiguel populationgeneticsinferenceforlongitudinallysampledmutantsunderstrongselection
AT seoighecathal populationgeneticsinferenceforlongitudinallysampledmutantsunderstrongselection