Cargando…

Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model

Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A commo...

Descripción completa

Detalles Bibliográficos
Autores principales: Paris, Cyriel, Servin, Bertrand, Boitard, Simon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6893182/
https://www.ncbi.nlm.nih.gov/pubmed/31597676
http://dx.doi.org/10.1534/g3.119.400778
_version_ 1783476156474851328
author Paris, Cyriel
Servin, Bertrand
Boitard, Simon
author_facet Paris, Cyriel
Servin, Bertrand
Boitard, Simon
author_sort Paris, Cyriel
collection PubMed
description Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A common approach to analyze such data is to model the temporal evolution of an allele frequency as a Markov chain. Based on this principle, several methods have been proposed to infer selection intensity. One of their differences lies in how they model the transition probabilities of the Markov chain. Using the Wright-Fisher model is a natural choice but its computational cost is prohibitive for large population sizes so approximations to this model based on parametric distributions have been proposed. Here, we compared the performance of some of these approximations with respect to their power to detect selection and their estimation of the selection coefficient. We developped a new generic Hidden Markov Model likelihood calculator and applied it on genetic time series simulated under various evolutionary scenarios. The Beta with spikes approximation, which combines discrete fixation probabilities with a continuous Beta distribution, was found to perform consistently better than the others. This distribution provides an almost perfect fit to the Wright-Fisher model in terms of selection inference, for a computational cost that does not increase with population size. We further evaluated this model for population sizes not accessible to the Wright-Fisher model and illustrated its performance on a dataset of two divergently selected chicken populations.
format Online
Article
Text
id pubmed-6893182
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-68931822019-12-05 Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model Paris, Cyriel Servin, Bertrand Boitard, Simon G3 (Bethesda) Investigations Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A common approach to analyze such data is to model the temporal evolution of an allele frequency as a Markov chain. Based on this principle, several methods have been proposed to infer selection intensity. One of their differences lies in how they model the transition probabilities of the Markov chain. Using the Wright-Fisher model is a natural choice but its computational cost is prohibitive for large population sizes so approximations to this model based on parametric distributions have been proposed. Here, we compared the performance of some of these approximations with respect to their power to detect selection and their estimation of the selection coefficient. We developped a new generic Hidden Markov Model likelihood calculator and applied it on genetic time series simulated under various evolutionary scenarios. The Beta with spikes approximation, which combines discrete fixation probabilities with a continuous Beta distribution, was found to perform consistently better than the others. This distribution provides an almost perfect fit to the Wright-Fisher model in terms of selection inference, for a computational cost that does not increase with population size. We further evaluated this model for population sizes not accessible to the Wright-Fisher model and illustrated its performance on a dataset of two divergently selected chicken populations. Genetics Society of America 2019-10-09 /pmc/articles/PMC6893182/ /pubmed/31597676 http://dx.doi.org/10.1534/g3.119.400778 Text en Copyright © 2019 Paris et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Paris, Cyriel
Servin, Bertrand
Boitard, Simon
Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title_full Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title_fullStr Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title_full_unstemmed Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title_short Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
title_sort inference of selection from genetic time series using various parametric approximations to the wright-fisher model
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6893182/
https://www.ncbi.nlm.nih.gov/pubmed/31597676
http://dx.doi.org/10.1534/g3.119.400778
work_keys_str_mv AT pariscyriel inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel
AT servinbertrand inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel
AT boitardsimon inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel