Cargando…
Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model
Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A commo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6893182/ https://www.ncbi.nlm.nih.gov/pubmed/31597676 http://dx.doi.org/10.1534/g3.119.400778 |
_version_ | 1783476156474851328 |
---|---|
author | Paris, Cyriel Servin, Bertrand Boitard, Simon |
author_facet | Paris, Cyriel Servin, Bertrand Boitard, Simon |
author_sort | Paris, Cyriel |
collection | PubMed |
description | Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A common approach to analyze such data is to model the temporal evolution of an allele frequency as a Markov chain. Based on this principle, several methods have been proposed to infer selection intensity. One of their differences lies in how they model the transition probabilities of the Markov chain. Using the Wright-Fisher model is a natural choice but its computational cost is prohibitive for large population sizes so approximations to this model based on parametric distributions have been proposed. Here, we compared the performance of some of these approximations with respect to their power to detect selection and their estimation of the selection coefficient. We developped a new generic Hidden Markov Model likelihood calculator and applied it on genetic time series simulated under various evolutionary scenarios. The Beta with spikes approximation, which combines discrete fixation probabilities with a continuous Beta distribution, was found to perform consistently better than the others. This distribution provides an almost perfect fit to the Wright-Fisher model in terms of selection inference, for a computational cost that does not increase with population size. We further evaluated this model for population sizes not accessible to the Wright-Fisher model and illustrated its performance on a dataset of two divergently selected chicken populations. |
format | Online Article Text |
id | pubmed-6893182 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-68931822019-12-05 Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model Paris, Cyriel Servin, Bertrand Boitard, Simon G3 (Bethesda) Investigations Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A common approach to analyze such data is to model the temporal evolution of an allele frequency as a Markov chain. Based on this principle, several methods have been proposed to infer selection intensity. One of their differences lies in how they model the transition probabilities of the Markov chain. Using the Wright-Fisher model is a natural choice but its computational cost is prohibitive for large population sizes so approximations to this model based on parametric distributions have been proposed. Here, we compared the performance of some of these approximations with respect to their power to detect selection and their estimation of the selection coefficient. We developped a new generic Hidden Markov Model likelihood calculator and applied it on genetic time series simulated under various evolutionary scenarios. The Beta with spikes approximation, which combines discrete fixation probabilities with a continuous Beta distribution, was found to perform consistently better than the others. This distribution provides an almost perfect fit to the Wright-Fisher model in terms of selection inference, for a computational cost that does not increase with population size. We further evaluated this model for population sizes not accessible to the Wright-Fisher model and illustrated its performance on a dataset of two divergently selected chicken populations. Genetics Society of America 2019-10-09 /pmc/articles/PMC6893182/ /pubmed/31597676 http://dx.doi.org/10.1534/g3.119.400778 Text en Copyright © 2019 Paris et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Paris, Cyriel Servin, Bertrand Boitard, Simon Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title | Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title_full | Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title_fullStr | Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title_full_unstemmed | Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title_short | Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model |
title_sort | inference of selection from genetic time series using various parametric approximations to the wright-fisher model |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6893182/ https://www.ncbi.nlm.nih.gov/pubmed/31597676 http://dx.doi.org/10.1534/g3.119.400778 |
work_keys_str_mv | AT pariscyriel inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel AT servinbertrand inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel AT boitardsimon inferenceofselectionfromgenetictimeseriesusingvariousparametricapproximationstothewrightfishermodel |