Cargando…

Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data

The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral, and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through...

Descripción completa

Detalles Bibliográficos
Autores principales: Tataru, Paula, Mollion, Maéva, Glémin, Sylvain, Bataillon, Thomas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5676230/
https://www.ncbi.nlm.nih.gov/pubmed/28951530
http://dx.doi.org/10.1534/genetics.117.300323
_version_ 1783277025978482688
author Tataru, Paula
Mollion, Maéva
Glémin, Sylvain
Bataillon, Thomas
author_facet Tataru, Paula
Mollion, Maéva
Glémin, Sylvain
Bataillon, Thomas
author_sort Tataru, Paula
collection PubMed
description The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral, and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through the site frequency spectrum (SFS) and divergence data, has been a longstanding goal of evolutionary genetics. A widespread assumption shared by previous inference methods is that beneficial mutations only contribute negligibly to the polymorphism data. Hence, a DFE comprising only deleterious mutations tends to be estimated from SFS data, and α is then predicted by contrasting the SFS with divergence data from an outgroup. We develop a hierarchical probabilistic framework that extends previous methods to infer DFE and α from polymorphism data alone. We use extensive simulations to examine the performance of our method. While an outgroup is still needed to obtain an unfolded SFS, we show that both a DFE, comprising both deleterious and beneficial mutations, and α can be inferred without using divergence data. We also show that not accounting for the contribution of beneficial mutations to polymorphism data leads to substantially biased estimates of the DFE and α. We compare our framework with one of the most widely used inference methods available and apply it on a recently published chimpanzee exome data set.
format Online
Article
Text
id pubmed-5676230
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-56762302017-11-09 Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data Tataru, Paula Mollion, Maéva Glémin, Sylvain Bataillon, Thomas Genetics Investigations The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral, and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through the site frequency spectrum (SFS) and divergence data, has been a longstanding goal of evolutionary genetics. A widespread assumption shared by previous inference methods is that beneficial mutations only contribute negligibly to the polymorphism data. Hence, a DFE comprising only deleterious mutations tends to be estimated from SFS data, and α is then predicted by contrasting the SFS with divergence data from an outgroup. We develop a hierarchical probabilistic framework that extends previous methods to infer DFE and α from polymorphism data alone. We use extensive simulations to examine the performance of our method. While an outgroup is still needed to obtain an unfolded SFS, we show that both a DFE, comprising both deleterious and beneficial mutations, and α can be inferred without using divergence data. We also show that not accounting for the contribution of beneficial mutations to polymorphism data leads to substantially biased estimates of the DFE and α. We compare our framework with one of the most widely used inference methods available and apply it on a recently published chimpanzee exome data set. Genetics Society of America 2017-11 2017-09-25 /pmc/articles/PMC5676230/ /pubmed/28951530 http://dx.doi.org/10.1534/genetics.117.300323 Text en Copyright © 2017 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Tataru, Paula
Mollion, Maéva
Glémin, Sylvain
Bataillon, Thomas
Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title_full Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title_fullStr Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title_full_unstemmed Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title_short Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data
title_sort inference of distribution of fitness effects and proportion of adaptive substitutions from polymorphism data
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5676230/
https://www.ncbi.nlm.nih.gov/pubmed/28951530
http://dx.doi.org/10.1534/genetics.117.300323
work_keys_str_mv AT tatarupaula inferenceofdistributionoffitnesseffectsandproportionofadaptivesubstitutionsfrompolymorphismdata
AT mollionmaeva inferenceofdistributionoffitnesseffectsandproportionofadaptivesubstitutionsfrompolymorphismdata
AT gleminsylvain inferenceofdistributionoffitnesseffectsandproportionofadaptivesubstitutionsfrompolymorphismdata
AT bataillonthomas inferenceofdistributionoffitnesseffectsandproportionofadaptivesubstitutionsfrompolymorphismdata