Cargando…

Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution

Genomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of...

Descripción completa

Detalles Bibliográficos
Autores principales: Terhorst, Jonathan, Schlötterer, Christian, Song, Yun S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4388667/
https://www.ncbi.nlm.nih.gov/pubmed/25849855
http://dx.doi.org/10.1371/journal.pgen.1005069
_version_ 1782365421590020096
author Terhorst, Jonathan
Schlötterer, Christian
Song, Yun S.
author_facet Terhorst, Jonathan
Schlötterer, Christian
Song, Yun S.
author_sort Terhorst, Jonathan
collection PubMed
description Genomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of modern experimental evolution data. To address this problem, we develop a Gaussian process approximation to the multi-locus Wright-Fisher process with selection over a time course of tens of generations. The mean and covariance structure of the Gaussian process are obtained by computing the corresponding moments in discrete-time Wright-Fisher models conditioned on the presence of a linked selected site. This enables our method to account for the effects of linkage and selection, both along the genome and across sampled time points, in an approximate but principled manner. We first use simulated data to demonstrate the power of our method to correctly detect, locate and estimate the fitness of a selected allele from among several linked sites. We study how this power changes for different values of selection strength, initial haplotypic diversity, population size, sampling frequency, experimental duration, number of replicates, and sequencing coverage depth. In addition to providing quantitative estimates of selection parameters from experimental evolution data, our model can be used by practitioners to design E&R experiments with requisite power. We also explore how our likelihood-based approach can be used to infer other model parameters, including effective population size and recombination rate. Then, we apply our method to analyze genome-wide data from a real E&R experiment designed to study the adaptation of D. melanogaster to a new laboratory environment with alternating cold and hot temperatures.
format Online
Article
Text
id pubmed-4388667
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43886672015-04-21 Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution Terhorst, Jonathan Schlötterer, Christian Song, Yun S. PLoS Genet Research Article Genomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of modern experimental evolution data. To address this problem, we develop a Gaussian process approximation to the multi-locus Wright-Fisher process with selection over a time course of tens of generations. The mean and covariance structure of the Gaussian process are obtained by computing the corresponding moments in discrete-time Wright-Fisher models conditioned on the presence of a linked selected site. This enables our method to account for the effects of linkage and selection, both along the genome and across sampled time points, in an approximate but principled manner. We first use simulated data to demonstrate the power of our method to correctly detect, locate and estimate the fitness of a selected allele from among several linked sites. We study how this power changes for different values of selection strength, initial haplotypic diversity, population size, sampling frequency, experimental duration, number of replicates, and sequencing coverage depth. In addition to providing quantitative estimates of selection parameters from experimental evolution data, our model can be used by practitioners to design E&R experiments with requisite power. We also explore how our likelihood-based approach can be used to infer other model parameters, including effective population size and recombination rate. Then, we apply our method to analyze genome-wide data from a real E&R experiment designed to study the adaptation of D. melanogaster to a new laboratory environment with alternating cold and hot temperatures. Public Library of Science 2015-04-07 /pmc/articles/PMC4388667/ /pubmed/25849855 http://dx.doi.org/10.1371/journal.pgen.1005069 Text en © 2015 Terhorst et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Terhorst, Jonathan
Schlötterer, Christian
Song, Yun S.
Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title_full Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title_fullStr Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title_full_unstemmed Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title_short Multi-locus Analysis of Genomic Time Series Data from Experimental Evolution
title_sort multi-locus analysis of genomic time series data from experimental evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4388667/
https://www.ncbi.nlm.nih.gov/pubmed/25849855
http://dx.doi.org/10.1371/journal.pgen.1005069
work_keys_str_mv AT terhorstjonathan multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution
AT schlottererchristian multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution
AT songyuns multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution