Cargando…

Gaussian process test for high-throughput sequencing time series: application to experimental evolution

Motivation: Recent advances in high-throughput sequencing (HTS) have made it possible to monitor genomes in great detail. New experiments not only use HTS to measure genomic features at one time point but also monitor them changing over time with the aim of identifying significant changes in their a...

Descripción completa

Detalles Bibliográficos
Autores principales: Topa, Hande, Jónás, Ágnes, Kofler, Robert, Kosiol, Carolin, Honkela, Antti
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4443671/
https://www.ncbi.nlm.nih.gov/pubmed/25614471
http://dx.doi.org/10.1093/bioinformatics/btv014
_version_ 1782373036147605504
author Topa, Hande
Jónás, Ágnes
Kofler, Robert
Kosiol, Carolin
Honkela, Antti
author_facet Topa, Hande
Jónás, Ágnes
Kofler, Robert
Kosiol, Carolin
Honkela, Antti
author_sort Topa, Hande
collection PubMed
description Motivation: Recent advances in high-throughput sequencing (HTS) have made it possible to monitor genomes in great detail. New experiments not only use HTS to measure genomic features at one time point but also monitor them changing over time with the aim of identifying significant changes in their abundance. In population genetics, for example, allele frequencies are monitored over time to detect significant frequency changes that indicate selection pressures. Previous attempts at analyzing data from HTS experiments have been limited as they could not simultaneously include data at intermediate time points, replicate experiments and sources of uncertainty specific to HTS such as sequencing depth. Results: We present the beta-binomial Gaussian process model for ranking features with significant non-random variation in abundance over time. The features are assumed to represent proportions, such as proportion of an alternative allele in a population. We use the beta-binomial model to capture the uncertainty arising from finite sequencing depth and combine it with a Gaussian process model over the time series. In simulations that mimic the features of experimental evolution data, the proposed method clearly outperforms classical testing in average precision of finding selected alleles. We also present simulations exploring different experimental design choices and results on real data from Drosophila experimental evolution experiment in temperature adaptation. Availability and implementation: R software implementing the test is available at https://github.com/handetopa/BBGP. Contact: hande.topa@aalto.fi, agnes.jonas@vetmeduni.ac.at, carolin.kosiol@vetmeduni.ac.at, antti.honkela@hiit.fi Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4443671
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44436712015-06-05 Gaussian process test for high-throughput sequencing time series: application to experimental evolution Topa, Hande Jónás, Ágnes Kofler, Robert Kosiol, Carolin Honkela, Antti Bioinformatics Original Papers Motivation: Recent advances in high-throughput sequencing (HTS) have made it possible to monitor genomes in great detail. New experiments not only use HTS to measure genomic features at one time point but also monitor them changing over time with the aim of identifying significant changes in their abundance. In population genetics, for example, allele frequencies are monitored over time to detect significant frequency changes that indicate selection pressures. Previous attempts at analyzing data from HTS experiments have been limited as they could not simultaneously include data at intermediate time points, replicate experiments and sources of uncertainty specific to HTS such as sequencing depth. Results: We present the beta-binomial Gaussian process model for ranking features with significant non-random variation in abundance over time. The features are assumed to represent proportions, such as proportion of an alternative allele in a population. We use the beta-binomial model to capture the uncertainty arising from finite sequencing depth and combine it with a Gaussian process model over the time series. In simulations that mimic the features of experimental evolution data, the proposed method clearly outperforms classical testing in average precision of finding selected alleles. We also present simulations exploring different experimental design choices and results on real data from Drosophila experimental evolution experiment in temperature adaptation. Availability and implementation: R software implementing the test is available at https://github.com/handetopa/BBGP. Contact: hande.topa@aalto.fi, agnes.jonas@vetmeduni.ac.at, carolin.kosiol@vetmeduni.ac.at, antti.honkela@hiit.fi Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-06-01 2015-01-21 /pmc/articles/PMC4443671/ /pubmed/25614471 http://dx.doi.org/10.1093/bioinformatics/btv014 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Topa, Hande
Jónás, Ágnes
Kofler, Robert
Kosiol, Carolin
Honkela, Antti
Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title_full Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title_fullStr Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title_full_unstemmed Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title_short Gaussian process test for high-throughput sequencing time series: application to experimental evolution
title_sort gaussian process test for high-throughput sequencing time series: application to experimental evolution
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4443671/
https://www.ncbi.nlm.nih.gov/pubmed/25614471
http://dx.doi.org/10.1093/bioinformatics/btv014
work_keys_str_mv AT topahande gaussianprocesstestforhighthroughputsequencingtimeseriesapplicationtoexperimentalevolution
AT jonasagnes gaussianprocesstestforhighthroughputsequencingtimeseriesapplicationtoexperimentalevolution
AT koflerrobert gaussianprocesstestforhighthroughputsequencingtimeseriesapplicationtoexperimentalevolution
AT kosiolcarolin gaussianprocesstestforhighthroughputsequencingtimeseriesapplicationtoexperimentalevolution
AT honkelaantti gaussianprocesstestforhighthroughputsequencingtimeseriesapplicationtoexperimentalevolution