Cargando…

A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets

BACKGROUND: The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged. METHODOLOGY/PR...

Descripción completa

Detalles Bibliográficos
Autores principales: Gautier, Mathieu, Hocking, Toby Dylan, Foulley, Jean-Louis
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2914027/
https://www.ncbi.nlm.nih.gov/pubmed/20689851
http://dx.doi.org/10.1371/journal.pone.0011913
_version_ 1782184728079630336
author Gautier, Mathieu
Hocking, Toby Dylan
Foulley, Jean-Louis
author_facet Gautier, Mathieu
Hocking, Toby Dylan
Foulley, Jean-Louis
author_sort Gautier, Mathieu
collection PubMed
description BACKGROUND: The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged. METHODOLOGY/PRINCIPAL FINDINGS: The purpose of this study is to develop an efficient model-based approach to perform Bayesian exploratory analyses for adaptive differentiation in very large SNP data sets. The basic idea is to start with a very simple model for neutral loci that is easy to implement under a Bayesian framework and to identify selected loci as outliers via Posterior Predictive P-values (PPP-values). Applications of this strategy are considered using two different statistical models. The first one was initially interpreted in the context of populations evolving respectively under pure genetic drift from a common ancestral population while the second one relies on populations under migration-drift equilibrium. Robustness and power of the two resulting Bayesian model-based approaches to detect SNP under selection are further evaluated through extensive simulations. An application to a cattle data set is also provided. CONCLUSIONS/SIGNIFICANCE: The procedure described turns out to be much faster than former Bayesian approaches and also reasonably efficient especially to detect loci under positive selection.
format Text
id pubmed-2914027
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29140272010-08-04 A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets Gautier, Mathieu Hocking, Toby Dylan Foulley, Jean-Louis PLoS One Research Article BACKGROUND: The recent advent of high-throughput SNP genotyping technologies has opened new avenues of research for population genetics. In particular, a growing interest in the identification of footprints of selection, based on genome scans for adaptive differentiation, has emerged. METHODOLOGY/PRINCIPAL FINDINGS: The purpose of this study is to develop an efficient model-based approach to perform Bayesian exploratory analyses for adaptive differentiation in very large SNP data sets. The basic idea is to start with a very simple model for neutral loci that is easy to implement under a Bayesian framework and to identify selected loci as outliers via Posterior Predictive P-values (PPP-values). Applications of this strategy are considered using two different statistical models. The first one was initially interpreted in the context of populations evolving respectively under pure genetic drift from a common ancestral population while the second one relies on populations under migration-drift equilibrium. Robustness and power of the two resulting Bayesian model-based approaches to detect SNP under selection are further evaluated through extensive simulations. An application to a cattle data set is also provided. CONCLUSIONS/SIGNIFICANCE: The procedure described turns out to be much faster than former Bayesian approaches and also reasonably efficient especially to detect loci under positive selection. Public Library of Science 2010-08-02 /pmc/articles/PMC2914027/ /pubmed/20689851 http://dx.doi.org/10.1371/journal.pone.0011913 Text en Gautier et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gautier, Mathieu
Hocking, Toby Dylan
Foulley, Jean-Louis
A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title_full A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title_fullStr A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title_full_unstemmed A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title_short A Bayesian Outlier Criterion to Detect SNPs under Selection in Large Data Sets
title_sort bayesian outlier criterion to detect snps under selection in large data sets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2914027/
https://www.ncbi.nlm.nih.gov/pubmed/20689851
http://dx.doi.org/10.1371/journal.pone.0011913
work_keys_str_mv AT gautiermathieu abayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets
AT hockingtobydylan abayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets
AT foulleyjeanlouis abayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets
AT gautiermathieu bayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets
AT hockingtobydylan bayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets
AT foulleyjeanlouis bayesianoutliercriteriontodetectsnpsunderselectioninlargedatasets