Cargando…

Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference

Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explici...

Descripción completa

Detalles Bibliográficos
Autores principales: Quinto-Cortés, Consuelo D., Woerner, August E., Watkins, Joseph C., Hammer, Michael F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033855/
https://www.ncbi.nlm.nih.gov/pubmed/29977040
http://dx.doi.org/10.1038/s41598-018-28539-y
_version_ 1783337755220115456
author Quinto-Cortés, Consuelo D.
Woerner, August E.
Watkins, Joseph C.
Hammer, Michael F.
author_facet Quinto-Cortés, Consuelo D.
Woerner, August E.
Watkins, Joseph C.
Hammer, Michael F.
author_sort Quinto-Cortés, Consuelo D.
collection PubMed
description Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explicitly models the SNP discovery protocol in an Approximate Bayesian Computation (ABC) framework. We simulated genomic regions according to a demographic model incorporating parameters for the divergence of three well-characterized HapMap populations and recreated the SNP distribution of a commercial array by varying the number of haploid samples and the allele frequency cut-off in the given regions. We then calculated summary statistics obtained from both the ascertained and genomic data and inferred ascertainment and demographic parameters. We implemented our pipeline to study the admixture process that gave rise to the present-day Mexican population. Our estimate of the time of admixture is closer to the historical dates than those in previous works which did not consider ascertainment bias. Although the use of whole genome sequences for demographic inference is becoming the norm, there are still underrepresented areas of the world from where only SNP array data are available. Our inference framework is applicable to those cases and will help with the demographic inference.
format Online
Article
Text
id pubmed-6033855
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60338552018-07-12 Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference Quinto-Cortés, Consuelo D. Woerner, August E. Watkins, Joseph C. Hammer, Michael F. Sci Rep Article Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explicitly models the SNP discovery protocol in an Approximate Bayesian Computation (ABC) framework. We simulated genomic regions according to a demographic model incorporating parameters for the divergence of three well-characterized HapMap populations and recreated the SNP distribution of a commercial array by varying the number of haploid samples and the allele frequency cut-off in the given regions. We then calculated summary statistics obtained from both the ascertained and genomic data and inferred ascertainment and demographic parameters. We implemented our pipeline to study the admixture process that gave rise to the present-day Mexican population. Our estimate of the time of admixture is closer to the historical dates than those in previous works which did not consider ascertainment bias. Although the use of whole genome sequences for demographic inference is becoming the norm, there are still underrepresented areas of the world from where only SNP array data are available. Our inference framework is applicable to those cases and will help with the demographic inference. Nature Publishing Group UK 2018-07-05 /pmc/articles/PMC6033855/ /pubmed/29977040 http://dx.doi.org/10.1038/s41598-018-28539-y Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Quinto-Cortés, Consuelo D.
Woerner, August E.
Watkins, Joseph C.
Hammer, Michael F.
Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title_full Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title_fullStr Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title_full_unstemmed Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title_short Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
title_sort modeling snp array ascertainment with approximate bayesian computation for demographic inference
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033855/
https://www.ncbi.nlm.nih.gov/pubmed/29977040
http://dx.doi.org/10.1038/s41598-018-28539-y
work_keys_str_mv AT quintocortesconsuelod modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference
AT woernerauguste modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference
AT watkinsjosephc modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference
AT hammermichaelf modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference