Cargando…
Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference
Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explici...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033855/ https://www.ncbi.nlm.nih.gov/pubmed/29977040 http://dx.doi.org/10.1038/s41598-018-28539-y |
_version_ | 1783337755220115456 |
---|---|
author | Quinto-Cortés, Consuelo D. Woerner, August E. Watkins, Joseph C. Hammer, Michael F. |
author_facet | Quinto-Cortés, Consuelo D. Woerner, August E. Watkins, Joseph C. Hammer, Michael F. |
author_sort | Quinto-Cortés, Consuelo D. |
collection | PubMed |
description | Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explicitly models the SNP discovery protocol in an Approximate Bayesian Computation (ABC) framework. We simulated genomic regions according to a demographic model incorporating parameters for the divergence of three well-characterized HapMap populations and recreated the SNP distribution of a commercial array by varying the number of haploid samples and the allele frequency cut-off in the given regions. We then calculated summary statistics obtained from both the ascertained and genomic data and inferred ascertainment and demographic parameters. We implemented our pipeline to study the admixture process that gave rise to the present-day Mexican population. Our estimate of the time of admixture is closer to the historical dates than those in previous works which did not consider ascertainment bias. Although the use of whole genome sequences for demographic inference is becoming the norm, there are still underrepresented areas of the world from where only SNP array data are available. Our inference framework is applicable to those cases and will help with the demographic inference. |
format | Online Article Text |
id | pubmed-6033855 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-60338552018-07-12 Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference Quinto-Cortés, Consuelo D. Woerner, August E. Watkins, Joseph C. Hammer, Michael F. Sci Rep Article Single nucleotide polymorphisms (SNPs) in commercial arrays have often been discovered in a small number of samples from selected populations. This ascertainment skews patterns of nucleotide diversity and affects population genetic inferences. We propose a demographic inference pipeline that explicitly models the SNP discovery protocol in an Approximate Bayesian Computation (ABC) framework. We simulated genomic regions according to a demographic model incorporating parameters for the divergence of three well-characterized HapMap populations and recreated the SNP distribution of a commercial array by varying the number of haploid samples and the allele frequency cut-off in the given regions. We then calculated summary statistics obtained from both the ascertained and genomic data and inferred ascertainment and demographic parameters. We implemented our pipeline to study the admixture process that gave rise to the present-day Mexican population. Our estimate of the time of admixture is closer to the historical dates than those in previous works which did not consider ascertainment bias. Although the use of whole genome sequences for demographic inference is becoming the norm, there are still underrepresented areas of the world from where only SNP array data are available. Our inference framework is applicable to those cases and will help with the demographic inference. Nature Publishing Group UK 2018-07-05 /pmc/articles/PMC6033855/ /pubmed/29977040 http://dx.doi.org/10.1038/s41598-018-28539-y Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Quinto-Cortés, Consuelo D. Woerner, August E. Watkins, Joseph C. Hammer, Michael F. Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title | Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title_full | Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title_fullStr | Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title_full_unstemmed | Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title_short | Modeling SNP array ascertainment with Approximate Bayesian Computation for demographic inference |
title_sort | modeling snp array ascertainment with approximate bayesian computation for demographic inference |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6033855/ https://www.ncbi.nlm.nih.gov/pubmed/29977040 http://dx.doi.org/10.1038/s41598-018-28539-y |
work_keys_str_mv | AT quintocortesconsuelod modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference AT woernerauguste modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference AT watkinsjosephc modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference AT hammermichaelf modelingsnparrayascertainmentwithapproximatebayesiancomputationfordemographicinference |