Cargando…

ABC inference of multi-population divergence with admixture from unphased population genomic data

Rapidly developing sequencing technologies and declining costs have made it possible to collect genome-scale data from population-level samples in nonmodel systems. Inferential tools for historical demography given these data sets are, at present, underdeveloped. In particular, approximate Bayesian...

Descripción completa

Detalles Bibliográficos
Autores principales: Robinson, John D, Bunnefeld, Lynsey, Hearn, Jack, Stone, Graham N, Hickerson, Michael J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285295/
https://www.ncbi.nlm.nih.gov/pubmed/25113024
http://dx.doi.org/10.1111/mec.12881
_version_ 1782351565025181696
author Robinson, John D
Bunnefeld, Lynsey
Hearn, Jack
Stone, Graham N
Hickerson, Michael J
author_facet Robinson, John D
Bunnefeld, Lynsey
Hearn, Jack
Stone, Graham N
Hickerson, Michael J
author_sort Robinson, John D
collection PubMed
description Rapidly developing sequencing technologies and declining costs have made it possible to collect genome-scale data from population-level samples in nonmodel systems. Inferential tools for historical demography given these data sets are, at present, underdeveloped. In particular, approximate Bayesian computation (ABC) has yet to be widely embraced by researchers generating these data. Here, we demonstrate the promise of ABC for analysis of the large data sets that are now attainable from nonmodel taxa through current genomic sequencing technologies. We develop and test an ABC framework for model selection and parameter estimation, given histories of three-population divergence with admixture. We then explore different sampling regimes to illustrate how sampling more loci, longer loci or more individuals affects the quality of model selection and parameter estimation in this ABC framework. Our results show that inferences improved substantially with increases in the number and/or length of sequenced loci, while less benefit was gained by sampling large numbers of individuals. Optimal sampling strategies given our inferential models included at least 2000 loci, each approximately 2 kb in length, sampled from five diploid individuals per population, although specific strategies are model and question dependent. We tested our ABC approach through simulation-based cross-validations and illustrate its application using previously analysed data from the oak gall wasp, Biorhiza pallida.
format Online
Article
Text
id pubmed-4285295
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-42852952015-01-26 ABC inference of multi-population divergence with admixture from unphased population genomic data Robinson, John D Bunnefeld, Lynsey Hearn, Jack Stone, Graham N Hickerson, Michael J Mol Ecol Original Articles Rapidly developing sequencing technologies and declining costs have made it possible to collect genome-scale data from population-level samples in nonmodel systems. Inferential tools for historical demography given these data sets are, at present, underdeveloped. In particular, approximate Bayesian computation (ABC) has yet to be widely embraced by researchers generating these data. Here, we demonstrate the promise of ABC for analysis of the large data sets that are now attainable from nonmodel taxa through current genomic sequencing technologies. We develop and test an ABC framework for model selection and parameter estimation, given histories of three-population divergence with admixture. We then explore different sampling regimes to illustrate how sampling more loci, longer loci or more individuals affects the quality of model selection and parameter estimation in this ABC framework. Our results show that inferences improved substantially with increases in the number and/or length of sequenced loci, while less benefit was gained by sampling large numbers of individuals. Optimal sampling strategies given our inferential models included at least 2000 loci, each approximately 2 kb in length, sampled from five diploid individuals per population, although specific strategies are model and question dependent. We tested our ABC approach through simulation-based cross-validations and illustrate its application using previously analysed data from the oak gall wasp, Biorhiza pallida. BlackWell Publishing Ltd 2014-09 2014-09-06 /pmc/articles/PMC4285295/ /pubmed/25113024 http://dx.doi.org/10.1111/mec.12881 Text en © 2014 The Authors. Molecular Ecology published by John Wiley & Sons Ltd. http://creativecommons.org/licenses/by/3.0/ This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
Robinson, John D
Bunnefeld, Lynsey
Hearn, Jack
Stone, Graham N
Hickerson, Michael J
ABC inference of multi-population divergence with admixture from unphased population genomic data
title ABC inference of multi-population divergence with admixture from unphased population genomic data
title_full ABC inference of multi-population divergence with admixture from unphased population genomic data
title_fullStr ABC inference of multi-population divergence with admixture from unphased population genomic data
title_full_unstemmed ABC inference of multi-population divergence with admixture from unphased population genomic data
title_short ABC inference of multi-population divergence with admixture from unphased population genomic data
title_sort abc inference of multi-population divergence with admixture from unphased population genomic data
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4285295/
https://www.ncbi.nlm.nih.gov/pubmed/25113024
http://dx.doi.org/10.1111/mec.12881
work_keys_str_mv AT robinsonjohnd abcinferenceofmultipopulationdivergencewithadmixturefromunphasedpopulationgenomicdata
AT bunnefeldlynsey abcinferenceofmultipopulationdivergencewithadmixturefromunphasedpopulationgenomicdata
AT hearnjack abcinferenceofmultipopulationdivergencewithadmixturefromunphasedpopulationgenomicdata
AT stonegrahamn abcinferenceofmultipopulationdivergencewithadmixturefromunphasedpopulationgenomicdata
AT hickersonmichaelj abcinferenceofmultipopulationdivergencewithadmixturefromunphasedpopulationgenomicdata