
Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species

Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequenc...


Bibliographic Details
Main authors: Rellstab, Christian; Zoller, Stefan; Tedder, Andrew; Gugerli, Felix; Fischer, Martin C.
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820589/
https://www.ncbi.nlm.nih.gov/pubmed/24244686
http://dx.doi.org/10.1371/journal.pone.0080422
author Rellstab, Christian
Zoller, Stefan
Tedder, Andrew
Gugerli, Felix
Fischer, Martin C.
collection PubMed
description Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequencies determined by Pool-Seq has been attempted using an individual genotyping approach, but these studies tend to use samples from existing model organism databases or DNA stores, and do not validate a realistic setup for sampling natural populations. Here we used pyrosequencing to validate allele frequencies determined by Pool-Seq in three natural populations of Arabidopsis halleri (Brassicaceae). The allele frequency estimates of the pooled population samples (consisting of 20 individual plant DNA samples) were determined after mapping Illumina reads to (i) the publicly available, high-quality reference genome of a closely related species (Arabidopsis thaliana) and (ii) our own de novo draft genome assembly of A. halleri. We then pyrosequenced nine selected SNPs using the same individuals from each population, resulting in a total of 540 samples. Our results show a highly significant and accurate relationship between pooled and individually determined allele frequencies, irrespective of the reference genome used. Allele frequencies differed on average by less than 4%. There was no tendency that either the Pool-Seq or the individual-based approach resulted in higher or lower estimates of allele frequencies. Moreover, the rather high coverage in the mapping to the two reference genomes, ranging from 55 to 284x, had no significant effect on the accuracy of the Pool-Seq. A resampling analysis showed that only very low coverage values (below 10-20x) would substantially reduce the precision of the method. We therefore conclude that a pooled re-sequencing approach is well suited for analyses of genetic variation in natural populations.
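The effect of coverage on precision described in the abstract can be illustrated with a small simulation. The following is only a minimal sketch under stated assumptions, not the authors' resampling procedure: it assumes each read at a SNP is an independent draw from the pool (binomial sampling), and the function names, the true frequency of 0.25, and the replicate count are hypothetical choices made for illustration. The coverage values 55 and 284 are the range reported in the abstract.

```python
import random
import statistics


def pool_allele_frequency(alt_reads: int, total_reads: int) -> float:
    """Read-based estimate: fraction of reads carrying the alternative allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def resample_error(true_freq: float, coverage: int,
                   n_reps: int = 2000, seed: int = 1) -> float:
    """Mean absolute error of the read-based estimate at a given coverage.

    Assumption (not from the source): reads are independent draws with
    probability `true_freq` of carrying the alternative allele.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs give the same result
    errors = []
    for _ in range(n_reps):
        # Draw `coverage` reads and count those with the alternative allele.
        alt = sum(rng.random() < true_freq for _ in range(coverage))
        errors.append(abs(pool_allele_frequency(alt, coverage) - true_freq))
    return statistics.fmean(errors)


if __name__ == "__main__":
    # 0.25 is a hypothetical allele frequency chosen for illustration.
    for cov in (5, 10, 20, 55, 284):
        print(f"coverage {cov:>3}x: mean abs. error {resample_error(0.25, cov):.3f}")
```

Under this simplified model the sampling error shrinks roughly with the square root of the coverage, which is in line with the abstract's observation that only very low coverage substantially reduces precision. The sketch ignores other error sources such as unequal DNA contributions of individuals to the pool, sequencing errors, and mapping bias.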
format Online
Article
Text
id pubmed-3820589
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3820589
2013-11-15
PLoS One
Research Article
Public Library of Science
2013-11-07
/pmc/articles/PMC3820589/
/pubmed/24244686
http://dx.doi.org/10.1371/journal.pone.0080422
Text
en
© 2013 Rellstab et al
http://creativecommons.org/licenses/by/4.0/
This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820589/
https://www.ncbi.nlm.nih.gov/pubmed/24244686
http://dx.doi.org/10.1371/journal.pone.0080422