Cargando…
Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species
Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequenc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820589/ https://www.ncbi.nlm.nih.gov/pubmed/24244686 http://dx.doi.org/10.1371/journal.pone.0080422 |
_version_ | 1782290164488339456 |
---|---|
author | Rellstab, Christian Zoller, Stefan Tedder, Andrew Gugerli, Felix Fischer, Martin C. |
author_facet | Rellstab, Christian Zoller, Stefan Tedder, Andrew Gugerli, Felix Fischer, Martin C. |
author_sort | Rellstab, Christian |
collection | PubMed |
description | Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequencies determined by Pool-Seq has been attempted using an individual genotyping approach, but these studies tend to use samples from existing model organism databases or DNA stores, and do not validate a realistic setup for sampling natural populations. Here we used pyrosequencing to validate allele frequencies determined by Pool-Seq in three natural populations of Arabidopsis halleri (Brassicaceae). The allele frequency estimates of the pooled population samples (consisting of 20 individual plant DNA samples) were determined after mapping Illumina reads to (i) the publicly available, high-quality reference genome of a closely related species (Arabidopsis thaliana) and (ii) our own de novo draft genome assembly of A. halleri. We then pyrosequenced nine selected SNPs using the same individuals from each population, resulting in a total of 540 samples. Our results show a highly significant and accurate relationship between pooled and individually determined allele frequencies, irrespective of the reference genome used. Allele frequencies differed on average by less than 4%. There was no tendency that either the Pool-Seq or the individual-based approach resulted in higher or lower estimates of allele frequencies. Moreover, the rather high coverage in the mapping to the two reference genomes, ranging from 55 to 284x, had no significant effect on the accuracy of the Pool-Seq. A resampling analysis showed that only very low coverage values (below 10-20x) would substantially reduce the precision of the method. We therefore conclude that a pooled re-sequencing approach is well suited for analyses of genetic variation in natural populations. |
format | Online Article Text |
id | pubmed-3820589 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38205892013-11-15 Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species Rellstab, Christian Zoller, Stefan Tedder, Andrew Gugerli, Felix Fischer, Martin C. PLoS One Research Article Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequencies determined by Pool-Seq has been attempted using an individual genotyping approach, but these studies tend to use samples from existing model organism databases or DNA stores, and do not validate a realistic setup for sampling natural populations. Here we used pyrosequencing to validate allele frequencies determined by Pool-Seq in three natural populations of Arabidopsis halleri (Brassicaceae). The allele frequency estimates of the pooled population samples (consisting of 20 individual plant DNA samples) were determined after mapping Illumina reads to (i) the publicly available, high-quality reference genome of a closely related species (Arabidopsis thaliana) and (ii) our own de novo draft genome assembly of A. halleri. We then pyrosequenced nine selected SNPs using the same individuals from each population, resulting in a total of 540 samples. Our results show a highly significant and accurate relationship between pooled and individually determined allele frequencies, irrespective of the reference genome used. Allele frequencies differed on average by less than 4%. There was no tendency that either the Pool-Seq or the individual-based approach resulted in higher or lower estimates of allele frequencies. Moreover, the rather high coverage in the mapping to the two reference genomes, ranging from 55 to 284x, had no significant effect on the accuracy of the Pool-Seq. A resampling analysis showed that only very low coverage values (below 10-20x) would substantially reduce the precision of the method. We therefore conclude that a pooled re-sequencing approach is well suited for analyses of genetic variation in natural populations. Public Library of Science 2013-11-07 /pmc/articles/PMC3820589/ /pubmed/24244686 http://dx.doi.org/10.1371/journal.pone.0080422 Text en © 2013 Rellstab et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Rellstab, Christian Zoller, Stefan Tedder, Andrew Gugerli, Felix Fischer, Martin C. Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title | Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title_full | Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title_fullStr | Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title_full_unstemmed | Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title_short | Validation of SNP Allele Frequencies Determined by Pooled Next-Generation Sequencing in Natural Populations of a Non-Model Plant Species |
title_sort | validation of snp allele frequencies determined by pooled next-generation sequencing in natural populations of a non-model plant species |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820589/ https://www.ncbi.nlm.nih.gov/pubmed/24244686 http://dx.doi.org/10.1371/journal.pone.0080422 |
work_keys_str_mv | AT rellstabchristian validationofsnpallelefrequenciesdeterminedbypoolednextgenerationsequencinginnaturalpopulationsofanonmodelplantspecies AT zollerstefan validationofsnpallelefrequenciesdeterminedbypoolednextgenerationsequencinginnaturalpopulationsofanonmodelplantspecies AT tedderandrew validationofsnpallelefrequenciesdeterminedbypoolednextgenerationsequencinginnaturalpopulationsofanonmodelplantspecies AT gugerlifelix validationofsnpallelefrequenciesdeterminedbypoolednextgenerationsequencinginnaturalpopulationsofanonmodelplantspecies AT fischermartinc validationofsnpallelefrequenciesdeterminedbypoolednextgenerationsequencinginnaturalpopulationsofanonmodelplantspecies |