Cargando…

Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster

The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of lar...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yuan, Bergland, Alan O., González, Josefa, Petrov, Dmitri A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406057/
https://www.ncbi.nlm.nih.gov/pubmed/22848651
http://dx.doi.org/10.1371/journal.pone.0041901
_version_ 1782239197549035520
author Zhu, Yuan
Bergland, Alan O.
González, Josefa
Petrov, Dmitri A.
author_facet Zhu, Yuan
Bergland, Alan O.
González, Josefa
Petrov, Dmitri A.
author_sort Zhu, Yuan
collection PubMed
description The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive.
format Online
Article
Text
id pubmed-3406057
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34060572012-07-30 Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster Zhu, Yuan Bergland, Alan O. González, Josefa Petrov, Dmitri A. PLoS One Research Article The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive. Public Library of Science 2012-07-26 /pmc/articles/PMC3406057/ /pubmed/22848651 http://dx.doi.org/10.1371/journal.pone.0041901 Text en © 2012 Zhu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhu, Yuan
Bergland, Alan O.
González, Josefa
Petrov, Dmitri A.
Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title_full Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title_fullStr Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title_full_unstemmed Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title_short Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
title_sort empirical validation of pooled whole genome population re-sequencing in drosophila melanogaster
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406057/
https://www.ncbi.nlm.nih.gov/pubmed/22848651
http://dx.doi.org/10.1371/journal.pone.0041901
work_keys_str_mv AT zhuyuan empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster
AT berglandalano empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster
AT gonzalezjosefa empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster
AT petrovdmitria empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster