Cargando…
Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of lar...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406057/ https://www.ncbi.nlm.nih.gov/pubmed/22848651 http://dx.doi.org/10.1371/journal.pone.0041901 |
_version_ | 1782239197549035520 |
---|---|
author | Zhu, Yuan Bergland, Alan O. González, Josefa Petrov, Dmitri A. |
author_facet | Zhu, Yuan Bergland, Alan O. González, Josefa Petrov, Dmitri A. |
author_sort | Zhu, Yuan |
collection | PubMed |
description | The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive. |
format | Online Article Text |
id | pubmed-3406057 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-34060572012-07-30 Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster Zhu, Yuan Bergland, Alan O. González, Josefa Petrov, Dmitri A. PLoS One Research Article The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive. Public Library of Science 2012-07-26 /pmc/articles/PMC3406057/ /pubmed/22848651 http://dx.doi.org/10.1371/journal.pone.0041901 Text en © 2012 Zhu et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zhu, Yuan Bergland, Alan O. González, Josefa Petrov, Dmitri A. Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster |
title | Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
|
title_full | Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
|
title_fullStr | Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
|
title_full_unstemmed | Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
|
title_short | Empirical Validation of Pooled Whole Genome Population Re-Sequencing in Drosophila melanogaster
|
title_sort | empirical validation of pooled whole genome population re-sequencing in drosophila melanogaster |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3406057/ https://www.ncbi.nlm.nih.gov/pubmed/22848651 http://dx.doi.org/10.1371/journal.pone.0041901 |
work_keys_str_mv | AT zhuyuan empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster AT berglandalano empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster AT gonzalezjosefa empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster AT petrovdmitria empiricalvalidationofpooledwholegenomepopulationresequencingindrosophilamelanogaster |