Cargando…
Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient popu...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3068187/ https://www.ncbi.nlm.nih.gov/pubmed/21479135 http://dx.doi.org/10.1371/journal.pone.0018353 |
_version_ | 1782201204238974976 |
---|---|
author | Bansal, Vikas Tewhey, Ryan LeProust, Emily M. Schork, Nicholas J. |
author_facet | Bansal, Vikas Tewhey, Ryan LeProust, Emily M. Schork, Nicholas J. |
author_sort | Bansal, Vikas |
collection | PubMed |
description | High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient > = 0.995). |
format | Text |
id | pubmed-3068187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-30681872011-04-08 Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization Bansal, Vikas Tewhey, Ryan LeProust, Emily M. Schork, Nicholas J. PLoS One Research Article High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient > = 0.995). Public Library of Science 2011-03-30 /pmc/articles/PMC3068187/ /pubmed/21479135 http://dx.doi.org/10.1371/journal.pone.0018353 Text en Bansal et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Bansal, Vikas Tewhey, Ryan LeProust, Emily M. Schork, Nicholas J. Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title | Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title_full | Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title_fullStr | Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title_full_unstemmed | Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title_short | Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization |
title_sort | efficient and cost effective population resequencing by pooling and in-solution hybridization |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3068187/ https://www.ncbi.nlm.nih.gov/pubmed/21479135 http://dx.doi.org/10.1371/journal.pone.0018353 |
work_keys_str_mv | AT bansalvikas efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization AT tewheyryan efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization AT leproustemilym efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization AT schorknicholasj efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization |