Cargando…

Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization

High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient popu...

Descripción completa

Detalles Bibliográficos
Autores principales: Bansal, Vikas, Tewhey, Ryan, LeProust, Emily M., Schork, Nicholas J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3068187/
https://www.ncbi.nlm.nih.gov/pubmed/21479135
http://dx.doi.org/10.1371/journal.pone.0018353
_version_ 1782201204238974976
author Bansal, Vikas
Tewhey, Ryan
LeProust, Emily M.
Schork, Nicholas J.
author_facet Bansal, Vikas
Tewhey, Ryan
LeProust, Emily M.
Schork, Nicholas J.
author_sort Bansal, Vikas
collection PubMed
description High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient > =  0.995).
format Text
id pubmed-3068187
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30681872011-04-08 Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization Bansal, Vikas Tewhey, Ryan LeProust, Emily M. Schork, Nicholas J. PLoS One Research Article High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient > =  0.995). Public Library of Science 2011-03-30 /pmc/articles/PMC3068187/ /pubmed/21479135 http://dx.doi.org/10.1371/journal.pone.0018353 Text en Bansal et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bansal, Vikas
Tewhey, Ryan
LeProust, Emily M.
Schork, Nicholas J.
Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title_full Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title_fullStr Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title_full_unstemmed Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title_short Efficient and Cost Effective Population Resequencing by Pooling and In-Solution Hybridization
title_sort efficient and cost effective population resequencing by pooling and in-solution hybridization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3068187/
https://www.ncbi.nlm.nih.gov/pubmed/21479135
http://dx.doi.org/10.1371/journal.pone.0018353
work_keys_str_mv AT bansalvikas efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization
AT tewheyryan efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization
AT leproustemilym efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization
AT schorknicholasj efficientandcosteffectivepopulationresequencingbypoolingandinsolutionhybridization