Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of c...
Main Authors: | Vallania, Francesco; Ramos, Enrique; Cresci, Sharon; Mitra, Robi D.; Druley, Todd E. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MyJove Corporation 2012 |
Subjects: | Genetics |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471313/ https://www.ncbi.nlm.nih.gov/pubmed/22760212 http://dx.doi.org/10.3791/3943 |
_version_ | 1782246402510815232 |
---|---|
author | Vallania, Francesco Ramos, Enrique Cresci, Sharon Mitra, Robi D. Druley, Todd E. |
author_sort | Vallania, Francesco |
collection | PubMed |
description | As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease(4,5). Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis, which suggests that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants(6-8). However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, ideally compared with a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis remain prohibitive for many investigators. To address this need, we have developed a pooled sequencing approach(1,9) and a novel software package(1) for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides substantial cost and time savings over traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity in pools where the variant is as rare as 1 mutant allele in 500 individuals. 
Here we describe the method for preparing the pooled sequencing library, followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (http://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array genotyping, at over 20 kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample was excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA. |
format | Online Article Text |
id | pubmed-3471313 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | MyJove Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-34713132012-10-15 Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER Vallania, Francesco Ramos, Enrique Cresci, Sharon Mitra, Robi D. Druley, Todd E. J Vis Exp Genetics As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease(4,5). Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis, which suggests that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants(6-8). However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, ideally compared with a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis remain prohibitive for many investigators. To address this need, we have developed a pooled sequencing approach(1,9) and a novel software package(1) for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides substantial cost and time savings over traditional single-sample sequencing methodology. 
With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity in pools where the variant is as rare as 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library, followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (http://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array genotyping, at over 20 kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample was excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA. MyJove Corporation 2012-06-23 /pmc/articles/PMC3471313/ /pubmed/22760212 http://dx.doi.org/10.3791/3943 Text en Copyright © 2012, Journal of Visualized Experiments http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/ |
spellingShingle | Genetics Vallania, Francesco Ramos, Enrique Cresci, Sharon Mitra, Robi D. Druley, Todd E. Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER |
title | Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471313/ https://www.ncbi.nlm.nih.gov/pubmed/22760212 http://dx.doi.org/10.3791/3943 |
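
The description quotes a mean coverage of 25-fold per allele and detection of 1 mutant allele in 500 individuals. The following is a minimal sketch of the arithmetic those figures imply for the total read depth of a pooled library. It is not part of the SPLINTER package: the function name `pooled_depth` is hypothetical, and the calculation assumes diploid samples, equal representation of every individual in the pool, and a single mutant allele in the whole pool.

```python
# Back-of-the-envelope depth arithmetic for a pooled sequencing library.
# Assumptions (not taken from SPLINTER itself): diploid samples, every
# individual contributes equally to the pool, one mutant allele in the pool.

def pooled_depth(n_individuals, coverage_per_allele=25.0, ploidy=2):
    """Return (alleles in pool, total depth per base,
    frequency of a single mutant allele, expected reads carrying it)."""
    n_alleles = n_individuals * ploidy
    # Total depth needed at each base to reach the per-allele coverage.
    total_depth = coverage_per_allele * n_alleles
    # A variant present on exactly one allele in the pool.
    singleton_frequency = 1 / n_alleles
    # Reads expected to carry that variant at the target depth.
    expected_mutant_reads = total_depth / n_alleles
    return n_alleles, total_depth, singleton_frequency, expected_mutant_reads


if __name__ == "__main__":
    for n in (50, 500, 947):
        alleles, depth, freq, reads = pooled_depth(n)
        print(f"{n} individuals: {alleles} alleles, "
              f"{depth:.0f}x per base, singleton frequency {freq:.4%}, "
              f"about {reads:.0f} reads expected from the mutant allele")
```

Under these assumptions a pool of 500 individuals holds 1,000 alleles, so 25-fold coverage per allele corresponds to roughly 25,000-fold depth at each base, and a singleton variant sits at a frequency of 0.1%. This low frequency is why the abstract describes calibrating the caller with positive and negative amplicon controls.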