Cargando…

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of c...

Descripción completa

Detalles Bibliográficos
Autores principales: Vallania, Francesco, Ramos, Enrique, Cresci, Sharon, Mitra, Robi D., Druley, Todd E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MyJove Corporation 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471313/
https://www.ncbi.nlm.nih.gov/pubmed/22760212
http://dx.doi.org/10.3791/3943
_version_ 1782246402510815232
author Vallania, Francesco
Ramos, Enrique
Cresci, Sharon
Mitra, Robi D.
Druley, Todd E.
author_facet Vallania, Francesco
Ramos, Enrique
Cresci, Sharon
Mitra, Robi D.
Druley, Todd E.
author_sort Vallania, Francesco
collection PubMed
description As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease(4,5). Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants(6-8). However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators. To address this need, we have developed a pooled sequencing approach(1,9) and a novel software package(1) for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (http://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA.
format Online
Article
Text
id pubmed-3471313
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher MyJove Corporation
record_format MEDLINE/PubMed
spelling pubmed-34713132012-10-15 Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER Vallania, Francesco Ramos, Enrique Cresci, Sharon Mitra, Robi D. Druley, Todd E. J Vis Exp Genetics As DNA sequencing technology has markedly advanced in recent years(2), it has become increasingly evident that the amount of genetic variation between any two individuals is greater than previously thought(3). In contrast, array-based genotyping has failed to identify a significant contribution of common sequence variants to the phenotypic variability of common disease(4,5). Taken together, these observations have led to the evolution of the Common Disease / Rare Variant hypothesis suggesting that the majority of the "missing heritability" in common and complex phenotypes is instead due to an individual's personal profile of rare or private DNA variants(6-8). However, characterizing how rare variation impacts complex phenotypes requires the analysis of many affected individuals at many genomic loci, and is ideally compared to a similar survey in an unaffected cohort. Despite the sequencing power offered by today's platforms, a population-based survey of many genomic loci and the subsequent computational analysis required remains prohibitive for many investigators. To address this need, we have developed a pooled sequencing approach(1,9) and a novel software package(1) for highly accurate rare variant detection from the resulting data. The ability to pool genomes from entire populations of affected individuals and survey the degree of genetic variation at multiple targeted regions in a single sequencing library provides excellent cost and time savings to traditional single-sample sequencing methodology. With a mean sequencing coverage per allele of 25-fold, our custom algorithm, SPLINTER, uses an internal variant calling control strategy to call insertions, deletions and substitutions up to four base pairs in length with high sensitivity and specificity from pools of up to 1 mutant allele in 500 individuals. Here we describe the method for preparing the pooled sequencing library followed by step-by-step instructions on how to use the SPLINTER package for pooled sequencing analysis (http://www.ibridgenetwork.org/wustl/splinter). We show a comparison between pooled sequencing of 947 individuals, all of whom also underwent genome-wide array, at over 20kb of sequencing per person. Concordance between genotyping of tagged and novel variants called in the pooled sample were excellent. This method can be easily scaled up to any number of genomic loci and any number of individuals. By incorporating the internal positive and negative amplicon controls at ratios that mimic the population under study, the algorithm can be calibrated for optimal performance. This strategy can also be modified for use with hybridization capture or individual-specific barcodes and can be applied to the sequencing of naturally heterogeneous samples, such as tumor DNA. MyJove Corporation 2012-06-23 /pmc/articles/PMC3471313/ /pubmed/22760212 http://dx.doi.org/10.3791/3943 Text en Copyright © 2012, Journal of Visualized Experiments http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visithttp://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Genetics
Vallania, Francesco
Ramos, Enrique
Cresci, Sharon
Mitra, Robi D.
Druley, Todd E.
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title_full Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title_fullStr Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title_full_unstemmed Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title_short Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
title_sort detection of rare genomic variants from pooled sequencing using splinter
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3471313/
https://www.ncbi.nlm.nih.gov/pubmed/22760212
http://dx.doi.org/10.3791/3943
work_keys_str_mv AT vallaniafrancesco detectionofraregenomicvariantsfrompooledsequencingusingsplinter
AT ramosenrique detectionofraregenomicvariantsfrompooledsequencingusingsplinter
AT crescisharon detectionofraregenomicvariantsfrompooledsequencingusingsplinter
AT mitrarobid detectionofraregenomicvariantsfrompooledsequencingusingsplinter
AT druleytodde detectionofraregenomicvariantsfrompooledsequencingusingsplinter