Cargando…

IDENTIFICATION OF GENETIC VARIANTS USING BARCODED MULTIPLEXED SEQUENCING

We developed a generalized framework for multiplexed resequencing of targeted regions of the human genome on the Illumina Genome Analyzer using degenerate indexed DNA sequence barcodes ligated to fragmented DNA prior to sequencing. Using this method, the DNA of multiple HapMap individuals was simult...

Descripción completa

Detalles Bibliográficos
Autores principales: Craig, David W., Pearson, John V., Szelinger, Szabolcs, Sekar, Aswin, Margot, Redma, Corneveaux, Jason J., Pawlowski, Traci L., Laub, Trisha, Nunn, Gary, Stephan, Dietrich A., Homer, Nils, Huentelman, Matthew J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171277/
https://www.ncbi.nlm.nih.gov/pubmed/18794863
http://dx.doi.org/10.1038/nmeth.1251
Descripción
Sumario:We developed a generalized framework for multiplexed resequencing of targeted regions of the human genome on the Illumina Genome Analyzer using degenerate indexed DNA sequence barcodes ligated to fragmented DNA prior to sequencing. Using this method, the DNA of multiple HapMap individuals was simultaneously sequenced at several ENCODE (ENCyclopedia of DNA Elements) regions. We then evaluated the use of Bayes factors for discovering and genotyping polymorphisms from aligned sequenced reads. If we required that predicted polymorphisms be either previously identified by dbSNP or be visually evident upon reinspection of archived ENCODE traces, we observed a false-positive rate of 11.3% using strict thresholds (K(s)>1,000) for predicting variants and 69.6% for lax thresholds (K(s)>10). Conversely, false-negative rates ranged from 10.8% to 90.8%, with those at stricter cut-offs occurring at lower coverage (< 10 aligned reads). These results suggest that >90% of genetic variants are discoverable using multiplexed sequencing provided sufficient coverage at the polymorphic base.