Cargando…

A genome-wide approach for detecting novel insertion-deletion variants of mid-range size

We present SWAN, a statistical framework for robust detection of genomic structural variants in next-generation sequencing data and an analysis of mid-range size insertion and deletions (<10 Kb) for whole genome analysis and DNA mixtures. To identify these mid-range size events, SWAN collectively...

Descripción completa

Detalles Bibliográficos
Autores principales: Xia, Li C., Sakshuwong, Sukolsak, Hopmans, Erik S., Bell, John M., Grimes, Susan M., Siegmund, David O., Ji, Hanlee P., Zhang, Nancy R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5009736/
https://www.ncbi.nlm.nih.gov/pubmed/27325742
http://dx.doi.org/10.1093/nar/gkw481
Descripción
Sumario:We present SWAN, a statistical framework for robust detection of genomic structural variants in next-generation sequencing data and an analysis of mid-range size insertion and deletions (<10 Kb) for whole genome analysis and DNA mixtures. To identify these mid-range size events, SWAN collectively uses information from read-pair, read-depth and one end mapped reads through statistical likelihoods based on Poisson field models. SWAN also uses soft-clip/split read remapping to supplement the likelihood analysis and determine variant boundaries. The accuracy of SWAN is demonstrated by in silico spike-ins and by identification of known variants in the NA12878 genome. We used SWAN to identify a series of novel set of mid-range insertion/deletion detection that were confirmed by targeted deep re-sequencing. An R package implementation of SWAN is open source and freely available.